UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Call for Paper
Volume 11 | Issue 5 | May 2024

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 10 Issue 7
July-2023
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2307244


Registration ID:
520930

Page Number

c381-c386

Share This Article


Jetir RMS

Title

A DEEP LEARNING APPROACH TO IMAGE-TEXT EMBEDDING

Abstract

Image-text matching is an important task in computer vision and natural language processing that aims to find the semantic relationship between an image and its corresponding text description. One approach to image-text matching is deep learning, which has shown promising results in various computer vision and natural language processing tasks. The first step is to pre-process the image and text data to make them suitable for deep learning models. For images, this may involve resizing, cropping, and normalization. For text, this may involve tokenization, stopword removal, and stemming. In image processing, this may involve using convolutional neural networks (CNNs) to extract features from images. In natural language processing, this may involve using pre-trained language models such as BERT or GPT to extract features from text. Once the features are extracted, the next step is to create a common embedding space for both the image and text features. This can be done by training a deep learning model that learns to map both the image and text features into a shared embedding space. The final step is to measure the similarity between the embedded image and text features. This can be done using various similarity measures such as cosine similarity or Euclidean distance.

Key Words

Resnet-101, MS COCO,Word2Vec,Corpus,Cross-gram model, Dual Path

Cite This Article

"A DEEP LEARNING APPROACH TO IMAGE-TEXT EMBEDDING", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.10, Issue 7, page no.c381-c386, July-2023, Available :http://www.jetir.org/papers/JETIR2307244.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"A DEEP LEARNING APPROACH TO IMAGE-TEXT EMBEDDING", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.10, Issue 7, page no. ppc381-c386, July-2023, Available at : http://www.jetir.org/papers/JETIR2307244.pdf

Publication Details

Published Paper ID: JETIR2307244
Registration ID: 520930
Published In: Volume 10 | Issue 7 | Year July-2023
DOI (Digital Object Identifier):
Page No: c381-c386
Country: Hyderebad, Telangana, India .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

000103

Print This Page

Current Call For Paper

Jetir RMS