UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014

Published in:

Volume 11 Issue 4
April-2024
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Published Paper ID: JETIR2404511
Registration ID: 536312
Page Number: f104-f108


Title

Multi Modal Text and Image Summarization using Deep Learning and Natural Language Processing Techniques – A Review

Abstract

Automated information retrieval and text summarization are challenging problems in natural language processing because of the complexity and irregular structure of natural-language text. Text summarization condenses a lengthy text into a short summary that paraphrases its content. Image captioning, the automatic generation of a sentence that describes an image, combines natural language processing with computer vision. Research on image captioning can help visually impaired people understand their surroundings and may also support sentence-level photo organization. Convolutional neural networks (CNNs) and recurrent neural networks (RNNs) are the primary building blocks of contemporary techniques. Generating captions that are both accurate and descriptive nevertheless remains difficult. Accurate captions are sentences that match the visual content; descriptive captions are sentences that offer varied descriptions rather than simple, everyday language. In general, the vision model must encode the visual context completely, and the language model must reliably translate that visual representation into a readable sentence.
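
The summarization task mentioned in the abstract can be illustrated with a deliberately simple extractive baseline. The sketch below is not one of the deep learning methods the review covers: the abstract describes summaries that paraphrase the source, whereas this example only selects existing sentences by average word frequency. The function names, the small stopword list, and the sample text are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real system would use a fuller one.
STOPWORDS = {"a", "an", "and", "the", "is", "was", "of", "to", "in", "it"}


def split_sentences(text):
    """Split text on whitespace that follows sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def content_words(text):
    """Lowercase the text and return its words, excluding stopwords."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def summarize(text, max_sentences=2):
    """Keep the highest-scoring sentences, returned in their original order."""
    sentences = split_sentences(text)
    freq = Counter(content_words(text))

    def score(sentence):
        tokens = content_words(sentence)
        if not tokens:
            return 0.0
        # Average document-level frequency of the sentence's content words.
        return sum(freq[w] for w in tokens) / len(tokens)

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]),
                    reverse=True)
    keep = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in keep)


if __name__ == "__main__":
    sample = ("Cats sleep a lot. Cats chase mice and cats climb trees. "
              "The weather was mild.")
    print(summarize(sample, max_sentences=2))
```

On the sample text, the sentence about the weather shares no frequent words with the rest and is the one dropped. A neural abstractive summarizer would instead generate new wording, which is the harder setting the abstract refers to.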

Key Words

Information retrieval, text summarization, deep learning, word2vec, dense captioning, Stanford, NLP

Cite This Article

"Multi Modal Text and Image Summarization using Deep Learning and Natural Language Processing Techniques – A Review", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN: 2349-5162, Vol. 11, Issue 4, page no. f104-f108, April-2024, Available: http://www.jetir.org/papers/JETIR2404511.pdf


Publication Details

Published Paper ID: JETIR2404511
Registration ID: 536312
Published In: Volume 11 | Issue 4 | Year April-2024
Page No: f104-f108
Country: Pune, Maharashtra, India
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication
