UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Call for Paper
Volume 11 | Issue 3 | March 2024

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 5 Issue 5
May-2018
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR1805728


Registration ID:
182735

Page Number

754-761

Share This Article


Jetir RMS

Title

A Language Identification System Using CRF-based Approach for North-East Indian Regional Social Media Text

Abstract

Identification of the languages at the document level has been considered an almost solved problem in some application areas, but language detectors fail to perform well in the social media context due to phenomena such as utterance internal code-switching, lexical borrowings and phonetic typing. In such an environment, automatic language identification for the code-mixed Social Media Texts has captured attention from the Natural Language Processing Research community. We describe our Conditional Random Field(CRF)–based system for automatic language identification of social media content of code-mixed English and Manipuri texts. A dataset of Twitter and Facebook posts that exhibit code-mixing between English and Manipuri was selected. Experimentation on CRF models was done using various features and the performances have been observed.

Key Words

Natural Language Processing,code-mixed,CRF,trigrams,bigrams.

Cite This Article

"A Language Identification System Using CRF-based Approach for North-East Indian Regional Social Media Text", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.5, Issue 5, page no.754-761, MAY-2018, Available :http://www.jetir.org/papers/JETIR1805728.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"A Language Identification System Using CRF-based Approach for North-East Indian Regional Social Media Text", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.5, Issue 5, page no. pp754-761, MAY-2018, Available at : http://www.jetir.org/papers/JETIR1805728.pdf

Publication Details

Published Paper ID: JETIR1805728
Registration ID: 182735
Published In: Volume 5 | Issue 5 | Year May-2018
DOI (Digital Object Identifier): http://doi.one/10.1729/IJCRT.17737
Page No: 754-761
Country: Imphal West, Manipur, INDIA .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

0002997

Print This Page

Current Call For Paper

Jetir RMS