UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Call for Paper
Volume 11 | Issue 5 | May 2024

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 7 Issue 8
August-2020
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIR2008286


Registration ID:
300275

Page Number

2151-2157

Share This Article


Jetir RMS

Title

An Unsupervised Word Level Language Identification of English and Kokborok Code-Mixed and Code-Switched Sentences

Abstract

: Considering the increasing uses of multilingual text, the need for an automatic word level language identification model is raised. In this regard, we present an unsupervised model for word level language identification of English and Kokborok code-mixed and code-switched sentences. Several works have already been reported for various languages, including various Indian languages. But, to the best of our knowledge, ours is the first language identification work dedicated to low resource English-Kokborok language pairs. The proposed model combines a frequency lexicon based, character n-gram language model and a language dependent morphological dictionary-based model for correctly classifying each word. The model which is suitable for low resource languages that do not have a large number of annotated dataset is able to achieve a good performance with word accuracy level of 84%.

Key Words

Word level language identification, code-mixing, code-switching, dictionaries, affixes, kokborok, English.

Cite This Article

"An Unsupervised Word Level Language Identification of English and Kokborok Code-Mixed and Code-Switched Sentences ", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.7, Issue 8, page no.2151-2157, August-2020, Available :http://www.jetir.org/papers/JETIR2008286.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"An Unsupervised Word Level Language Identification of English and Kokborok Code-Mixed and Code-Switched Sentences ", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.7, Issue 8, page no. pp2151-2157, August-2020, Available at : http://www.jetir.org/papers/JETIR2008286.pdf

Publication Details

Published Paper ID: JETIR2008286
Registration ID: 300275
Published In: Volume 7 | Issue 8 | Year August-2020
DOI (Digital Object Identifier): http://doi.one/10.1729/Journal.34280
Page No: 2151-2157
Country: Gomati, Tripura, India .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

0003328

Print This Page

Current Call For Paper

Jetir RMS