International Journal For Multidisciplinary Research

E-ISSN: 2582-2160     Impact Factor: 9.24

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 8, Issue 3 (May-June 2026) Submit your research before last 3 days of June to publish your research paper in the issue of May-June.

Low Resource Language Model for Indian Regional language

Author(s) Ms. Soundarya C S, Prof. Harish T A, Prof. Dr. Shantala C P
Country India
Abstract Although many languages exist in India, they all cannot be said to have adequate support for proper language processing. The present project aims at designing a program which would aid language processing from selected Indian languages that have limited availability in digital format. The program will facilitate basic text-processing procedures such as language identification, sentence and word segmentation, stop-word elimination, word frequency count, and POS tagging.. To implement its functionality, the toolkit would use Unicode-based algorithms and simple rules-based methods, thus being able to work with several languages at once, for example, Hindi, Tamil, Telugu, Kannada, Bengali, etc. Moreover, other features like transliteration and translation would be available in order to provide additional comfort for the user. The GUI would be implemented via the Tkinter library allowing people to enter some text and conduct certain language processing actions, while watching the results. The goal of this project is to develop a simple multilingual toolkit for text processing without using complex models or large datasets.
Keywords Natural Language Processing (NLP), Indian Regional Languages, Multilingual Text Processing, Rule-Based Language Processing.
Field Engineering
Published In Volume 8, Issue 3, May-June 2026
Published On 2026-05-26

Share this