International Journal For Multidisciplinary Research
E-ISSN: 2582-2160
•
Impact Factor: 9.24
A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal
Home
Research Paper
Submit Research Paper
Publication Guidelines
Publication Charges
Upload Documents
Track Status / Pay Fees / Download Publication Certi.
Editors & Reviewers
View All
Join as a Reviewer
Get Membership Certificate
Current Issue
Publication Archive
Conference
Publishing Conf. with IJFMR
Upcoming Conference(s) ↓
Conferences Published ↓
DePaul-2026
IC-AIRCM-T3-2026
SPHERE-2025
AIMAR-2025
SVGASCA-2025
ICCE-2025
Chinai-2023
PIPRDA-2023
ICMRS'23
Contact Us
Plagiarism is checked by the leading plagiarism checker
Call for Paper
Volume 8 Issue 3
May-June 2026
Indexing Partners
ReadAble , a lip reading and sign language interpretation machine learning and deep learning model
| Author(s) | Mr. Prajyot Kawatkar, Mr. Sujal Wankhede, Mr. Dev Katre, Mr. Rafe Sheikh, Prof. Leelkanth Dewangan |
|---|---|
| Country | India |
| Abstract | Silent communication involving lip movements and sign language continues to be greatly under-used in popular media and new media. This paper describes ReadAble, a two-part AI system consisting of a Lip Reading module and a Sign Language Detection module that transforms silent video into output that is made accessible via text and synthesized audio output (lip reading) and translated text output (sign language). The Lip Reading module first locates the lip region of the video, tracks the spatiotemporal features of the lip movement, and maps the resulting movement to phonemes/words, producing both text and speech output. The Sign Language Detection module detects the hand and body movements (e.g. ASL, ISL in this case) and interprets the movement sequences from arbitrary sign language expressions into signed sentences or phrases, before producing them in the form of subtitles. The implementation of the system was done in Python (Jupyter Notebook) utilizing multiple state-of-theart models (3D CNN, LSTM / Transformer, MediaPipe / YOLO or SSD), utilizing video data captured from user or recorded feed.Overall, we recognize several remaining challenges to deploying the system in real world noisy, occluded, and continuous settings; and we provide some thoughts on the roadmap for getting us from a lab and into a real-world setting with a better product. |
| Keywords | Lip Reading, Sign Language Detection, Computer Vision, Deep Learning, Accessibility, HumanComputer Interaction |
| Field | Engineering |
| Published In | Volume 8, Issue 3, May-June 2026 |
| Published On | 2026-05-29 |
Share this

E-ISSN 2582-2160
CrossRef DOI is assigned to each research paper published in our journal.
IJFMR DOI prefix is
10.36948/ijfmr
Downloads
All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.
Powered by Sky Research Publication and Journals