International Journal For Multidisciplinary Research
E-ISSN: 2582-2160 • Impact Factor: 9.24
A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal
Real-Time Sign Language Translator
| Author(s) | Mr. Ansh Raj Mittal, Mr. Satvik Shrivastava, Mr. Bhavya Kumar |
|---|---|
| Country | India |
| Abstract | This paper presents a real-time sign language translation system integrating Google MediaPipe Hands landmark extraction, sequence-anchored coordinate normalization, and a Transformer-based deep learning architecture. The system recognizes fifteen distinct sign language gesture classes (wave, yes, no, stop, wait, yo, good, bad, peace, call_me, promise, up, down, circle, and idle) from a standard webcam without specialized sensors. The Transformer encoder employs four-head multi-head self-attention (key dimension 256), Conv1D feed-forward sublayers, residual connections, and Layer Normalization to capture long-range temporal dependencies across 60-frame gesture sequences. A rolling buffer and five-frame majority-vote stabilization mechanism suppress prediction noise for stable real-time output. Evaluation on a 27,000-frame dataset yields 100% classification accuracy with precision, recall, and F1-score each equal to 1.0000. The confusion matrix exhibits perfect diagonal dominance with zero inter-class misclassification. Mean end-to-end inference latency is 39.0 ms on CPU-only consumer hardware, confirming practical real-time deployment suitability. |
| Keywords | Sign Language Recognition, Deep Learning, Transformer Networks, MediaPipe, Gesture Recognition, Computer Vision, Human-Computer Interaction, Real-Time Translation, Multi-Head Attention, Temporal Sequence Classification |
| Field | Computer > Artificial Intelligence / Simulation / Virtual Reality |
| Published In | Volume 8, Issue 3, May-June 2026 |
| Published On | 2026-05-11 |
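
The rolling buffer and five-frame majority vote mentioned in the abstract could look roughly like the sketch below. This is not the authors' code: the class name `GestureStabilizer`, the `classify` callable (standing in for the trained Transformer), and the strict-majority rule are all assumptions made for illustration. Only the 60-frame sequence length and the five-prediction vote window come from the abstract.

```python
from collections import Counter, deque

SEQ_LEN = 60      # frames per gesture sequence (from the abstract)
VOTE_WINDOW = 5   # recent predictions used for the majority vote (from the abstract)


class GestureStabilizer:
    """Rolling frame buffer plus majority vote over recent predictions.

    Hypothetical helper: `classify` is any callable that maps a list of
    per-frame landmark vectors to a gesture label.
    """

    def __init__(self, classify, seq_len=SEQ_LEN, vote_window=VOTE_WINDOW):
        self.classify = classify
        self.frames = deque(maxlen=seq_len)     # oldest frame is dropped automatically
        self.votes = deque(maxlen=vote_window)  # oldest prediction is dropped automatically

    def update(self, landmarks):
        """Add one frame of landmarks; return a stable label, or None."""
        self.frames.append(landmarks)
        if len(self.frames) < self.frames.maxlen:
            return None  # buffer not yet full, nothing to classify
        self.votes.append(self.classify(list(self.frames)))
        if len(self.votes) < self.votes.maxlen:
            return None  # not enough predictions to vote on
        label, count = Counter(self.votes).most_common(1)[0]
        # Assumption: require a strict majority so a noisy window emits nothing.
        return label if count > len(self.votes) // 2 else None
```

In a webcam loop, `update` would be called once per frame with that frame's normalized landmark vector, and a label would be displayed only when the return value is not `None`.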
CrossRef DOI is assigned to each research paper published in our journal. IJFMR DOI prefix is 10.36948/ijfmr.
All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.
Powered by Sky Research Publication and Journals