International Journal For Multidisciplinary Research
E-ISSN: 2582-2160
•
Impact Factor: 9.24
A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal
Home
Research Paper
Submit Research Paper
Publication Guidelines
Publication Charges
Upload Documents
Track Status / Pay Fees / Download Publication Certi.
Editors & Reviewers
View All
Join as a Reviewer
Get Membership Certificate
Current Issue
Publication Archive
Conference
Publishing Conf. with IJFMR
Upcoming Conference(s) ↓
Conferences Published ↓
DePaul-2026
IC-AIRCM-T3-2026
SPHERE-2025
AIMAR-2025
SVGASCA-2025
ICCE-2025
Chinai-2023
PIPRDA-2023
ICMRS'23
Contact Us
Plagiarism is checked by the leading plagiarism checker
Call for Paper
Volume 8 Issue 3
May-June 2026
Indexing Partners
English audio-video to Marathi audio-video using Machine learning
| Author(s) | Ms. Karishma Karande, Ms. Ankita Thepale, Ms. Vidhi Karade, Mr. Swapnil Kohle, Mr. Soham Wankhede, Mr. Mandar Deo, Mr. Chaitanya Thapa |
|---|---|
| Country | India |
| Abstract | In this paper, we explore different techniques of overcoming the challenges of low-resource in Neural Machine Translation (NMT), specifically focusing on the case of English-Marathi NMT. This report details the objective, methodology, and system overview for developing an expert-level Speech-to-Speech Translation (S2ST) system for the resource-constrained English-to-Marathi language pair using machine learning. The traditional cascaded approach (Automatic Speech Recognition -> Machine Translation -> Text-to-Speech) is critically assessed and deemed suboptimal due to its inherent susceptibility to compounded errors, high computational latency, and significant loss of prosodic information during the intermediate text representation stage. To circumvent these limitations, a Unit-to-Unit Sequence-to-Sequence (Seq2Seq) framework is proposed. |
| Keywords | Neural Machine Translation (NMT), Audio-Visual Machine Translation, English-to-Marathi Translation, Low-Resource Language Translation, Automatic Speech Recognition (ASR), Text-to-Speech (TTS), End-to-End Translation, Unit-to-Unit, Sequence-to-Sequence (Seq2Seq), Lip Synchronization (Lip-Sync). |
| Field | Engineering |
| Published In | Volume 7, Issue 6, November-December 2025 |
| Published On | 2025-11-16 |
| DOI | https://doi.org/10.36948/ijfmr.2025.v07i06.60826 |
Share this

E-ISSN 2582-2160
CrossRef DOI is assigned to each research paper published in our journal.
IJFMR DOI prefix is
10.36948/ijfmr
Downloads
All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.
Powered by Sky Research Publication and Journals