International Journal For Multidisciplinary Research

E-ISSN: 2582-2160     Impact Factor: 9.24

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 8, Issue 3 (May-June 2026) Submit your research before last 3 days of June to publish your research paper in the issue of May-June.

English audio-video to Marathi audio-video using Machine learning

Author(s) Ms. Karishma Karande, Ms. Ankita Thepale, Ms. Vidhi Karade, Mr. Swapnil Kohle, Mr. Soham Wankhede, Mr. Mandar Deo, Mr. Chaitanya Thapa
Country India
Abstract In this paper, we explore different techniques of overcoming the challenges of low-resource in Neural Machine Translation (NMT), specifically focusing on the case of English-Marathi NMT. This report details the objective, methodology, and system overview for developing an expert-level Speech-to-Speech Translation (S2ST) system for the resource-constrained English-to-Marathi language pair using machine learning. The traditional cascaded approach (Automatic Speech Recognition -> Machine Translation -> Text-to-Speech) is critically assessed and deemed suboptimal due to its inherent susceptibility to compounded errors, high computational latency, and significant loss of prosodic information during the intermediate text representation stage. To circumvent these limitations, a Unit-to-Unit Sequence-to-Sequence (Seq2Seq) framework is proposed.
Keywords Neural Machine Translation (NMT), Audio-Visual Machine Translation, English-to-Marathi Translation, Low-Resource Language Translation, Automatic Speech Recognition (ASR), Text-to-Speech (TTS), End-to-End Translation, Unit-to-Unit, Sequence-to-Sequence (Seq2Seq), Lip Synchronization (Lip-Sync).
Field Engineering
Published In Volume 7, Issue 6, November-December 2025
Published On 2025-11-16
DOI https://doi.org/10.36948/ijfmr.2025.v07i06.60826

Share this