International Journal For Multidisciplinary Research

E-ISSN: 2582-2160     Impact Factor: 9.24

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 8, Issue 3 (May-June 2026) Submit your research before last 3 days of June to publish your research paper in the issue of May-June.

Landmark-Driven Touchless Audio Control Using OpenCV and MediaPipe

Author(s) Mr. Jayesh Balu Sonar, Ms. Akshita Santosh Patil, Ms. Prajakta Yuvraj Borse, Mr. Jagdish Nandu Jadhav, Mr. Aditya Jagdish Shinde
Country India
Abstract Touchless human-computer interaction provides a natural way to control digital systems without relying on physical input devices. This paper presents a landmark-driven audio interaction framework that uses real-time hand gesture recognition for desktop volume control. The proposed prototype uses OpenCV for webcam frame acquisition and interface rendering, MediaPipe for hand landmark detection, and Windows-compatible audio-control libraries for executing system-level audio operations. Hand landmark relationships are used to classify gesture states and map them to commands such as volume adjustment, mute/unmute control, and boosted audio response. The system also includes a dashboard interface that displays gesture status, hand detection state, volume percentage, frame-rate behavior, and interaction history. A pilot evaluation was structured using lighting variation, camera distance, and real-time performance as key parameters. The results indicate that the prototype performs effectively under normal lighting and medium camera distance, while accuracy decreases in low-light and long-distance conditions. In addition to the implemented desktop prototype, the paper discusses browser extension readiness as a future direction for web-based media control. The proposed system demonstrates the feasibility of a lightweight, low-cost, and webcam-based touchless audio-control interface for multimedia interaction and accessibility-oriented applications.
Keywords Hand Gesture Recognition, MediaPipe, OpenCV, Touchless Interaction, Audio Control, Human-Computer Interaction, Browser Media Control
Field Computer > Artificial Intelligence / Simulation / Virtual Reality
Published In Volume 8, Issue 3, May-June 2026
Published On 2026-05-11
DOI https://doi.org/10.36948/ijfmr.2026.v08i03.77737

Share this