International Journal For Multidisciplinary Research
E-ISSN: 2582-2160
•
Impact Factor: 9.24
A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal
Home
Research Paper
Submit Research Paper
Publication Guidelines
Publication Charges
Upload Documents
Track Status / Pay Fees / Download Publication Certi.
Editors & Reviewers
View All
Join as a Reviewer
Get Membership Certificate
Current Issue
Publication Archive
Conference
Publishing Conf. with IJFMR
Upcoming Conference(s) ↓
Conferences Published ↓
IC-AIRCM-T3-2026
SPHERE-2025
AIMAR-2025
SVGASCA-2025
ICCE-2025
Chinai-2023
PIPRDA-2023
ICMRS'23
Contact Us
Plagiarism is checked by the leading plagiarism checker
Call for Paper
Volume 8 Issue 2
March-April 2026
Indexing Partners
SMART VISION: DETECT AND DESCRIBE
| Author(s) | Mr. MANJUNATHA G, Ms. Kruthi M, Ms. Deekshitha M R, Mr. Kishan B, Ms. Deeksha S K |
|---|---|
| Country | India |
| Abstract | As the field of computer vision evolves rapidly, the need for integrated systems that can perform multi-faceted visual analysis is growing. Conventional recognition systems are usually limited to single-purpose tasks, thus, solutions that are fragmented and lack versatility are the result. This manuscript introduces "Smart Vision: Detect and Describe," a single AI-based visual perception system that is capable of detecting and understanding the most common entities in the real world such as objects, hand gestures, and handwritten digits in real-time. The system in question is employing a modular architecture based on the Flask web framework and it is integrating cutting-edge deep learning models. In particular, it uses YOLOv8 for fast and accurate object detection, while several Convolutional Neural Networks (CNNs) are being employed for gesture and digit recognition. The system, by merging these separate vision problems into one platform, not only facilitates adaptive detection but also can output contextual descriptions via a connected camera. Testing-with data from live streaming and on-the-fly model execution-shows that the system is very accurate and prompt. Such results are an endorsement of the framework, and they open up a plethora of possibilities for the framework to be used in real-world scenarios. Examples of such scenarios include the employment of the technology in developing devices that aid the visually impaired, intelligent robotics, and sophisticated security surveillance systems. |
| Keywords | Computer Vision, Deep Learning, YOLOv8, CNN, Flask, Real-time Detection, Assistive Technology. |
| Field | Engineering |
| Published In | Volume 7, Issue 6, November-December 2025 |
| Published On | 2025-12-11 |
| DOI | https://doi.org/10.36948/ijfmr.2025.v07i06.62731 |
Share this

E-ISSN 2582-2160
CrossRef DOI is assigned to each research paper published in our journal.
IJFMR DOI prefix is
10.36948/ijfmr
Downloads
All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.
Powered by Sky Research Publication and Journals