International Journal For Multidisciplinary Research
E-ISSN: 2582-2160
•
Impact Factor: 9.24
A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal
Home
Research Paper
Submit Research Paper
Publication Guidelines
Publication Charges
Upload Documents
Track Status / Pay Fees / Download Publication Certi.
Editors & Reviewers
View All
Join as a Reviewer
Get Membership Certificate
Current Issue
Publication Archive
Conference
Publishing Conf. with IJFMR
Upcoming Conference(s) ↓
Conferences Published ↓
DePaul-2026
IC-AIRCM-T3-2026
SPHERE-2025
AIMAR-2025
SVGASCA-2025
ICCE-2025
Chinai-2023
PIPRDA-2023
ICMRS'23
Contact Us
Plagiarism is checked by the leading plagiarism checker
Call for Paper
Volume 8 Issue 3
May-June 2026
Indexing Partners
A Deep Learning Framework for Speech Emotion Recognition: A Gender-Aware Hierarchical Pipeline with Optimized 18-Layer Convolutional Neural Network
| Author(s) | Dr. Savita Jain |
|---|---|
| Country | India |
| Abstract | The field of Affective Computing has emerged as a crucial domain in human-computer interaction, with Speech Emotion Recognition (SER) serving as a cornerstone for developing intuitive, context-aware systems. While traditional Automated Speech Recognition (ASR) frameworks have achieved considerable maturity in decoding semantic content, recognizing the underlying emotional state from spoken language remains a computationally complex challenge. Real-world acoustic signals are heavily influenced by environmental noise, speaker idiosyncrasies, and physical variability across genders. This paper introduces a high-performance, structurally optimized hierarchical framework that addresses these limitations through three primary contributions: (1) a dense 182-feature extraction pipeline unifying spectral, linear predictive, dynamic energy, prosodic, and statistical shape profiles; (2) an early-stage, gender-aware hierarchical pipeline driven by a Gender Recognition (GR) circuit that splits the processing stream based on fundamental frequency distribution to eliminate cross-gender acoustic overlaps; and (3) a customized 18-layer Deep Convolutional Neural Network (CNN) integrated with meta-heuristic hyper-parameter optimization. The system is evaluated on the RAVDESS and SAVEE benchmark corpora, demonstrating superior multi-class emotion classification accuracy and operational efficiency compared to baseline Multi-Layer Perceptron (MLP) and Long Short-Term Memory (LSTM) architectures. |
| Keywords | Speech Emotion Recognition, Convolutional Neural Network, Gender Recognition Circuit, Affective Computing, RAVDESS, SAVEE, Feature Extraction, Deep Learning. |
| Field | Computer |
| Published In | Volume 8, Issue 3, May-June 2026 |
| Published On | 2026-05-30 |
| DOI | https://doi.org/10.36948/ijfmr.2026.v08i03.80011 |
Share this

E-ISSN 2582-2160
CrossRef DOI is assigned to each research paper published in our journal.
IJFMR DOI prefix is
10.36948/ijfmr
Downloads
All research papers published on this website are licensed under Creative Commons Attribution-ShareAlike 4.0 International License, and all rights belong to their respective authors/researchers.
Powered by Sky Research Publication and Journals