Back close

Optimizing Gender Identification with MFCC Feature Engineering and Enhanced Gradient Boosting

Publication Type : Conference Paper

Publisher : IEEE

Source : 2024 Asian Conference on Intelligent Technologies (ACOIT)

Url : https://doi.org/10.1109/acoit62457.2024.10939536

Campus : Bengaluru

School : School of Engineering

Department : Electronics and Communication

Year : 2024

Abstract : Voice-based gender classification has potential significance in the areas of forensic, security and surveillance applications. Biometrics and computer vision [1] are the two areas where gender classification plays vital role. If the audio or the speech sample is noise free, then the recognition accuracy will be high. But, in the case of noisy audio samples and real-time data, existing algorithms failed to achieve greater accuracy. To improve the real-time gender identification, this paper introduces a novel approach that uses feature engineering with optimized Mel Frequency Cepstral Coefficients (MFCC) Features. VoxCeleb2 dataset was used for experimentation which has 6000 data samples. The features are extracted from the audio samples and optimized with polynomial features and logarithmic transformation. In addition, Optuna is used for hyperparameter optimization to improve the model’s performance. The proposed methodology was tested with multiple classifiers. An increase of 7.15% was observed on incorporating feature engineering.

Cite this Research Publication : Manushri Tummala, Lohith Harish, Mahadevi E Malkhed, Siddarth S Kumar, Nizampatnam Neelima, Vivek Venugopal, Optimizing Gender Identification with MFCC Feature Engineering and Enhanced Gradient Boosting, 2024 Asian Conference on Intelligent Technologies (ACOIT), IEEE, 2024, https://doi.org/10.1109/acoit62457.2024.10939536

Admissions Apply Now