Cross-Corpus Speech Emotion Recognition Based on Hybrid Neural Networks

Authors: Rehman, A., Liu, Z.T., Li, D.Y. and Wu, B.H.

Journal: Chinese Control Conference, CCC

Volume: 2020-July

Pages: 7464-7468

eISSN: 2161-2927

ISBN: 9789881563903

ISSN: 1934-1768

DOI: 10.23919/CCC50068.2020.9189368

Abstract:

Speech emotion recognition helps enrich next-generation AI with emotional intelligence abilities by grasping the emotion from voice and words. At the current stage, speech emotion recognition (SER) is only used within experimental boundaries. The current challenge facing the SER research is the lack of robustness across cultures, languages and even minor differences such as age-gaps of speakers. To create a more adaptable SER in adversarial circumstances, we propose hybrid neural networks architecture that creates a holistic model by embedding the Mel Frequency Cepstrum Coefficients as one-hot inputs such that differences in coefficients in each emotional category are inflated according to their importance. We performed experiments on three different databases to test the cross-corpus effectiveness of the proposed model.

Source: Scopus

Cross-Corpus Speech Emotion Recognition Based on Hybrid Neural Networks

Authors: Rehman, A., Liu, Z.-T., Li, D.-Y. and Wu, B.-H.

Journal: PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE

Pages: 7464-7468

ISSN: 2161-2927

Source: Web of Science (Lite)