Detailed Information

Cited 3 times in Web of Science; cited 4 times in Scopus

Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network

Full metadata record
DC Field | Value | Language
dc.contributor.author | Farooq, Misbah | -
dc.contributor.author | Hussain, Fawad | -
dc.contributor.author | Baloch, Naveed Khan | -
dc.contributor.author | Raja, Fawad Riasat | -
dc.contributor.author | Yu, Heejung | -
dc.contributor.author | Zikria, Yousaf Bin | -
dc.date.accessioned | 2021-08-30T09:42:44Z | -
dc.date.available | 2021-08-30T09:42:44Z | -
dc.date.created | 2021-06-18 | -
dc.date.issued | 2020-11 | -
dc.identifier.issn | 1424-8220 | -
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/51984 | -
dc.description.abstract | Speech emotion recognition (SER) plays a significant role in human-machine interaction. Emotion recognition from speech and its precise classification is a challenging task because a machine is unable to understand its context. For accurate emotion classification, emotionally relevant features must be extracted from the speech data. Traditionally, handcrafted features were used for emotional classification from speech signals; however, they are not efficient enough to accurately depict the emotional states of the speaker. In this study, the benefits of a deep convolutional neural network (DCNN) for SER are explored. For this purpose, a pretrained network is used to extract features from state-of-the-art speech emotional datasets. Subsequently, a correlation-based feature selection technique is applied to the extracted features to select the most appropriate and discriminative features for SER. For the classification of emotions, we utilize support vector machines, random forests, the k-nearest neighbors algorithm, and neural network classifiers. Experiments are performed for speaker-dependent and speaker-independent SER using four publicly available datasets: the Berlin Dataset of Emotional Speech (Emo-DB), Surrey Audio Visual Expressed Emotion (SAVEE), Interactive Emotional Dyadic Motion Capture (IEMOCAP), and the Ryerson Audio Visual Dataset of Emotional Speech and Song (RAVDESS). Our proposed method achieves an accuracy of 95.10% for Emo-DB, 82.10% for SAVEE, 83.80% for IEMOCAP, and 81.30% for RAVDESS in speaker-dependent SER experiments. Moreover, our method yields the best results for speaker-independent SER when compared with existing handcrafted feature-based SER approaches. | -
dc.language | English | -
dc.language.iso | en | -
dc.publisher | MDPI | -
dc.title | Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network | -
dc.type | Article | -
dc.contributor.affiliatedAuthor | Yu, Heejung | -
dc.identifier.doi | 10.3390/s20216008 | -
dc.identifier.scopusid | 2-s2.0-85094316266 | -
dc.identifier.wosid | 000593598500001 | -
dc.identifier.bibliographicCitation | SENSORS, v.20, no.21 | -
dc.relation.isPartOf | SENSORS | -
dc.citation.title | SENSORS | -
dc.citation.volume | 20 | -
dc.citation.number | 21 | -
dc.type.rims | ART | -
dc.type.docType | Article | -
dc.description.journalClass | 1 | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Chemistry | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalResearchArea | Instruments & Instrumentation | -
dc.relation.journalWebOfScienceCategory | Chemistry, Analytical | -
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | -
dc.relation.journalWebOfScienceCategory | Instruments & Instrumentation | -
dc.subject.keywordAuthor | speech emotion recognition | -
dc.subject.keywordAuthor | deep convolutional neural network | -
dc.subject.keywordAuthor | correlation-based feature selection | -
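
For readers who want a concrete picture of the pipeline described in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes DCNN features have already been extracted per utterance by a pretrained network, uses a simplified correlation-based ranking (absolute Pearson correlation with the label) in place of full correlation-based feature selection, which also penalizes redundancy between features, and trains one of the classifiers mentioned in the paper (an SVM). The synthetic data, feature dimension, number of classes, and the top_k value are placeholders.

```python
# Minimal sketch: DCNN features -> correlation-based ranking -> SVM.
# All data below are synthetic placeholders, not the paper's datasets.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 4096))   # placeholder for pretrained-DCNN features
y = rng.integers(0, 7, size=500)   # placeholder emotion labels (e.g., 7 Emo-DB classes)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

def correlation_scores(features, labels):
    """|Pearson r| between each feature and the (integer-coded) label.

    A simplification: the paper's CFS also accounts for inter-feature redundancy.
    """
    labels = labels.astype(float)
    f_centered = features - features.mean(axis=0)
    l_centered = labels - labels.mean()
    cov = f_centered.T @ l_centered / len(labels)
    denom = features.std(axis=0) * labels.std() + 1e-12
    return np.abs(cov / denom)

top_k = 512                                   # illustrative value only
scores = correlation_scores(X_train, y_train)
selected = np.argsort(scores)[::-1][:top_k]   # keep the top-k ranked features

scaler = StandardScaler().fit(X_train[:, selected])
clf = SVC(kernel="rbf", C=1.0).fit(scaler.transform(X_train[:, selected]), y_train)
pred = clf.predict(scaler.transform(X_test[:, selected]))
print("accuracy:", accuracy_score(y_test, pred))
```

In practice, the placeholder arrays would be replaced by features extracted with a pretrained network from spectrograms of the Emo-DB, SAVEE, IEMOCAP, or RAVDESS recordings, and the random forest, k-nearest neighbors, or neural network classifiers from the paper could be substituted for the SVM.
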
Files in This Item
There are no files associated with this item.
Appears in Collections
Graduate School > Department of Electronics and Information Engineering > 1. Journal Articles


Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
