Speech emotion recognition research: an analysis of research focus

This article analyses research in speech emotion recognition (“SER”) from 2006 to 2017 in order to identify the current focus of research, and areas in which research is lacking. The objective is to examine what is being done in this field of research. Searching on selected keywords, we extracted an...

Bibliographic Details
Main Authors: Mustafa, Mumtaz Begum, Yusoof, Mansoor A.M., Mohd Don, Zuraidah, Malekzadeh, Mehdi
Format: Article
Published: Springer Verlag 2018
Subjects: P Philology. Linguistics; QA76 Computer software
Online Access: http://eprints.um.edu.my/21189/
https://doi.org/10.1007/s10772-018-9493-x
author Mustafa, Mumtaz Begum
Yusoof, Mansoor A.M.
Mohd Don, Zuraidah
Malekzadeh, Mehdi
building UM Library
collection Institutional Repository
content_provider Universiti Malaya
content_source UM Research Repository
continent Asia
country Malaysia
description This article analyses research in speech emotion recognition (“SER”) from 2006 to 2017 in order to identify the current focus of research and the areas in which research is lacking. The objective is to examine what is being done in this field of research. Searching on selected keywords, we extracted and analysed 260 articles from well-known online databases. The analysis indicates that SER is an active field of research, with dozens of articles published each year in journals and conference proceedings. The majority of articles concentrate on three critical aspects of SER, namely (1) databases, (2) suitable speech features, and (3) classification techniques to maximize the recognition accuracy of SER systems. Having carried out an association analysis of the critical aspects and how they influence the performance of the SER system in terms of recognition accuracy, we found that certain combinations of databases, speech features and classifiers influence the recognition accuracy of the SER system. We have also suggested aspects of SER that could be taken into consideration in future work based on our review.
format Article
id my.um.eprints-21189
institution Universiti Malaya
publishDate 2018
publisher Springer Verlag
record_format eprints
spelling my.um.eprints-21189 2019-05-09T05:18:29Z http://eprints.um.edu.my/21189/
Speech emotion recognition research: an analysis of research focus
Mustafa, Mumtaz Begum; Yusoof, Mansoor A.M.; Mohd Don, Zuraidah; Malekzadeh, Mehdi
P Philology. Linguistics; QA76 Computer software
Springer Verlag 2018 Article PeerReviewed
Mustafa, Mumtaz Begum and Yusoof, Mansoor A.M. and Mohd Don, Zuraidah and Malekzadeh, Mehdi (2018) Speech emotion recognition research: an analysis of research focus. International Journal of Speech Technology, 21 (1). pp. 137-156. ISSN 1381-2416, DOI https://doi.org/10.1007/s10772-018-9493-x
title Speech emotion recognition research: an analysis of research focus
topic P Philology. Linguistics
QA76 Computer software
url http://eprints.um.edu.my/21189/
https://doi.org/10.1007/s10772-018-9493-x
url_provider http://eprints.um.edu.my/