Speech emotion recognition using spectrogram based neural structured learning

Human emotions are extremely crucial in our daily life. Emotion analysis based solely on auditory data is difficult due to the lack of visible visual information on human faces. Thus, a unique emotion recognition system based on robust characteristics and machine learning from the audio speech is re...

Full description

Saved in:
Bibliographic Details
Main Authors: Sivan, Dawn, Haripriya, P. H., Jose, Rajan
Format: Conference or Workshop Item
Language:English
English
Published: Universiti Malaysia Pahang 2022
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/36833/1/Speech%20emotion%20recognition%20using%20spectrogram%20based%20neural%20structured%20learning.pdf
http://umpir.ump.edu.my/id/eprint/36833/7/Speech%20Emotion%20Recognition%20Using%20Spectrogram%20Based%20Neural%20Structured_FULL.pdf
http://umpir.ump.edu.my/id/eprint/36833/
https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.ump.umpir.36833
record_format eprints
spelling my.ump.umpir.368332023-02-07T02:37:10Z http://umpir.ump.edu.my/id/eprint/36833/ Speech emotion recognition using spectrogram based neural structured learning Sivan, Dawn Haripriya, P. H. Jose, Rajan HD28 Management. Industrial Management Q Science (General) T Technology (General) Human emotions are extremely crucial in our daily life. Emotion analysis based solely on auditory data is difficult due to the lack of visible visual information on human faces. Thus, a unique emotion recognition system based on robust characteristics and machine learning from the audio speech is reported in this paper. Audio details are used as input to the person-independent emotion recognition system, from which the spectrogram values are extracted as features. The generated features are then used to train and understand the emotions via Neural Structured Learning (NSL), a fast and accurate deep learning approach. During studies on an emotion dataset of audio speeches, the proposed approach of integrating spectrogram and NSL produced improved recognition rates compared to other known models. The system can be used in smart environments like homes or clinics to provide effective healthcare, music recommendations, customer support, and marketing, among several other things. As a result, rather than processing data and making judgments from far distant data sources, the decision-making could be made closer to where the data lives. The Toronto Emotional Speech Set (TESS) dataset that contains 7 emotions has been used for this research. The algorithm is successfully tested with the dataset with an accuracy of ~97%. Universiti Malaysia Pahang 2022-11-15 Conference or Workshop Item PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/36833/1/Speech%20emotion%20recognition%20using%20spectrogram%20based%20neural%20structured%20learning.pdf pdf en http://umpir.ump.edu.my/id/eprint/36833/7/Speech%20Emotion%20Recognition%20Using%20Spectrogram%20Based%20Neural%20Structured_FULL.pdf Sivan, Dawn and Haripriya, P. H. and Jose, Rajan (2022) Speech emotion recognition using spectrogram based neural structured learning. In: The 6th National Conference for Postgraduate Research (NCON-PGR 2022), 15 November 2022 , Virtual Conference, Universiti Malaysia Pahang, Malaysia. p. 80.. https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files
institution Universiti Malaysia Pahang
building UMP Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Pahang
content_source UMP Institutional Repository
url_provider http://umpir.ump.edu.my/
language English
English
topic HD28 Management. Industrial Management
Q Science (General)
T Technology (General)
spellingShingle HD28 Management. Industrial Management
Q Science (General)
T Technology (General)
Sivan, Dawn
Haripriya, P. H.
Jose, Rajan
Speech emotion recognition using spectrogram based neural structured learning
description Human emotions are extremely crucial in our daily life. Emotion analysis based solely on auditory data is difficult due to the lack of visible visual information on human faces. Thus, a unique emotion recognition system based on robust characteristics and machine learning from the audio speech is reported in this paper. Audio details are used as input to the person-independent emotion recognition system, from which the spectrogram values are extracted as features. The generated features are then used to train and understand the emotions via Neural Structured Learning (NSL), a fast and accurate deep learning approach. During studies on an emotion dataset of audio speeches, the proposed approach of integrating spectrogram and NSL produced improved recognition rates compared to other known models. The system can be used in smart environments like homes or clinics to provide effective healthcare, music recommendations, customer support, and marketing, among several other things. As a result, rather than processing data and making judgments from far distant data sources, the decision-making could be made closer to where the data lives. The Toronto Emotional Speech Set (TESS) dataset that contains 7 emotions has been used for this research. The algorithm is successfully tested with the dataset with an accuracy of ~97%.
format Conference or Workshop Item
author Sivan, Dawn
Haripriya, P. H.
Jose, Rajan
author_facet Sivan, Dawn
Haripriya, P. H.
Jose, Rajan
author_sort Sivan, Dawn
title Speech emotion recognition using spectrogram based neural structured learning
title_short Speech emotion recognition using spectrogram based neural structured learning
title_full Speech emotion recognition using spectrogram based neural structured learning
title_fullStr Speech emotion recognition using spectrogram based neural structured learning
title_full_unstemmed Speech emotion recognition using spectrogram based neural structured learning
title_sort speech emotion recognition using spectrogram based neural structured learning
publisher Universiti Malaysia Pahang
publishDate 2022
url http://umpir.ump.edu.my/id/eprint/36833/1/Speech%20emotion%20recognition%20using%20spectrogram%20based%20neural%20structured%20learning.pdf
http://umpir.ump.edu.my/id/eprint/36833/7/Speech%20Emotion%20Recognition%20Using%20Spectrogram%20Based%20Neural%20Structured_FULL.pdf
http://umpir.ump.edu.my/id/eprint/36833/
https://ncon-pgr.ump.edu.my/index.php/en/?option=com_fileman&view=file&routed=1&name=E-BOOK%20NCON%202022%20.pdf&folder=E-BOOK%20NCON%202022&container=fileman-files
_version_ 1758578252932186112
score 13.211869