Speech emotion classification using SVM and MLP on prosodic and voice quality features

Bibliographic Details
Main Authors: Idris, Inshirah, Salam, Md. Sah, Sunar, Mohd. Shahrizal
Format: Article
Language: English
Published: Penerbit UTM Press 2016
Online Access: http://eprints.utm.my/id/eprint/71237/1/MdSahSalam2016_SpeechemotionclassificationusingSVM.pdf
http://eprints.utm.my/id/eprint/71237/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-84960154852&doi=10.11113%2fjt.v78.6925&partnerID=40&md5=077bb5e73345f665103c0fb5d9df3473
Description
Summary: This paper compares emotion classification by the Support Vector Machine (SVM) and the Multi-Layer Perceptron (MLP) neural network, using prosodic and voice quality features extracted from the Berlin Emotional Database. The features were extracted using PRAAT tools, and the WEKA tool was used for classification. Different parameter settings were tried for both SVM and MLP to optimize classification. The results show that MLP outperforms SVM in overall emotion classification, although SVM trained much faster than MLP. The overall accuracy was 76.82% for SVM and 78.69% for MLP. Sadness was the emotion best recognized by MLP, with an accuracy of 89.0%, while anger was the emotion best recognized by SVM, with an accuracy of 87.4%. The most frequently confused emotions were happiness and fear for MLP, and disgust and fear for SVM.
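
The figures quoted in the summary (overall accuracy, best-recognized emotion, most confused emotions) are the kind of values typically read off a confusion matrix. The sketch below shows one way such values could be derived. It is not the authors' code: the function name `summarize_confusion`, the three-emotion label set, and all counts are hypothetical and chosen only for illustration, and "most confused" is assumed here to mean the pair of classes with the largest number of mutual misclassifications, which the record itself does not define.

```python
from typing import Dict, List, Tuple


def summarize_confusion(
    labels: List[str], matrix: List[List[int]]
) -> Tuple[float, Dict[str, float], str, Tuple[str, str]]:
    """Summarize a confusion matrix (rows = true class, columns = predicted).

    Assumes every class has at least one true sample.
    """
    n = len(labels)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(n))
    overall = correct / total

    # Per-class accuracy: correctly classified share of each true class.
    per_class = {labels[i]: matrix[i][i] / sum(matrix[i]) for i in range(n)}
    best = max(per_class, key=lambda name: per_class[name])

    # Assumed definition: the most confused pair has the largest
    # combined off-diagonal count in both directions.
    worst_pair, worst_count = (labels[0], labels[1]), -1
    for i in range(n):
        for j in range(i + 1, n):
            count = matrix[i][j] + matrix[j][i]
            if count > worst_count:
                worst_pair, worst_count = (labels[i], labels[j]), count

    return overall, per_class, best, worst_pair


# Made-up counts for illustration only; these are NOT results from the paper.
labels = ["anger", "fear", "sadness"]
matrix = [
    [18, 2, 0],   # true anger
    [3, 14, 3],   # true fear
    [0, 1, 19],   # true sadness
]

overall, per_class, best, worst_pair = summarize_confusion(labels, matrix)
print(f"overall accuracy: {overall:.2%}")
print(f"best-recognized emotion: {best}")
print(f"most confused pair: {worst_pair}")
```

Under a different definition of confusion, for example the single class with the lowest per-class accuracy, the last step would change while the accuracy calculations would stay the same.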