Impact of acoustical voice activity detection on spontaneous filled pause classification

Filled pause detection is imperative for spontaneous speech recognition as it may degrade speech recognition rate. However, filled pause is commonly confused with elongation as they shared the same acoustical properties. Few attempts of classifying filled pause and elongation employed Hidden Markov...

Full description

Saved in:
Bibliographic Details
Main Authors: Hamzah, Raseeda, Jamil, Nursuriati, Seman, Noraini, Ardi, Norizah, C. Doraisamy, Shyamala
Format: Conference or Workshop Item
Language:English
Published: IEEE 2014
Online Access:http://psasir.upm.edu.my/id/eprint/56314/1/Impact%20of%20acoustical%20voice%20activity%20detection%20on%20spontaneous%20filled%20pause%20classification.pdf
http://psasir.upm.edu.my/id/eprint/56314/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Filled pause detection is imperative for spontaneous speech recognition as it may degrade speech recognition rate. However, filled pause is commonly confused with elongation as they shared the same acoustical properties. Few attempts of classifying filled pause and elongation employed Hidden Markov model. Our proposed method of utilizing Neural Network as a classifier achieved 96% precision rate. We also proved that voice activity detection (VAD) affects the performance of speech recognition. Three acoustical-based VAD are compared and the best precision rate is achieved by incorporating volume and first-order difference features. Experiments are conducted using Malay language spontaneous speeches of Malaysia Parliamentary Debate sessions.