Fuzzy based multi-source data fusion for children's age estimation

Estimation of speaker's age is a challenge in speech processing area. This paper a novel approach for estimating a speaker's age is addressed. The method employs a "divide and conquer" strategy wherein the processing speech data are divided into six groups based on the vowel clas...

全面介紹

Saved in:
書目詳細資料
Main Authors: Mirhassani, S.M., Zourmand, A., Ting, H.N.
格式: Conference or Workshop Item
語言:English
出版: 2014
主題:
在線閱讀:http://eprints.um.edu.my/11391/1/0001.pdf
http://eprints.um.edu.my/11391/
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Estimation of speaker's age is a challenge in speech processing area. This paper a novel approach for estimating a speaker's age is addressed. The method employs a "divide and conquer" strategy wherein the processing speech data are divided into six groups based on the vowel classes. Afterward, Mel-frequency cepstral coefficients are computed for each group and single layer feed-forward neural networks are applied to the features to make a primary decision. The extreme learning machine (ELM) method is used to train the classifiers. Subsequently, fuzzy data fusion is employed to provide an overall decision by aggregating the classifier's outputs. The results are then compared with vowel independent age estimation based on ELM and other well-known classification methods, including support vector machine and Knearest neighbor. The processing speech data include six Malay vowels collected from 360 Malay children aged between 7 and 12 years. Experiments conducted based on six age groups revealed that fuzzy fusion of the classifier's outputs resulted in considerable improvement of up to 72.63% in age estimation accuracy. Moreover, the fuzzy fusion of decisions aggregated complimentary information of a speaker's age from varied speech sources.