Parallelization strategy of speaker identification system for Hybrid Modeling

Over the last decade technological advances have made speaker recognition brought a significant characteristic in forensics science and biometric identifications. Speaker recognition is a process where a person is recognized on the basis of his/her voice signals (R. C Campbel, 1997). Speaker recogni...

Full description

Saved in:
Bibliographic Details
Main Authors: Ahmad, Abdul Manan, Loh, Mun Yee
Format: Book Section
Language:English
Published: Penerbit UTM 2008
Subjects:
Online Access:http://eprints.utm.my/id/eprint/24970/1/AbdMananAhmad2008_ParallelizationStrategyOfSpeakerIdentification.pdf
http://eprints.utm.my/id/eprint/24970/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.24970
record_format eprints
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Ahmad, Abdul Manan
Loh, Mun Yee
Parallelization strategy of speaker identification system for Hybrid Modeling
description Over the last decade technological advances have made speaker recognition brought a significant characteristic in forensics science and biometric identifications. Speaker recognition is a process where a person is recognized on the basis of his/her voice signals (R. C Campbel, 1997). Speaker recognition can be divided into speaker verification and speaker identification. These can furthermore be divided into text dependent and text independent systems. To date, our technology has yet to provide speaker recognition system for many application include access control system, security control for confidential information, transaction authentication and telephone banking. Pattern classification plays as a crucial part in speaker modeling component chain. The result of pattern classification will strongly affect the speaker recognition engine to decide whether to accept or reject a speaker. Many research efforts have been done in speaker recognition pattern classification. There are Dynamic Time Warping (DTW), Vector Quantization (VQ), Hidden Markov Models, Gaussian mixture model (GMM), Support Vector Machine (SVM)(Sadaoki Furui, 1997)and so forth. Building robust speaker recognition systems are often difficult because speech signal is dynamic and influenced by many sources of variation. The past two decades have seen significant progress being made to cope with this problem using different techniques. From among these techniques, hybrid two types of pattern classification have reported promising results in improving the accuracy result. Although producing considerable improvement, these hybrid techniques are still somewhat restricted in terms of recognition accuracy for large data set. Since previous works have reported substantial examples of successful implementation in combining two classification techniques, this research intends to produce a new ways of hybrid techniques in order to solve the accuracy problem for incremental data set condition. We put forward a new VQ-GMM mixture model to improve recognition rate of the speaker identification system in the chapter. VQ and GMM are widely applied to the speaker identification, but both have some disadvantages. To overcome those shortages, we introduce a new hybrid VQ/GMM model to improve recognition rate of the speaker identification system in the chapter. Although in baseline form, the VQ-based solution is less accurate than the GMM, but it offers simplicity in computation. Besides, after some experiments, we found that VQ and GMM techniques are suitable apply for the speaker independent task. Therefore, we hope to make use of their merits via a hybrid VQ/GMM classifier. There are many forms of GMM and other pattern classification techniques adaptation in the past. In hybrid VQ/GMM, most of them use VQ as an optimization function to reduce Expectation Maximization algorithm in order to improve the training speed (Reynolds and Rose, 1995; J. Pelecano, 2000). Besides, some researchers use GMM as a post-processor after VQ cluster the speech signal into regions (Qiguang Lin et al, 1996). In our proposed hybrid modeling, both VQ model and GMM model will run parallel after signal preprocessing process. A comparison performance of hybrid VQ/GMM, DTW, VQ, GMM and SVM techniques for speaker recognition will be done through the experiments and will reported in this chapter. This chapter is organized as follows. In Section 2, we reviews proposed speaker recognition framework. In Section 3, we discuss how we construct the hybrid modeling for pattern classification. Section 4 shows the experimental result for the comparison performance. Finally, section 5 we concludes our work.
format Book Section
author Ahmad, Abdul Manan
Loh, Mun Yee
author_facet Ahmad, Abdul Manan
Loh, Mun Yee
author_sort Ahmad, Abdul Manan
title Parallelization strategy of speaker identification system for Hybrid Modeling
title_short Parallelization strategy of speaker identification system for Hybrid Modeling
title_full Parallelization strategy of speaker identification system for Hybrid Modeling
title_fullStr Parallelization strategy of speaker identification system for Hybrid Modeling
title_full_unstemmed Parallelization strategy of speaker identification system for Hybrid Modeling
title_sort parallelization strategy of speaker identification system for hybrid modeling
publisher Penerbit UTM
publishDate 2008
url http://eprints.utm.my/id/eprint/24970/1/AbdMananAhmad2008_ParallelizationStrategyOfSpeakerIdentification.pdf
http://eprints.utm.my/id/eprint/24970/
_version_ 1643647539972407296
spelling my.utm.249702017-10-10T01:38:54Z http://eprints.utm.my/id/eprint/24970/ Parallelization strategy of speaker identification system for Hybrid Modeling Ahmad, Abdul Manan Loh, Mun Yee QA75 Electronic computers. Computer science Over the last decade technological advances have made speaker recognition brought a significant characteristic in forensics science and biometric identifications. Speaker recognition is a process where a person is recognized on the basis of his/her voice signals (R. C Campbel, 1997). Speaker recognition can be divided into speaker verification and speaker identification. These can furthermore be divided into text dependent and text independent systems. To date, our technology has yet to provide speaker recognition system for many application include access control system, security control for confidential information, transaction authentication and telephone banking. Pattern classification plays as a crucial part in speaker modeling component chain. The result of pattern classification will strongly affect the speaker recognition engine to decide whether to accept or reject a speaker. Many research efforts have been done in speaker recognition pattern classification. There are Dynamic Time Warping (DTW), Vector Quantization (VQ), Hidden Markov Models, Gaussian mixture model (GMM), Support Vector Machine (SVM)(Sadaoki Furui, 1997)and so forth. Building robust speaker recognition systems are often difficult because speech signal is dynamic and influenced by many sources of variation. The past two decades have seen significant progress being made to cope with this problem using different techniques. From among these techniques, hybrid two types of pattern classification have reported promising results in improving the accuracy result. Although producing considerable improvement, these hybrid techniques are still somewhat restricted in terms of recognition accuracy for large data set. Since previous works have reported substantial examples of successful implementation in combining two classification techniques, this research intends to produce a new ways of hybrid techniques in order to solve the accuracy problem for incremental data set condition. We put forward a new VQ-GMM mixture model to improve recognition rate of the speaker identification system in the chapter. VQ and GMM are widely applied to the speaker identification, but both have some disadvantages. To overcome those shortages, we introduce a new hybrid VQ/GMM model to improve recognition rate of the speaker identification system in the chapter. Although in baseline form, the VQ-based solution is less accurate than the GMM, but it offers simplicity in computation. Besides, after some experiments, we found that VQ and GMM techniques are suitable apply for the speaker independent task. Therefore, we hope to make use of their merits via a hybrid VQ/GMM classifier. There are many forms of GMM and other pattern classification techniques adaptation in the past. In hybrid VQ/GMM, most of them use VQ as an optimization function to reduce Expectation Maximization algorithm in order to improve the training speed (Reynolds and Rose, 1995; J. Pelecano, 2000). Besides, some researchers use GMM as a post-processor after VQ cluster the speech signal into regions (Qiguang Lin et al, 1996). In our proposed hybrid modeling, both VQ model and GMM model will run parallel after signal preprocessing process. A comparison performance of hybrid VQ/GMM, DTW, VQ, GMM and SVM techniques for speaker recognition will be done through the experiments and will reported in this chapter. This chapter is organized as follows. In Section 2, we reviews proposed speaker recognition framework. In Section 3, we discuss how we construct the hybrid modeling for pattern classification. Section 4 shows the experimental result for the comparison performance. Finally, section 5 we concludes our work. Penerbit UTM 2008 Book Section PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/24970/1/AbdMananAhmad2008_ParallelizationStrategyOfSpeakerIdentification.pdf Ahmad, Abdul Manan and Loh, Mun Yee (2008) Parallelization strategy of speaker identification system for Hybrid Modeling. In: Advance in Speaker Recognition Techniques and Technology. Penerbit UTM , Johor, 27-38 . ISBN 978-983-52-0629-0
score 13.211869