Speech recognition system using MATLAB : design, implementation, and samples codes
Research in automatic speech recognition has been done for almost four decades. Over the past decades, the development of speech recognition applications gives invaluable contributions. Speech has the potential to be a better interface than other computing devices used such as keyboard or mouse. Thi...
Saved in:
Main Authors: | , |
---|---|
Format: | Book |
Language: | English |
Published: |
Lambert Academic Publishing
2011
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/27200/1/Speech_Recognition.pdf http://irep.iium.edu.my/27200/ https://www.morebooks.de/store/gb/book/speech-recognition-system-using-matlab/isbn/978-3-8465-0376-8 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.iium.irep.27200 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.272002013-02-06T02:29:36Z http://irep.iium.edu.my/27200/ Speech recognition system using MATLAB : design, implementation, and samples codes Abushariah, Ahmad A. M. Gunawan, Teddy Surya TK5101 Telecommunication. Including telegraphy, radio, radar, television Research in automatic speech recognition has been done for almost four decades. Over the past decades, the development of speech recognition applications gives invaluable contributions. Speech has the potential to be a better interface than other computing devices used such as keyboard or mouse. This project aims to develop automated English digits speech recognition system. The project relies heavily on the well known and widely used statistical method in characterizing the speech pattern, the Hidden Markov Model (HMM), which provides a highly reliable way for recognizing speech. This project discusses the theory of HMM and then extends the ideas to the development and implementation by applying this method in computational speech recognition. Basically, the system is able to recognize the spoken utterances by translating the speech waveform into a set of feature vectors using Mel Frequency Cepstral Coefficients (MFCC) technique, which then estimates the observation likelihood by using the Forward algorithm. The HMM parameters are estimated by applying the Baum Welch algorithm on previously trained samples. The most likely sequence is then decoded using Viterbi algorithm, thus producing the recognized word. This project focuses on all English digits from (Zero through Nine), which is based on isolated words structure. Two modules were developed, namely the isolated words speech recognition and the continuous speech recognition. Both modules were tested in both clean and noisy environments and showed relatively successful recognition rates. In clean environment and isolated words speech recognition module, the multi-speaker mode achieved 99.5% whereas the speaker-independent mode achieved 79.5%. In clean environment and continuous speech recognition module, the multi-speaker mode achieved 70% whereas the speaker-independent mode achieved 55%. However in noisy environment and isolated words speech recognition module, the multi-speaker mode achieved 88% whereas the speaker-independent mode achieved 67%. In noisy environment and continuous speech recognition module, the multi-speaker mode achieved 92.5% whereas the speaker-independent mode achieved 75%. These recognition rates are relatively successful if compared to similar systems. Lambert Academic Publishing 2011 Book REM application/pdf en http://irep.iium.edu.my/27200/1/Speech_Recognition.pdf Abushariah, Ahmad A. M. and Gunawan, Teddy Surya (2011) Speech recognition system using MATLAB : design, implementation, and samples codes. Lambert Academic Publishing, Saarbrucken, Germany. ISBN 978-3-8465-0376-8 https://www.morebooks.de/store/gb/book/speech-recognition-system-using-matlab/isbn/978-3-8465-0376-8 |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English |
topic |
TK5101 Telecommunication. Including telegraphy, radio, radar, television |
spellingShingle |
TK5101 Telecommunication. Including telegraphy, radio, radar, television Abushariah, Ahmad A. M. Gunawan, Teddy Surya Speech recognition system using MATLAB : design, implementation, and samples codes |
description |
Research in automatic speech recognition has been done for almost four decades. Over the past decades, the development of speech recognition applications gives invaluable contributions. Speech has the potential to be a better interface than other computing devices used such as keyboard or mouse. This project aims to develop automated English digits speech recognition system. The project relies heavily on the well known and widely used statistical method in characterizing the speech pattern, the Hidden Markov Model (HMM), which provides a highly reliable way for recognizing speech. This project discusses the theory of HMM and then extends the ideas to the development and implementation by applying this method in computational speech recognition. Basically, the system is able to recognize the spoken utterances by translating the speech waveform into a set of feature vectors using Mel Frequency Cepstral Coefficients (MFCC) technique, which then estimates the observation likelihood by using the Forward algorithm. The HMM parameters are estimated by applying the Baum Welch algorithm on previously trained samples. The most likely sequence is then decoded using Viterbi algorithm, thus producing the recognized word. This project focuses on all English digits from (Zero through Nine), which is based on isolated words structure. Two modules were developed, namely the isolated words speech recognition and the continuous speech recognition. Both modules were tested in both clean and noisy environments and showed relatively successful recognition rates. In clean environment and isolated words speech recognition module, the multi-speaker mode achieved 99.5% whereas the speaker-independent mode achieved 79.5%. In clean environment and continuous speech recognition module, the multi-speaker mode achieved 70% whereas the speaker-independent mode achieved 55%. However in noisy environment and isolated words speech recognition module, the multi-speaker mode achieved 88% whereas the speaker-independent mode achieved 67%. In noisy environment and continuous speech recognition module, the multi-speaker mode achieved 92.5% whereas the speaker-independent mode achieved 75%. These recognition rates are relatively successful if compared to similar systems. |
format |
Book |
author |
Abushariah, Ahmad A. M. Gunawan, Teddy Surya |
author_facet |
Abushariah, Ahmad A. M. Gunawan, Teddy Surya |
author_sort |
Abushariah, Ahmad A. M. |
title |
Speech recognition system using MATLAB : design, implementation, and samples codes |
title_short |
Speech recognition system using MATLAB : design, implementation, and samples codes |
title_full |
Speech recognition system using MATLAB : design, implementation, and samples codes |
title_fullStr |
Speech recognition system using MATLAB : design, implementation, and samples codes |
title_full_unstemmed |
Speech recognition system using MATLAB : design, implementation, and samples codes |
title_sort |
speech recognition system using matlab : design, implementation, and samples codes |
publisher |
Lambert Academic Publishing |
publishDate |
2011 |
url |
http://irep.iium.edu.my/27200/1/Speech_Recognition.pdf http://irep.iium.edu.my/27200/ https://www.morebooks.de/store/gb/book/speech-recognition-system-using-matlab/isbn/978-3-8465-0376-8 |
_version_ |
1643609290639933440 |
score |
13.211869 |