Speaker Independent Speech Recognition Using Neural Network

In spite of the advances accomplished throughout the last few decades, automatic speech recognition (ASR) is still a challenging and difficult task when the systems are applied in the real world. Different requirements for various applications drive the researchers to explore for more effective w...

Full description

Saved in:
Bibliographic Details
Main Author: Tan, Chin Luh
Format: Thesis
Language:en
Published: 2004
Subjects:
Online Access:http://psasir.upm.edu.my/id/eprint/37/1/1000548949_t_FK_2004_90.pdf
http://psasir.upm.edu.my/id/eprint/37/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832509482249224192
author Tan, Chin Luh
author_facet Tan, Chin Luh
author_sort Tan, Chin Luh
building UPM Library
collection Institutional Repository
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
continent Asia
country Malaysia
description In spite of the advances accomplished throughout the last few decades, automatic speech recognition (ASR) is still a challenging and difficult task when the systems are applied in the real world. Different requirements for various applications drive the researchers to explore for more effective ways in the particular application. Attempts to apply artificial neural networks (ANN) as a classification tool are proposed to increase the reliability of the system. This project studies the approach of using neural network for speaker independent isolated word recognition on small vocabularies and proposes a method to have a simple MLP as speech recognizer. Our approach is able to overcome the current limitations of MLP in the selection of input buffers’ size by proposing a method on frames selection. Linear predictive coding (LPC) has been applied to represent speech signal in frames in early stage. Features from the selected frames are used to train the multilayer perceptrons (MLP) feedforward back-propagation (FFBP) neural network during the training stage. Same routine has been applied to the speech signal during the recognition stage and the unknown test pattern will be classified to one of the nearest pattern. In short, the selected frames represent the local features of the speech signal and all of them contribute to the global similarity for the whole speech signal. The analysis, design and the PC based voice dialling system is developed using MATLAB®.
format Thesis
id my.upm.eprints-37
institution Universiti Putra Malaysia
language en
publishDate 2004
record_format eprints
spelling my.upm.eprints-372015-08-06T01:49:29Z http://psasir.upm.edu.my/id/eprint/37/ Speaker Independent Speech Recognition Using Neural Network Tan, Chin Luh In spite of the advances accomplished throughout the last few decades, automatic speech recognition (ASR) is still a challenging and difficult task when the systems are applied in the real world. Different requirements for various applications drive the researchers to explore for more effective ways in the particular application. Attempts to apply artificial neural networks (ANN) as a classification tool are proposed to increase the reliability of the system. This project studies the approach of using neural network for speaker independent isolated word recognition on small vocabularies and proposes a method to have a simple MLP as speech recognizer. Our approach is able to overcome the current limitations of MLP in the selection of input buffers’ size by proposing a method on frames selection. Linear predictive coding (LPC) has been applied to represent speech signal in frames in early stage. Features from the selected frames are used to train the multilayer perceptrons (MLP) feedforward back-propagation (FFBP) neural network during the training stage. Same routine has been applied to the speech signal during the recognition stage and the unknown test pattern will be classified to one of the nearest pattern. In short, the selected frames represent the local features of the speech signal and all of them contribute to the global similarity for the whole speech signal. The analysis, design and the PC based voice dialling system is developed using MATLAB®. 2004-12 Thesis NonPeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/37/1/1000548949_t_FK_2004_90.pdf Tan, Chin Luh (2004) Speaker Independent Speech Recognition Using Neural Network. Masters thesis, Universiti Putra Malaysia. Neural Network
spellingShingle Neural Network
Tan, Chin Luh
Speaker Independent Speech Recognition Using Neural Network
title Speaker Independent Speech Recognition Using Neural Network
title_full Speaker Independent Speech Recognition Using Neural Network
title_fullStr Speaker Independent Speech Recognition Using Neural Network
title_full_unstemmed Speaker Independent Speech Recognition Using Neural Network
title_short Speaker Independent Speech Recognition Using Neural Network
title_sort speaker independent speech recognition using neural network
topic Neural Network
url http://psasir.upm.edu.my/id/eprint/37/1/1000548949_t_FK_2004_90.pdf
http://psasir.upm.edu.my/id/eprint/37/
url_provider http://psasir.upm.edu.my/