Performance of Isolated Digit Speech Recognition in Crowded Environment.

Speech recognition is a process that recognizes what the speaker says. Its objective is to extract, characterize and recognize the information in the speech signal conveying what the speaker says. One of major problems in speech recognition domain is disturbance caused by background noise. This dist...

Full description

Saved in:
Bibliographic Details
Main Author: Muhamad Arif, Hashim
Format: Thesis
Language:en
en
Published: 2007
Subjects:
Online Access:https://etd.uum.edu.my/123/1/Muhamad_Arif_Hashim.pdf
https://etd.uum.edu.my/123/2/Muhamad_Arif_Hashim-1.pdf
https://etd.uum.edu.my/123/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1833435298972827648
author Muhamad Arif, Hashim
author_facet Muhamad Arif, Hashim
author_sort Muhamad Arif, Hashim
building UUM Library
collection Institutional Repository
content_provider Universiti Utara Malaysia
content_source UUM Electronic Theses
continent Asia
country Malaysia
description Speech recognition is a process that recognizes what the speaker says. Its objective is to extract, characterize and recognize the information in the speech signal conveying what the speaker says. One of major problems in speech recognition domain is disturbance caused by background noise. This disturbance can decrease the effectiveness and reliability of the system and its accuracy. This research objective is to measure the performance of isolated digit speech recognition in crowded environment. VQSR prototype uses two kinds of distance measure: Euclidean distance and city block distance. Noisy digit speech, which is constructed from TIDigit speech database and cafeteria noise from CLSU database, is used to train and test the prototype. The prototype is also tested using real data that been recorded in a crowded and noisy cafeteria. Results of training and testing phases are recorded and compared between these two distance measures using a set of performance measurement analysis. This set includes Sensitivity, Specificity, Total Accuracy, False Acceptance Rate, False Rejection Rate and Half Total Error Rate analysis. Based on the performance measurement, a robust and reliable digit speech can be used by user that has high possibility of success and low probability in making errors. Finally, the proposed model and guideline in evaluating the digit speech performance can be use in other speech domain.
format Thesis
id my.uum.etd-123
institution Universiti Utara Malaysia
language en
en
publishDate 2007
record_format eprints
spelling my.uum.etd-1232013-07-24T12:05:40Z https://etd.uum.edu.my/123/ Performance of Isolated Digit Speech Recognition in Crowded Environment. Muhamad Arif, Hashim TK Electrical engineering. Electronics Nuclear engineering Speech recognition is a process that recognizes what the speaker says. Its objective is to extract, characterize and recognize the information in the speech signal conveying what the speaker says. One of major problems in speech recognition domain is disturbance caused by background noise. This disturbance can decrease the effectiveness and reliability of the system and its accuracy. This research objective is to measure the performance of isolated digit speech recognition in crowded environment. VQSR prototype uses two kinds of distance measure: Euclidean distance and city block distance. Noisy digit speech, which is constructed from TIDigit speech database and cafeteria noise from CLSU database, is used to train and test the prototype. The prototype is also tested using real data that been recorded in a crowded and noisy cafeteria. Results of training and testing phases are recorded and compared between these two distance measures using a set of performance measurement analysis. This set includes Sensitivity, Specificity, Total Accuracy, False Acceptance Rate, False Rejection Rate and Half Total Error Rate analysis. Based on the performance measurement, a robust and reliable digit speech can be used by user that has high possibility of success and low probability in making errors. Finally, the proposed model and guideline in evaluating the digit speech performance can be use in other speech domain. 2007-08-05 Thesis NonPeerReviewed application/pdf en https://etd.uum.edu.my/123/1/Muhamad_Arif_Hashim.pdf application/pdf en https://etd.uum.edu.my/123/2/Muhamad_Arif_Hashim-1.pdf Muhamad Arif, Hashim (2007) Performance of Isolated Digit Speech Recognition in Crowded Environment. Masters thesis, Universiti Utara Malaysia.
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Muhamad Arif, Hashim
Performance of Isolated Digit Speech Recognition in Crowded Environment.
title Performance of Isolated Digit Speech Recognition in Crowded Environment.
title_full Performance of Isolated Digit Speech Recognition in Crowded Environment.
title_fullStr Performance of Isolated Digit Speech Recognition in Crowded Environment.
title_full_unstemmed Performance of Isolated Digit Speech Recognition in Crowded Environment.
title_short Performance of Isolated Digit Speech Recognition in Crowded Environment.
title_sort performance of isolated digit speech recognition in crowded environment.
topic TK Electrical engineering. Electronics Nuclear engineering
url https://etd.uum.edu.my/123/1/Muhamad_Arif_Hashim.pdf
https://etd.uum.edu.my/123/2/Muhamad_Arif_Hashim-1.pdf
https://etd.uum.edu.my/123/
url_provider http://etd.uum.edu.my/