Web application for lipreading

Communication happens every day in our daily lives. However, there are conditions where the communication occurs in an environment which impedes the listener from listening to the message clearly. Therefore, this project aims to develop a web application that can perform lipreading using an existing...

Full description

Saved in:
Bibliographic Details
Main Author: Lau, Yee Lin
Format: Final Year Project / Dissertation / Thesis
Published: 2024
Subjects:
Online Access:http://eprints.utar.edu.my/6818/1/2005403_LAU_YEE_LIN.pdf
http://eprints.utar.edu.my/6818/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utar-eprints.6818
record_format eprints
spelling my-utar-eprints.68182024-11-21T05:17:06Z Web application for lipreading Lau, Yee Lin QA76 Computer software T Technology (General) Communication happens every day in our daily lives. However, there are conditions where the communication occurs in an environment which impedes the listener from listening to the message clearly. Therefore, this project aims to develop a web application that can perform lipreading using an existing deep learning model, LipCoordNet. It allows users to upload video to the web application and the application will generate text and video output for the users to visualize the speech instead of listening to the sounds. The users can choose to download the predicted text to their own device for future usage. Based on the output of the lipreading, the average word error rate (WER) and character error rate (CER) of an Asian speaker and a Native speaker is calculated, resulting in the average WER and CER value of the Asian speaker being higher than that of the Native speaker. To reduce the WER and CER of the sentences spoken by Asian speakers, efforts have been made in trying to train the LipCoordNet model with the Asian speakers dataset. 270 Asian speaker dataset has been collected with 27 Asian speakers speaking 10 sentences each. For the evaluation of the usability of the web application, five respondents are selected to participate in the system usability testing and contribute to the system usability scale (SUS) score. The SUS score obtained is 87.5, indicating that the system receives a grade A with the adjective rating of Excellent. 2024 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/6818/1/2005403_LAU_YEE_LIN.pdf Lau, Yee Lin (2024) Web application for lipreading. Final Year Project, UTAR. http://eprints.utar.edu.my/6818/
institution Universiti Tunku Abdul Rahman
building UTAR Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tunku Abdul Rahman
content_source UTAR Institutional Repository
url_provider http://eprints.utar.edu.my
topic QA76 Computer software
T Technology (General)
spellingShingle QA76 Computer software
T Technology (General)
Lau, Yee Lin
Web application for lipreading
description Communication happens every day in our daily lives. However, there are conditions where the communication occurs in an environment which impedes the listener from listening to the message clearly. Therefore, this project aims to develop a web application that can perform lipreading using an existing deep learning model, LipCoordNet. It allows users to upload video to the web application and the application will generate text and video output for the users to visualize the speech instead of listening to the sounds. The users can choose to download the predicted text to their own device for future usage. Based on the output of the lipreading, the average word error rate (WER) and character error rate (CER) of an Asian speaker and a Native speaker is calculated, resulting in the average WER and CER value of the Asian speaker being higher than that of the Native speaker. To reduce the WER and CER of the sentences spoken by Asian speakers, efforts have been made in trying to train the LipCoordNet model with the Asian speakers dataset. 270 Asian speaker dataset has been collected with 27 Asian speakers speaking 10 sentences each. For the evaluation of the usability of the web application, five respondents are selected to participate in the system usability testing and contribute to the system usability scale (SUS) score. The SUS score obtained is 87.5, indicating that the system receives a grade A with the adjective rating of Excellent.
format Final Year Project / Dissertation / Thesis
author Lau, Yee Lin
author_facet Lau, Yee Lin
author_sort Lau, Yee Lin
title Web application for lipreading
title_short Web application for lipreading
title_full Web application for lipreading
title_fullStr Web application for lipreading
title_full_unstemmed Web application for lipreading
title_sort web application for lipreading
publishDate 2024
url http://eprints.utar.edu.my/6818/1/2005403_LAU_YEE_LIN.pdf
http://eprints.utar.edu.my/6818/
_version_ 1817849302470361088
score 13.223943