Web application for lipreading
Communication happens every day in our daily lives. However, there are conditions where the communication occurs in an environment which impedes the listener from listening to the message clearly. Therefore, this project aims to develop a web application that can perform lipreading using an existing...
Saved in:
Main Author: | |
---|---|
Format: | Final Year Project / Dissertation / Thesis |
Published: |
2024
|
Subjects: | |
Online Access: | http://eprints.utar.edu.my/6818/1/2005403_LAU_YEE_LIN.pdf http://eprints.utar.edu.my/6818/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-utar-eprints.6818 |
---|---|
record_format |
eprints |
spelling |
my-utar-eprints.68182024-11-21T05:17:06Z Web application for lipreading Lau, Yee Lin QA76 Computer software T Technology (General) Communication happens every day in our daily lives. However, there are conditions where the communication occurs in an environment which impedes the listener from listening to the message clearly. Therefore, this project aims to develop a web application that can perform lipreading using an existing deep learning model, LipCoordNet. It allows users to upload video to the web application and the application will generate text and video output for the users to visualize the speech instead of listening to the sounds. The users can choose to download the predicted text to their own device for future usage. Based on the output of the lipreading, the average word error rate (WER) and character error rate (CER) of an Asian speaker and a Native speaker is calculated, resulting in the average WER and CER value of the Asian speaker being higher than that of the Native speaker. To reduce the WER and CER of the sentences spoken by Asian speakers, efforts have been made in trying to train the LipCoordNet model with the Asian speakers dataset. 270 Asian speaker dataset has been collected with 27 Asian speakers speaking 10 sentences each. For the evaluation of the usability of the web application, five respondents are selected to participate in the system usability testing and contribute to the system usability scale (SUS) score. The SUS score obtained is 87.5, indicating that the system receives a grade A with the adjective rating of Excellent. 2024 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/6818/1/2005403_LAU_YEE_LIN.pdf Lau, Yee Lin (2024) Web application for lipreading. Final Year Project, UTAR. http://eprints.utar.edu.my/6818/ |
institution |
Universiti Tunku Abdul Rahman |
building |
UTAR Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Tunku Abdul Rahman |
content_source |
UTAR Institutional Repository |
url_provider |
http://eprints.utar.edu.my |
topic |
QA76 Computer software T Technology (General) |
spellingShingle |
QA76 Computer software T Technology (General) Lau, Yee Lin Web application for lipreading |
description |
Communication happens every day in our daily lives. However, there are conditions where the communication occurs in an environment which impedes the listener from listening to the message clearly. Therefore, this project aims to develop a web application that can perform lipreading using an existing deep learning model, LipCoordNet. It allows users to upload video to the web application and the application will generate text and video output for the users to visualize the speech instead of listening to the sounds. The users can choose to download the predicted text to their own device for future usage. Based on the output of the lipreading, the average word error rate (WER) and character
error rate (CER) of an Asian speaker and a Native speaker is calculated, resulting in the average WER and CER value of the Asian speaker being higher than that of the Native speaker. To reduce the WER and CER of the sentences
spoken by Asian speakers, efforts have been made in trying to train the LipCoordNet model with the Asian speakers dataset. 270 Asian speaker dataset has been collected with 27 Asian speakers speaking 10 sentences each. For the
evaluation of the usability of the web application, five respondents are selected to participate in the system usability testing and contribute to the system usability scale (SUS) score. The SUS score obtained is 87.5, indicating that the system receives a grade A with the adjective rating of Excellent.
|
format |
Final Year Project / Dissertation / Thesis |
author |
Lau, Yee Lin |
author_facet |
Lau, Yee Lin |
author_sort |
Lau, Yee Lin |
title |
Web application for lipreading |
title_short |
Web application for lipreading |
title_full |
Web application for lipreading |
title_fullStr |
Web application for lipreading |
title_full_unstemmed |
Web application for lipreading |
title_sort |
web application for lipreading |
publishDate |
2024 |
url |
http://eprints.utar.edu.my/6818/1/2005403_LAU_YEE_LIN.pdf http://eprints.utar.edu.my/6818/ |
_version_ |
1817849302470361088 |
score |
13.223943 |