End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application

In this paper, an FPT.AI-based text-to-speech (TTS) application is developed that converts Vietnamese text into spoken words. The application is developed based on Django for Python and in the form of an interactive web page which is connected to an FPT.AI server through its application programming...

Full description

Saved in:
Bibliographic Details
Main Authors: Chung, T.D., Drieberg, M., Bin Hassan, M.F., Khalyasmaa, A.
Format: Conference or Workshop Item
Published: Institute of Electrical and Electronics Engineers Inc. 2020
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85085167676&doi=10.1109%2fLifeTech48969.2020.1570620448&partnerID=40&md5=69ffc5e130e473e9d8519397ab61a341
http://eprints.utp.edu.my/24638/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utp.eprints.24638
record_format eprints
spelling my.utp.eprints.246382021-08-27T06:13:27Z End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application Chung, T.D. Drieberg, M. Bin Hassan, M.F. Khalyasmaa, A. In this paper, an FPT.AI-based text-to-speech (TTS) application is developed that converts Vietnamese text into spoken words. The application is developed based on Django for Python and in the form of an interactive web page which is connected to an FPT.AI server through its application programming interface (API). The application supports conversion of text to seven different Vietnamese speeches. Four out of seven voices can be used to convert up to 500 characters in a single transaction while the others support that of 400 characters. Based on the results obtained, the first conversion time takes up to 10 s to convert 400-character text into speech while the subsequent times, given same text, it takes under 1.8 s for the conversion. This is applicable to all voices. © 2020 IEEE. Institute of Electrical and Electronics Engineers Inc. 2020 Conference or Workshop Item NonPeerReviewed https://www.scopus.com/inward/record.uri?eid=2-s2.0-85085167676&doi=10.1109%2fLifeTech48969.2020.1570620448&partnerID=40&md5=69ffc5e130e473e9d8519397ab61a341 Chung, T.D. and Drieberg, M. and Bin Hassan, M.F. and Khalyasmaa, A. (2020) End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application. In: UNSPECIFIED. http://eprints.utp.edu.my/24638/
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description In this paper, an FPT.AI-based text-to-speech (TTS) application is developed that converts Vietnamese text into spoken words. The application is developed based on Django for Python and in the form of an interactive web page which is connected to an FPT.AI server through its application programming interface (API). The application supports conversion of text to seven different Vietnamese speeches. Four out of seven voices can be used to convert up to 500 characters in a single transaction while the others support that of 400 characters. Based on the results obtained, the first conversion time takes up to 10 s to convert 400-character text into speech while the subsequent times, given same text, it takes under 1.8 s for the conversion. This is applicable to all voices. © 2020 IEEE.
format Conference or Workshop Item
author Chung, T.D.
Drieberg, M.
Bin Hassan, M.F.
Khalyasmaa, A.
spellingShingle Chung, T.D.
Drieberg, M.
Bin Hassan, M.F.
Khalyasmaa, A.
End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application
author_facet Chung, T.D.
Drieberg, M.
Bin Hassan, M.F.
Khalyasmaa, A.
author_sort Chung, T.D.
title End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application
title_short End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application
title_full End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application
title_fullStr End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application
title_full_unstemmed End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application
title_sort end-to-end conversion speed analysis of an fpt.ai-based text-to-speech application
publisher Institute of Electrical and Electronics Engineers Inc.
publishDate 2020
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85085167676&doi=10.1109%2fLifeTech48969.2020.1570620448&partnerID=40&md5=69ffc5e130e473e9d8519397ab61a341
http://eprints.utp.edu.my/24638/
_version_ 1738656618262822912
score 13.211869