Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum

Speech is structured in cohesive sequences of information units defined with unique speaker-related intonation patterns, known as prosody. To capture the communicative intention of a speaker, listeners first focus on the change of rising and/or falling prosody patterns (known as speech events) on wo...

Full description

Saved in:
Bibliographic Details
Main Author: Mohamed Hanum, Haslizatul Fairuz
Format: Thesis
Language:English
Published: 2021
Subjects:
Online Access:https://ir.uitm.edu.my/id/eprint/54987/1/54987.pdf
https://ir.uitm.edu.my/id/eprint/54987/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.uitm.ir.54987
record_format eprints
spelling my.uitm.ir.549872022-02-16T04:57:52Z https://ir.uitm.edu.my/id/eprint/54987/ Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum Mohamed Hanum, Haslizatul Fairuz Language. Linguistic theory. Comparative grammar Phonology. Phonetics Speech is structured in cohesive sequences of information units defined with unique speaker-related intonation patterns, known as prosody. To capture the communicative intention of a speaker, listeners first focus on the change of rising and/or falling prosody patterns (known as speech events) on words and phrasing to capture the degree of information conveyed. Secondly, into national breaks are used by speakers to convey speech content in a sequence of smaller sub-segments known as phrasing. Incorrect perception of the events and boundaries causes the communicative intention to be falsely understood by listeners. Thus, previous works evaluate the role of boundaries in languages like English, Swedish, Japanese, and Chinese to help automatic speech recognition and understanding domain. The main issue addressed by previous work on boundary classification is the disagreement on prosody event labeling. Boundaries are falsely classified due to limitations on expert evaluation of prosody events and lack of a prosody standard. Also, evaluating how the pause and intonation variations discriminate between a phrase with a continual content from other subsequent phrases that carry the end (final) content is still an open question. Limited work on the role of prosody is often defined based on specific language lexical rules, which degrades understanding of phrasing and speech content for the under-resourced language. As related research are limited to prosody stress indicating prominence on individual Malay words; thus, processes for defining the prosody governing Malay phrase boundaries are essential. This research aims to evaluate the correlates of prosody on boundaries with a deep (finality) or a shallow (continual) content. Dataset for the boundary classification task contains formal statements recorded from the debate session from the Malaysian Parliamentary Speech recordings. In the first phase, a refined set of Rapid Phrase Prosody Tasks (RPIT) instructions labelled the speech signal with perceived continual or final content. Responses from four male and four female volunteers are analysed using KAPPA and Krippendorff analysis to construct a Malay phrase (MySP) boundaries dataset with an average of 85% agreement on perceived boundary labels from the first research phase. Pitch regression and the rise-fall-connection (RFC) contour parameters are used to model the role of prosody on the perceived boundaries. A new phrase strength correlates computed as slope feature from the highest nuclei point towards the end of the boundary word region. The role of each vector combined with the word and silence durations to signify each deep (final) and shallow (continual) boundary is tested with supervised classifiers. The supervised K-Nearest Neighbour (KNN), Random Forest (RF), and Logistic Regression (Log-Reg) models predicted the boundary classes with up to 75% accuracy. A higher degree of slope excursion is observed on boundaries that the listeners perceived as the deep boundaries (evaluated with finality in speech content through the RPIT) and used to improve classification results on up to 20% of the falsely classified boundaries. This study contributes to the Malay prosody knowledge and classification of phrasing the boundaries 2021-10 Thesis NonPeerReviewed text en https://ir.uitm.edu.my/id/eprint/54987/1/54987.pdf ID54987 Mohamed Hanum, Haslizatul Fairuz (2021) Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum. PhD thesis, thesis, Universiti Teknologi MARA.
institution Universiti Teknologi Mara
building Tun Abdul Razak Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Mara
content_source UiTM Institutional Repository
url_provider http://ir.uitm.edu.my/
language English
topic Language. Linguistic theory. Comparative grammar
Phonology. Phonetics
spellingShingle Language. Linguistic theory. Comparative grammar
Phonology. Phonetics
Mohamed Hanum, Haslizatul Fairuz
Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
description Speech is structured in cohesive sequences of information units defined with unique speaker-related intonation patterns, known as prosody. To capture the communicative intention of a speaker, listeners first focus on the change of rising and/or falling prosody patterns (known as speech events) on words and phrasing to capture the degree of information conveyed. Secondly, into national breaks are used by speakers to convey speech content in a sequence of smaller sub-segments known as phrasing. Incorrect perception of the events and boundaries causes the communicative intention to be falsely understood by listeners. Thus, previous works evaluate the role of boundaries in languages like English, Swedish, Japanese, and Chinese to help automatic speech recognition and understanding domain. The main issue addressed by previous work on boundary classification is the disagreement on prosody event labeling. Boundaries are falsely classified due to limitations on expert evaluation of prosody events and lack of a prosody standard. Also, evaluating how the pause and intonation variations discriminate between a phrase with a continual content from other subsequent phrases that carry the end (final) content is still an open question. Limited work on the role of prosody is often defined based on specific language lexical rules, which degrades understanding of phrasing and speech content for the under-resourced language. As related research are limited to prosody stress indicating prominence on individual Malay words; thus, processes for defining the prosody governing Malay phrase boundaries are essential. This research aims to evaluate the correlates of prosody on boundaries with a deep (finality) or a shallow (continual) content. Dataset for the boundary classification task contains formal statements recorded from the debate session from the Malaysian Parliamentary Speech recordings. In the first phase, a refined set of Rapid Phrase Prosody Tasks (RPIT) instructions labelled the speech signal with perceived continual or final content. Responses from four male and four female volunteers are analysed using KAPPA and Krippendorff analysis to construct a Malay phrase (MySP) boundaries dataset with an average of 85% agreement on perceived boundary labels from the first research phase. Pitch regression and the rise-fall-connection (RFC) contour parameters are used to model the role of prosody on the perceived boundaries. A new phrase strength correlates computed as slope feature from the highest nuclei point towards the end of the boundary word region. The role of each vector combined with the word and silence durations to signify each deep (final) and shallow (continual) boundary is tested with supervised classifiers. The supervised K-Nearest Neighbour (KNN), Random Forest (RF), and Logistic Regression (Log-Reg) models predicted the boundary classes with up to 75% accuracy. A higher degree of slope excursion is observed on boundaries that the listeners perceived as the deep boundaries (evaluated with finality in speech content through the RPIT) and used to improve classification results on up to 20% of the falsely classified boundaries. This study contributes to the Malay prosody knowledge and classification of phrasing the boundaries
format Thesis
author Mohamed Hanum, Haslizatul Fairuz
author_facet Mohamed Hanum, Haslizatul Fairuz
author_sort Mohamed Hanum, Haslizatul Fairuz
title Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_short Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_full Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_fullStr Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_full_unstemmed Prosodic contours for classification of perceived boundaries on Malay phrasing / Haslizatul Fairuz Mohamed Hanum
title_sort prosodic contours for classification of perceived boundaries on malay phrasing / haslizatul fairuz mohamed hanum
publishDate 2021
url https://ir.uitm.edu.my/id/eprint/54987/1/54987.pdf
https://ir.uitm.edu.my/id/eprint/54987/
_version_ 1725975735330406400
score 13.211869