A bi-annotated Malay-English code-switching (Manglish) dataset of X posts for biological gender identification and authorship attribution
Low-resource languages, like Malay, face the threat of extinction when linguistic resources become scarce. This paper addresses the scarcity issue by contributing to the inventory of low-resource languages, specifically focusing on Malay-English, known as Manglish. Manglish speakers are primarily...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: | Elsevier 2024 |
Online Access: | http://eprints.uthm.edu.my/10920/1/J17377_a3b15f369ba6e61ca5517eaf40899173.pdf http://eprints.uthm.edu.my/10920/ https://doi.org/10.1016/j.dib.2024.110034 |