Evaluating Google Neural Machine Translation from Chinese to English: technical vs. literary texts

As the global need for translation increases, machine translation (MT) has significantly enhanced the efficiency in facilitating information dissemination and cross-cultural communication. However, its quality remains bound by intrinsic limitations among language pairs and text genres. These d...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zhongming Zhang, Syed Nurulakla Syed Abdullah, Muhammad Alif Redzuan Abdullah, Wenqi Duan
Format:	Article
Language:	en
Published:	Penerbit Universiti Kebangsaan Malaysia 2025
Online Access:	http://journalarticle.ukm.my/26280/1/Gema_Online_25_3_9.pdf http://journalarticle.ukm.my/26280/ https://ejournal.ukm.my/gema/issue/view/1852
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	As the global need for translation increases, machine translation (MT) has significantly enhanced the efficiency in facilitating information dissemination and cross-cultural communication. However, its quality remains bound by intrinsic limitations among language pairs and text genres. These discrepancies lead to distinct MT performance when processing technical and literary texts, forming the core gap and focus. This study aims to compare the quality of Google Neural Machine Translation (GNMT) in literary and technical texts, investigating error disparities and establishing the abilities and limits of MT across diverse linguistic contexts. The research was concerned with the English-Chinese language pair with the Multidimensional Quality Metrics (MQM) framework for manual annotation. The COMET automatic evaluation metric was also applied for validation and confirmation of quality differences observed. This study selected five excerpts from Apple product manuals (33 aligned units) and the novel, the Old Man and Sea (32 aligned units), respectively. Findings included (1) GNMT performed well with technical texts, but acted less effective with literary texts and technical texts exhibited notable terminology errors, whereas literary texts showed more stylistic inconsistencies; (2) MQM scores demonstrated a remarkable difference, with technical texts outperforming literary texts by 18.57%; and (3) COMET evaluation validated the above observations, confirming a significant difference between the two text styles. Although GNMT faced challenges with both texts, the quality remained acceptable within this study. Results recommend improving GNMT algorithms to enhance accuracy and remedy error patterns and distributions.

Evaluating Google Neural Machine Translation from Chinese to English: technical vs. literary texts

Similar Items