MARC View: EP-Poland : building a bilingual parallel corpus for interpreting research

This paper reports on the process of building the EP-Poland corpus and on the first empirical applications thereof. This extensive bidirectional English-Polish corpus of original parliamentary contributions paired with professional simultaneous interpretations includes 11 European Parliament deba...

Bibliographic Details
Main Authors:	Bartłomiejczyk, Magdalena; Gumul, Ewa; Koržinek, Danijel
Format:	Article
Language:	English
Published:	Penerbit Universiti Kebangsaan Malaysia 2022
Online Access:	http://journalarticle.ukm.my/18572/1/46647-178358-1-PB.pdf http://journalarticle.ukm.my/18572/ https://ejournal.ukm.my/gema/issue/view/1467

id	my-ukm.journal.18572
record_format	eprints
spelling	my-ukm.journal.185722022-05-11T07:10:58Z http://journalarticle.ukm.my/18572/ EP-Poland : building a bilingual parallel corpus for interpreting research Bartłomiejczyk, Magdalena Gumul, Ewa Koržinek, Danijel This paper reports on the process of building the EP-Poland corpus and on the first empirical applications thereof. This extensive bidirectional English-Polish corpus of original parliamentary contributions paired with professional simultaneous interpretations includes 11 European Parliament debates held between January 2016 and February 2020. The main topic of these debates is the rule of law crisis triggered by the Law and Justice government in Poland. The corpus contains over 157,000 tokens and about 20 h 45 min of recordings, counting both source and target texts. The two interpreting directions (English-Polish and Polish-English) are represented almost evenly. The annotation of the corpus completed so far includes mark-up information, POS tagging, labelling disfluency phenomena, and all forms of explicitating shifts. Manual annotation for personal deixis is in progress. An additional interesting feature is the speaker identification performed employing the X-vector method, which allowed us to identify 36 interpreters. We begin with an overview of the existing interpreting corpora. Then we proceed to explain the design features of the EP-Poland and report on two completed empirical studies analysing idiosyncratic interpreting behaviour. We conclude by outlining future development pathways and offering some remarks on corpus significance and its limitations. Penerbit Universiti Kebangsaan Malaysia 2022-02 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/18572/1/46647-178358-1-PB.pdf Bartłomiejczyk, Magdalena and Gumul, Ewa and Koržinek, Danijel (2022) EP-Poland : building a bilingual parallel corpus for interpreting research. GEMA ; Online Journal of Language Studies, 22 (1). pp. 110-126. 
ISSN 1675-8021 https://ejournal.ukm.my/gema/issue/view/1467
institution	Universiti Kebangsaan Malaysia
building	Tun Sri Lanang Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Kebangsaan Malaysia
content_source	UKM Journal Article Repository
url_provider	http://journalarticle.ukm.my/
language	English
description	This paper reports on the process of building the EP-Poland corpus and on the first empirical applications thereof. This extensive bidirectional English-Polish corpus of original parliamentary contributions paired with professional simultaneous interpretations includes 11 European Parliament debates held between January 2016 and February 2020. The main topic of these debates is the rule of law crisis triggered by the Law and Justice government in Poland. The corpus contains over 157,000 tokens and about 20 h 45 min of recordings, counting both source and target texts. The two interpreting directions (English-Polish and Polish-English) are represented almost evenly. The annotation of the corpus completed so far includes mark-up information, POS tagging, labelling disfluency phenomena, and all forms of explicitating shifts. Manual annotation for personal deixis is in progress. An additional interesting feature is the speaker identification performed employing the X-vector method, which allowed us to identify 36 interpreters. We begin with an overview of the existing interpreting corpora. Then we proceed to explain the design features of the EP-Poland and report on two completed empirical studies analysing idiosyncratic interpreting behaviour. We conclude by outlining future development pathways and offering some remarks on corpus significance and its limitations.
format	Article
author	Bartłomiejczyk, Magdalena Gumul, Ewa Koržinek, Danijel
spellingShingle	Bartłomiejczyk, Magdalena Gumul, Ewa Koržinek, Danijel EP-Poland : building a bilingual parallel corpus for interpreting research
author_facet	Bartłomiejczyk, Magdalena Gumul, Ewa Koržinek, Danijel
author_sort	Bartłomiejczyk, Magdalena
title	EP-Poland : building a bilingual parallel corpus for interpreting research
title_short	EP-Poland : building a bilingual parallel corpus for interpreting research
title_full	EP-Poland : building a bilingual parallel corpus for interpreting research
title_fullStr	EP-Poland : building a bilingual parallel corpus for interpreting research
title_full_unstemmed	EP-Poland : building a bilingual parallel corpus for interpreting research
title_sort	ep-poland : building a bilingual parallel corpus for interpreting research
publisher	Penerbit Universiti Kebangsaan Malaysia
publishDate	2022
url	http://journalarticle.ukm.my/18572/1/46647-178358-1-PB.pdf http://journalarticle.ukm.my/18572/ https://ejournal.ukm.my/gema/issue/view/1467
_version_	1732946464512933888
score	13.251813
