Show simple item record

dc.contributor.authorPopovic, Maja
dc.contributor.authorArcan, Mihael
dc.date.accessioned2019-01-31T14:20:08Z
dc.date.available2019-01-31T14:20:08Z
dc.date.issued2016-05-23
dc.identifier.citationPopovic, Maja, & Arcan, Mihael. (2016). PE2rr corpus: manual error annotation of automatically pre-annotated MT post-edits. Paper presented at the LREC 2016, Tenth International Conference on Language Resources and Evaluation, Portorož, Slovenia, 23-28 May.en_IE
dc.identifier.urihttp://hdl.handle.net/10379/14894
dc.description.abstractWe present a freely available corpus containing source language texts from different domains along with their automatically generated translations into several distinct morphologically rich languages, their post-edited versions, and error annotations of the performed post-edit operations. We believe that the corpus will be useful for many different applications. The main advantage of the approach used for creation of the corpus is the fusion of post-editing and error classification tasks, which have usually been seen as two independent tasks, although naturally they are not. We also show benefits of coupling automatic and manual error classification which facilitates the complex manual error annotation task as well as the development of automatic error classification tools. In addition, the approach facilitates annotation of language pair related issues.en_IE
dc.description.sponsorshipThis publication has emanated from research supported by TRAMOOC project (Translation for Massive Open Online Courses) partially funded by the European Commission under H2020-ICT-2014/H2020-ICT-2014-1 under grant agreement number 644333 and by the Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289 (Insight).en_IE
dc.formatapplication/pdfen_IE
dc.language.isoenen_IE
dc.publisherEuropean Language Resources Associationen_IE
dc.relation.ispartofLanguage Resource and Evaluation Conference (LREC)en
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/3.0/ie/
dc.subjectPE2rr corpusen_IE
dc.subjectError annotationen_IE
dc.subjectMT post-editsen_IE
dc.subjectPre-annotateden_IE
dc.subjectMachine translationen_IE
dc.subjectPost-editingen_IE
dc.subjectError annotationen_IE
dc.titlePE2 rr corpus: manual error annotation of automatically pre-annotated MT post-editsen_IE
dc.typeConference Paperen_IE
dc.date.updated2019-01-23T17:51:06Z
dc.local.publishedsourcehttp://www.lrec-conf.org/proceedings/lrec2016/summaries/405.htmlen_IE
dc.description.peer-reviewednon-peer-reviewed
dc.contributor.funderHorizon 2020en_IE
dc.contributor.funderScience Foundation Irelanden_IE
dc.internal.rssid13192029
dc.local.contactMihael Arcan. Email: mihael.arcan@insight-centre.org
dc.local.copyrightcheckedYes
dc.local.versionPUBLISHED
dcterms.projectinfo:eu-repo/grantAgreement/EC/H2020::IA/644333/EU/Translation for Massive Open Online Courses/TraMOOCen_IE
dcterms.projectinfo:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/en_IE
nui.item.downloads54


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland