dc.contributor.author | Popovic, Maja | |
dc.contributor.author | Arcan, Mihael | |
dc.date.accessioned | 2019-01-31T14:20:08Z | |
dc.date.available | 2019-01-31T14:20:08Z | |
dc.date.issued | 2016-05-23 | |
dc.identifier.citation | Popovic, Maja, & Arcan, Mihael. (2016). PE2rr corpus: manual error annotation of automatically pre-annotated MT post-edits. Paper presented at the LREC 2016, Tenth International Conference on Language Resources and Evaluation, Portorož, Slovenia, 23-28 May. | en_IE |
dc.identifier.uri | http://hdl.handle.net/10379/14894 | |
dc.description.abstract | We present a freely available corpus containing source language texts from different domains along with their automatically generated translations into several distinct morphologically rich languages, their post-edited versions, and error annotations of the performed post-edit operations. We believe that the corpus will be useful for many different applications. The main advantage of the approach used for creation of the corpus is the fusion of post-editing and error classification tasks, which have usually been seen as two independent tasks, although naturally they are not. We also show benefits of coupling automatic and manual error classification which facilitates the complex manual error annotation task as well as the development of automatic error classification tools. In addition, the approach facilitates annotation of language pair related issues. | en_IE |
dc.description.sponsorship | This publication has emanated from research supported by TRAMOOC project (Translation for Massive Open Online Courses) partially funded by the European Commission under H2020-ICT-2014/H2020-ICT-2014-1 under grant agreement number 644333 and by the Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289 (Insight). | en_IE |
dc.format | application/pdf | en_IE |
dc.language.iso | en | en_IE |
dc.publisher | European Language Resources Association | en_IE |
dc.relation.ispartof | Language Resources and Evaluation Conference (LREC) | en |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Ireland | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/3.0/ie/ | |
dc.subject | PE2rr corpus | en_IE |
dc.subject | Error annotation | en_IE |
dc.subject | MT post-edits | en_IE |
dc.subject | Pre-annotated | en_IE |
dc.subject | Machine translation | en_IE |
dc.subject | Post-editing | en_IE |
dc.title | PE2rr corpus: manual error annotation of automatically pre-annotated MT post-edits | en_IE |
dc.type | Conference Paper | en_IE |
dc.date.updated | 2019-01-23T17:51:06Z | |
dc.local.publishedsource | http://www.lrec-conf.org/proceedings/lrec2016/summaries/405.html | en_IE |
dc.description.peer-reviewed | non-peer-reviewed | |
dc.contributor.funder | Horizon 2020 | en_IE |
dc.contributor.funder | Science Foundation Ireland | en_IE |
dc.internal.rssid | 13192029 | |
dc.local.contact | Mihael Arcan. Email: mihael.arcan@insight-centre.org | |
dc.local.copyrightchecked | Yes | |
dc.local.version | PUBLISHED | |
dcterms.project | info:eu-repo/grantAgreement/EC/H2020::IA/644333/EU/Translation for Massive Open Online Courses/TraMOOC | en_IE |
dcterms.project | info:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/ | en_IE |
nui.item.downloads | 71 | |