Show simple item record

dc.contributor.authorPopovic, Maja
dc.contributor.authorArcan, Mihael
dc.contributor.authorAvramidis, Eleftherios
dc.contributor.authorBurchardt, Aljoscha
dc.contributor.authorLommel, Arle
dc.date.accessioned2019-02-04T15:33:29Z
dc.date.available2019-02-04T15:33:29Z
dc.date.issued2015-04-11
dc.identifier.citationPopovic, Maja, Arcan, Mihael, Avramidis, Eleftherios, Burchardt, Aljoscha, & Lommel, Arle. (2015). Poor man’s lemmatisation for automatic error classification. Paper presented at the 18th Annual Conference of the European Association for Machine Translation (EAMT2015 ), Antalya, Turkey, 11-13 May.en_IE
dc.identifier.urihttp://hdl.handle.net/10379/14902
dc.description.abstractThis paper demonstrates the possibility to make an existing automatic error classifier for machine translations independent from the requirement of lemmatisation. This makes it usable also for smaller and under-resourced languages and in situations where there is no lemmatiser at hand. It is shown that cutting all words into the first four letters is the best method even for highly inflective languages, preserving both the detected distribution of error types within a translation output as well as over various translation outputs. The main cost of not using a lemmatiser is the lower accuracy of detecting the inflectional error class due to its confusion with mistranslations. For shorter words, actual inflectional errors will be tagged as mistranslations, for longer words the other way round. Keeping all that in mind, it is possible to use the error classifier without target language lemmatisation and to extrapolate inflectional and lexical error rates according to the average word length in the analysed text.en_IE
dc.description.sponsorshipThis publication has emanated from research supported by QTLEAP project – ECs FP7 (FP7/2007- 2013) under grant agreement number 610516: “QTLEAP: Quality Translation by Deep Language Engineering Approaches” and by a research grant from Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289. We are grateful to the reviewers for their valuable feedbacen_IE
dc.language.isoenen_IE
dc.publisherEuropean Association for Machine Translationen_IE
dc.relation.ispartofEuropean Association for Machine Translation (EAMT-2015)en
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/3.0/ie/
dc.subjectLemmatisationen_IE
dc.subjectError classificationen_IE
dc.titlePoor man’s lemmatisation for automatic error classificationen_IE
dc.typeConference Paperen_IE
dc.date.updated2019-01-23T17:53:30Z
dc.local.publishedsourcehttps://aclanthology.info/papers/W15-4914/w15-4914en_IE
dc.description.peer-reviewednon-peer-reviewed
dc.contributor.funderSeventh Framework Programmeen_IE
dc.contributor.funderScience Foundation Irelanden_IE
dc.internal.rssid13192048
dc.local.contactMihael Arcan. Email: mihael.arcan@insight-centre.org
dc.local.copyrightcheckedYes
dc.local.versionPUBLISHED
dcterms.projectinfo:eu-repo/grantAgreement/EC/FP7::SP1::ICT/610516/EU/Quality Translation by Deep Language Engineering Approaches/QTLEAPen_IE
dcterms.projectinfo:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/en_IE
nui.item.downloads78


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland