Show simple item record

dc.contributor.authorMonti, Johanna
dc.contributor.authorSangati, Federico
dc.contributor.authorArcan, Mihael
dc.date.accessioned2019-02-04T15:09:56Z
dc.date.available2019-02-04T15:09:56Z
dc.date.issued2015-12-03
dc.identifier.citationMonti, Johanna, Sangati, Federico, & Arcan, Mihael. (2015). TED-MWE: a bilingual parallel corpus with MWE annotation: Towards a methodology for annotating MWEs in parallel multilingual corpora. Paper presented at the Second Italian Conference on Computational Linguistics (CLiC-it 2015), Trento, Italy, 3-4 December.en_IE
dc.identifier.isbn9788899200008.
dc.identifier.urihttp://hdl.handle.net/10379/14901
dc.description.abstractThe translation of Multiword expressions (MWE) by Machine Translation (MT) represents a big challenge, and although MT has considerably improved in recent years, MWE mistranslations still occur very frequently. There is the need to develop large data sets, mainly parallel corpora, annotated with MWEs, since they are useful both for SMT training purposes and MWE translation quality evaluation. This paper describes a methodology to annotate a parallel spoken corpus with MWEs. The dataset used for this experiment is an English-Italian corpus extracted from the TED spoken corpus and complemented by an SMT output.en_IE
dc.description.sponsorshipWe greatly acknowledge the PARSEME IC1207 COST Action for supporting this work. We are particularly grateful to Manuela Cherchi, Erika Ibba, Anna De Santis, Giuseppe Casu, Jessica Ladu, Ilaria Del Rio, Elisa Virdis, Gino Castangia for their annotation work.en_IE
dc.formatapplication/pdfen_IE
dc.language.isoenen_IE
dc.publisherAccademia University Pressen_IE
dc.relation.ispartofSecond Italian Conference on Computational Linguistics (CLiC-it 2015)en
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/3.0/ie/
dc.subjectTED-MWEen_IE
dc.subjectBilingual parallel corpusen_IE
dc.subjectMultilingualen_IE
dc.titleTED-MWE: a bilingual parallel corpus with MWE annotation: Towards a methodology for annotating MWEs in parallel multilingual corporaen_IE
dc.typeConference Paperen_IE
dc.date.updated2019-01-23T17:52:49Z
dc.identifier.doi10.4000/books.aaccademia.1514
dc.local.publishedsourcehttps://dx.doi.org/10.4000/books.aaccademia.1514en_IE
dc.description.peer-reviewednon-peer-reviewed
dc.internal.rssid13192050
dc.local.contactMihael Arcan. Email: mihael.arcan@insight-centre.org
dc.local.copyrightcheckedYes
dc.local.versionPUBLISHED
nui.item.downloads86


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland