dc.contributor.author | Arcan, Mihael | |
dc.contributor.author | McCrae, John P. | |
dc.contributor.author | Buitelaar, Paul | |
dc.date.accessioned | 2019-01-30T15:04:05Z | |
dc.date.available | 2019-01-30T15:04:05Z | |
dc.date.issued | 2016-12-11 | |
dc.identifier.citation | Arcan, Mihael, McCrae, John P., & Buitelaar, Paul. (2016). Expanding wordnets to new languages with multilingual sense disambiguation. Paper presented at the COLING 2016, the 26th International Conference on Computational Linguistics, Osaka, Japan, 11-16 December. | en_IE |
dc.identifier.uri | http://hdl.handle.net/10379/14889 | |
dc.description.abstract | Princeton WordNet is one of the most important resources for natural language processing, but
is only available for English. While it has been translated using the expand approach to many
other languages, this is an expensive manual process. Therefore it would be beneficial to have a
high-quality automatic translation approach that would support NLP techniques, which rely on
WordNet in new languages. The translation of wordnets is fundamentally complex because of the
need to translate all senses of a word including low frequency senses, which is very challenging
for current machine translation approaches. For this reason we leverage existing translations
of WordNet in other languages to identify contextual information for wordnet senses from a
large set of generic parallel corpora. We evaluate our approach using 10 translated wordnets for
European languages. Our experiment shows a significant improvement over translation without
any contextual information. Furthermore, we evaluate how the choice of pivot languages affects
performance of multilingual word sense disambiguation. | en_IE |
dc.description.sponsorship | This publication has emanated from research supported in part by a research grant from Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289 (Insight) and the European Union supported
project MixedEmotions (H2020-644632). | en_IE |
dc.format | application/pdf | en_IE |
dc.language.iso | en | en_IE |
dc.publisher | The COLING 2016 Organizing Committee | en_IE |
dc.relation.ispartof | International Conference on Computational Linguistics (COLING-2016) | en |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Ireland | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/3.0/ie/ | |
dc.subject | Wordnets | en_IE |
dc.subject | Languages | en_IE |
dc.subject | Multilingual | en_IE |
dc.subject | Disambiguation | en_IE |
dc.title | Expanding wordnets to new languages with multilingual sense disambiguation | en_IE |
dc.type | Conference Paper | en_IE |
dc.date.updated | 2019-01-23T17:41:46Z | |
dc.local.publishedsource | https://aclanthology.info/volumes/proceedings-of-coling-2016-the-26th-international-conference-on-computational-linguistics-technical-papers | en_IE |
dc.description.peer-reviewed | non-peer-reviewed | |
dc.contributor.funder | Science Foundation Ireland | en_IE |
dc.contributor.funder | Horizon 2020 | en_IE |
dc.internal.rssid | 13192049 | |
dc.local.contact | Mihael Arcan. Email: mihael.arcan@insight-centre.org | |
dc.local.copyrightchecked | Yes | |
dc.local.version | PUBLISHED | |
dcterms.project | info:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/ | en_IE |
dcterms.project | info:eu-repo/grantAgreement/EC/H2020::IA/644632/EU/Social Semantic Emotion Analysis for Innovative Multilingual Big Data Analytics Markets/MixedEmotions | en_IE |
nui.item.downloads | 81 | |