Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation
Date
2019-05-20Author
Arcan, Mihael
Torregrosa, Daniel
Ahmadi, Sina
McCrae, John P.
Metadata
Show full item recordUsage
This item's downloads: 498 (view details)
Recommended Citation
Arcan, Mihael, Torregrosa, Daniel, Ahmadi, Sina, & McCrae, John P. (2019). Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation. Paper presented at the Translation Inference Across Dictionaries Workshop (TIAD 2019), Leipzig, Germany, 20-23 May, doi:10.13025/S89K9J
Published Version
Abstract
In the widely-connected digital world, multilingual lexical
resources are one of the most important resources, for natural language
processing applications, including information retrieval, question answering or knowledge management. These applications benefit from the multilingual knowledge as well as from the semantic relation between the words
documented in these resources. Since multilingual dictionary creation
and curation is a time-consuming task, we explored the use of multi-way
neural machine translation trained on corpora of languages from the same
family and trained additionally with a relatively small human-validated
dictionary to infer new translation candidates. Our results showed not
only that new dictionary entries can be identified and extracted from the
translation model, but also that the expected precision and recall of the
resulting dictionary can be adjusted by using different thresholds.