Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation
McCrae, John P.
MetadataShow full item record
This item's downloads: 478 (view details)
Arcan, Mihael, Torregrosa, Daniel, Ahmadi, Sina, & McCrae, John P. (2019). Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation. Paper presented at the Translation Inference Across Dictionaries Workshop (TIAD 2019), Leipzig, Germany, 20-23 May, doi:10.13025/S89K9J
In the widely-connected digital world, multilingual lexical resources are one of the most important resources, for natural language processing applications, including information retrieval, question answering or knowledge management. These applications benefit from the multilingual knowledge as well as from the semantic relation between the words documented in these resources. Since multilingual dictionary creation and curation is a time-consuming task, we explored the use of multi-way neural machine translation trained on corpora of languages from the same family and trained additionally with a relatively small human-validated dictionary to infer new translation candidates. Our results showed not only that new dictionary entries can be identified and extracted from the translation model, but also that the expected precision and recall of the resulting dictionary can be adjusted by using different thresholds.