Search
Now showing items 1-9 of 9
Creating a fine-grained corpus for a less-resourced language: the case of Kurdish
(NUI Galway, 2019-07-28)
Kurdish is a less-resourced language consisting of different dialects written in various scripts. Approximately 30 million people in different countries speak the language. The lack of corpora is one of the main obstacles ...
Creating a multilingual terminological resource using linked data:the case of archaeological domain in the Italian language
(CEUR Workshop Proceedings, 2019-11-13)
The lack of multilingual terminological resources in specialized domains constitutes an obstacle to the access and reuse of information. In the
technical domain of cultural heritage and,
in particular, archaeology, such ...
Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation
(National University of Ireland, Galway, 2019-05-20)
In the widely-connected digital world, multilingual lexical
resources are one of the most important resources, for natural language
processing applications, including information retrieval, question answering or knowledge ...
NUIG at the FinSBD Task: Sentence boundary detection for noisy financial PDFs in English and French
(NUI Galway, 2019-08-12)
Portable Document Format (PDF) has become the industry-standard document as it is independent of the software, hardware or operating system. Publicly listed companies annually publish a variety of reports and too take ...
CoFiF: A corpus of financial reports in French language
(NUI Galway, 2019-08-12)
In an era when machine learning and artificial intelligence have huge momentum, the data demand to train and test models is steadily growing. We introduce CoFiF, the first corpus comprising company reports in the French ...
TIAD 2019 Shared Task: Leveraging knowledge graphs with neural machine translation for automatic multilingual dictionary generation
(National University of Ireland, Galway, 2019-04-20)
This paper describes the different proposed approaches to the TIAD 2019 Shared Task, which consisted in the automatic discovery and generation of dictionaries leveraging multilingual knowledge bases. We present three methods ...
Lexical sense alignment using weighted bipartite b-matching
(NUI Galway, 2019-05-20)
In this study, we present a similarity-based approach for lexical sense alignment in WordNet and Wiktionary with a focus on the polysemous items. Our approach relies on semantic textual similarity using features such as ...
Towards electronic lexicography for the Kurdish language
(eLex 2019, 2019-10-01)
This paper describes the development of lexicographic resources for Kurdish and provides a
lexical model for this language. Kurdish is considered a less-resourced language, and currently,
lacks machine-readable lexical ...
The ELEXIS interface for interoperable lexical resources
(eLex 2019, 2019-10-01)
ELEXIS is a project that aims to create a European network of lexical resources, and one of
the key challenges for this is the development of an interoperable interface for different lexical
resources so that further ...