Browsing Data Science Institute (Conference Papers) by Author "http://dx.doi.org/10.13039/501100002081"

Now showing items 1-5 of 5

Cross-lingual sentence embedding using multi-task learning

Goswami, Koustava; Dutta, Sourav; Assem, Haytham; Fransen, Theodorus; McCrae, John P. (Association for Computational Linguistics, 2021-11-07)

Multilingual sentence embeddings capture rich semantic information not only for measuring similarity between texts but also for catering to a broad range of downstream cross-lingual NLP tasks. State-of-the-art multilingual ...
Historical data preservation and interpretation pipeline for Irish civil registration records

Beyan, Oya; Mealy, P. J.; Grant, Dolores; Grant, Rebecca; Harrower, Natalie; Breathnach, Ciara; Collins, Sandra; Decker, Stefan (Springer Verlag, 2015-10-28)

Semantic Web technologies give us the opportunity to understand today's data-rich society and provide novel means to explore our past. Civil registration records such as birth, death, and marriage registers contain a vast ...
A multilingual evaluation dataset for monolingual word sense alignment

Ahmadi, Sina; McCrae, John P.; Nimb, Sanni; Khan, Fahad; Monachini, Monica; Pedersen, Bolette S.; Declerck, Thierry; Wissik, Tanja; Bellandi, Andrea; Pisani, Irene; Troelsgård, Thomas; Olsen, Sussi; Krek, Simon; Lipp, Veronika; Váradi, Tamás; Simon, László; Gyorffy, Andras; Tiberius, Carole; Schoonheim, Tanneke; Moshe, Yifat Ben; Rudich, Maya; Ahmad, Raya Abu; Lonke, Dorielle; Kovalenko, Kira; Langemets, Margit; Kallas, Jelena; Oksana, Dereza; Fransen, Theodorus; Cillessen, David; Lindemann, David; Alonso, Mikel; Salgado, Ana; Sancho, Jose Luis; Urena-Ruiz, Rafael-J.; Zamorano, Jordi Porta; Simov, Kiril; Osenova, Petya; Kancheva, Zara; Radev, Ivaylo; Stankovic, Ranka; Perdih, Andrej; Gabrovsek, Dejan (National University of Ireland Galway, 2020-05-16)

Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually ...
NUIG-Panlingua-KMI Hindi-Marathi MT Systems for Similar Language Translation Task @ WMT 2020

Ojha, Atul Kr.; Rani, Priya; Bansal, Akanksha; Chakravarthi, Bharathi Raja; Kumar, Ritesh; McCrae, John P. (Association for Computational Linguistics, 2020-11-19)

NUIG-Panlingua-KMI submission to WMT 2020 seeks to push the state-of-the-art in the Similar language translation task for the Hindi ↔ Marathi language pair. As part of these efforts, we conducted a series of experiments to ...
Unsupervised deep language and dialect identification for short texts

Goswami, Koustava; Sarkar, Rajdeep; Chakravarthi, Bharathi Raja; Fransen, Theodorus; McCrae, John P. (International Committee on Computational Linguistics, 2020-12)

Automatic Language Identification (LI) or Dialect Identification (DI) of short texts of closely related languages or dialects, is one of the primary steps in many natural language processing pipelines. Language identification ...

Browsing Data Science Institute (Conference Papers) by Author "http://dx.doi.org/10.13039/501100002081"

Cross-lingual sentence embedding using multi-task learning ﻿

Historical data preservation and interpretation pipeline for Irish civil registration records ﻿

A multilingual evaluation dataset for monolingual word sense alignment ﻿

NUIG-Panlingua-KMI Hindi-Marathi MT Systems for Similar Language Translation Task @ WMT 2020 ﻿

Unsupervised deep language and dialect identification for short texts ﻿

Cross-lingual sentence embedding using multi-task learning

Historical data preservation and interpretation pipeline for Irish civil registration records

A multilingual evaluation dataset for monolingual word sense alignment

NUIG-Panlingua-KMI Hindi-Marathi MT Systems for Similar Language Translation Task @ WMT 2020

Unsupervised deep language and dialect identification for short texts