    • Challenges of word sense alignment: Portuguese language resources 

      Salgado, Ana; Ahmadi, Sina; Simões, Alberto; McCrae, John P.; Costa, Rute (National University of Ireland Galway, 2020-05-16)
      This paper reports on an ongoing task of monolingual word sense alignment in which a comparative study between the Portuguese Academy of Sciences Dictionary and the Dicionário Aberto is carried out in the context of the ...
    • CoFiF: A corpus of financial reports in French language 

      Ahmadi, Sina; Daudert, Tobias (NUI Galway, 2019-08-12)
      In an era when machine learning and artificial intelligence have huge momentum, the data demand to train and test models is steadily growing. We introduce CoFiF, the first corpus comprising company reports in the French ...
    • A corpus of the Sorani Kurdish folkloric lyrics 

      Ahmadi, Sina; Hassani, Hossein; Abedi, Kamaladdin (National University of Ireland Galway, 2020-05-16)
      Kurdish poetry and prose narratives were historically transmitted orally and less in a written form. Being an essential medium of oral narration and literature, Kurdish lyrics have had a unique attribute in becoming a ...
    • Creating a fine-grained corpus for a less-resourced language: the case of Kurdish 

      Omer Abdulrahman, Roshna; Hassani, Hossein; Ahmadi, Sina (NUI Galway, 2019-07-28)
      Kurdish is a less-resourced language consisting of different dialects written in various scripts. Approximately 30 million people in different countries speak the language. The lack of corpora is one of the main obstacles ...
    • Creating a multilingual terminological resource using linked data: the case of archaeological domain in the Italian language 

      Carlino, Carola; Ahmadi, Sina; Speranza, Giulia (CEUR Workshop Proceedings, 2019-11-13)
      The lack of multilingual terminological resources in specialized domains constitutes an obstacle to the access and reuse of information. In the technical domain of cultural heritage and, in particular, archaeology, such ...
    • Defying Wikidata: Validation of terminological relations in the web of data 

      Martín-Chozas, Patricia; Ahmadi, Sina; Montiel-Ponsoda, Elena (National University of Ireland Galway, 2020-05-16)
      In this paper we present an approach to validate terminological data retrieved from open encyclopaedic knowledge bases. This need arises from the enrichment of automatically extracted terms with information from existing ...
    • The ELEXIS interface for interoperable lexical resources 

      McCrae, John P.; Tiberius, Carole; Khan, Anas Fahad; Kernerman, Ilan; Declerck, Thierry; Krek, Simon; Monachini, Monica; Ahmadi, Sina (eLex 2019, 2019-10-01)
      ELEXIS is a project that aims to create a European network of lexical resources, and one of the key challenges for this is the development of an interoperable interface for different lexical resources so that further ...
    • Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation 

      Arcan, Mihael; Torregrosa, Daniel; Ahmadi, Sina; McCrae, John P. (National University of Ireland, Galway, 2019-05-20)
      In the widely-connected digital world, multilingual lexical resources are one of the most important resources for natural language processing applications, including information retrieval, question answering or knowledge ...
    • Lexical sense alignment using weighted bipartite b-matching 

      Ahmadi, Sina; Arcan, Mihael; McCrae, John (NUI Galway, 2019-05-20)
      In this study, we present a similarity-based approach for lexical sense alignment in WordNet and Wiktionary with a focus on the polysemous items. Our approach relies on semantic textual similarity using features such as ...
    • A multilingual evaluation dataset for monolingual word sense alignment 

      Ahmadi, Sina; McCrae, John P.; Nimb, Sanni; Khan, Fahad; Monachini, Monica; Pedersen, Bolette S.; Declerck, Thierry; Wissik, Tanja; Bellandi, Andrea; Pisani, Irene; Troelsgård, Thomas; Olsen, Sussi; Krek, Simon; Lipp, Veronika; Váradi, Tamás; Simon, László; Gyorffy, Andras; Tiberius, Carole; Schoonheim, Tanneke; Moshe, Yifat Ben; Rudich, Maya; Ahmad, Raya Abu; Lonke, Dorielle; Kovalenko, Kira; Langemets, Margit; Kallas, Jelena; Oksana, Dereza; Fransen, Theodorus; Cillessen, David; Lindemann, David; Alonso, Mikel; Salgado, Ana; Sancho, Jose Luis; Urena-Ruiz, Rafael-J.; Zamorano, Jordi Porta; Simov, Kiril; Osenova, Petya; Kancheva, Zara; Radev, Ivaylo; Stankovic, Ranka; Perdih, Andrej; Gabrovsek, Dejan (National University of Ireland Galway, 2020-05-16)
      Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually ...
    • NUIG at the FinSBD Task: Sentence boundary detection for noisy financial PDFs in English and French 

      Daudert, Tobias; Ahmadi, Sina (NUI Galway, 2019-08-12)
      Portable Document Format (PDF) has become the industry-standard document as it is independent of the software, hardware or operating system. Publicly listed companies annually publish a variety of reports and too take ...
    • On lexicographical networks 

      Ahmadi, Sina; Arcan, Mihael; McCrae, John (NUI Galway, 2018-12-06)
      In this study, we analyze various aspects of lexicographical networks. We aim to answer the research question: what are the characteristics of lexicographical networks? In addition to the existing notions of ...
    • TIAD 2019 Shared Task: Leveraging knowledge graphs with neural machine translation for automatic multilingual dictionary generation 

      Torregrosa, Daniel; Arcan, Mihael; Ahmadi, Sina; McCrae, John P. (National University of Ireland, Galway, 2019-04-20)
      This paper describes the different proposed approaches to the TIAD 2019 Shared Task, which consisted in the automatic discovery and generation of dictionaries leveraging multilingual knowledge bases. We present three methods ...
    • Towards electronic lexicography for the Kurdish language 

      Ahmadi, Sina; Hassani, Hossein; McCrae, John P. (eLex 2019, 2019-10-01)
      This paper describes the development of lexicographic resources for Kurdish and provides a lexical model for this language. Kurdish is considered a less-resourced language, and currently, lacks machine-readable lexical ...