Now showing items 1-20 of 61

    • A comparison of emotion annotation approaches for text 

      Wood, Ian; McCrae, John; Andryushechkin, Vladimir; Buitelaar, Paul (MDPI AG, 2018-05-11)
      While the recognition of positive/negative sentiment in text is an established task with many standard data sets and well developed methodologies, the recognition of a more nuanced affect has received less attention: there ...
    • Applying informal health reports and search queries for public health monitoring: An evaluation of characteristics, potentials, and requirements of online self-reporting, discussions, and search behaviour 

      Marques Barros, Joana Carina (NUI Galway, 2021-01-13)
      The introduction of digital data sources has positively impacted public health surveillance and has paved the way for novel approaches. Internet-based sources provide large volumes of data which can be analysed in near ...
    • Asistent -- a machine translation system for Slovene, Serbian and Croatian 

      Arcan, Mihael; Popovic, Maja; Buitelaar, Paul (University of Ljubljana, 2016-09-29)
      The META-NET research on language technologies in 2012 showed a weak support on tools for crossing the language barrier for many European languages, including the south Slavic languages. Therefore, we describe a statistical ...
    • Automatic enrichment of terminological resources: the IATE RDF example 

      Arcan, Mihael; Montiel-Ponsoda, Elena; McCrae, John P.; Buitelaar, Paul (European Language Resources Association, 2018-05-07)
      Terminological resources have proven necessary in many organizations and institutions to ensure communication between experts. However, the maintenance of these resources is a very time-consuming and expensive process. ...
    • Automatic taxonomy generation: a use-case in the legal domain 

      Robin, Cécile; O'Neill, James; Buitelaar, Paul (LTC'17, 8th Language & Technology Conference, 2017-11-17)
      A key challenge in the legal domain is the adaptation and representation of the legal knowledge expressed through texts, in order for legal practitioners and researchers to access this information more easily and faster ...
    • Back-translation approach for code-switching machine translation: A case study 

      Masoud, Maraim; Torregrosa, Daniel; Buitelaar, Paul; Arčan, Mihael (AICS2019, 2019-12-05)
      Recently, machine translation has demonstrated significant progress in terms of translation quality. However, most of the research has focused on translating with pure monolingual texts in the source and the target side ...
    • Benchmarking Domain-Specific Expert Search Using Workshop Program Committees 

      Bordea, Georgeta; Buitelaar, Paul (ACM, 2013)
      Traditionally, relevance assessments for expert search have been gathered through self-assessment or based on the opinions of co-workers. We introduce three benchmark datasets1 for expert search that use conference workshops ...
    • Challenges for the multilingual web of data 

      Gracia, Jorge; Montiel-Ponsoda, Elena; Cimiano, Philipp; Gómez-Pérez, Asunción; Buitelaar, Paul; McCrae, John (Elsevier BV, 2012-03-01)
    • Classifying sentential modality in legal language: A use case in financial regulations, acts and directives 

      O'Neill, James; Buitelaar, Paul; Robin, Cécile; O'Brien, Leona (ACM, 2017-06-12)
      Texts expressed in legal language are often di cult and time consuming for lawyers to read through, particularly for the purpose of identifying relevant deontic modalities (obligations, prohibitions and permissions). ...
    • A comparison of emotion annotation approaches for text 

      Wood, Ian D.; McCrae, John P.; Andryushechkin, Vladimir; Buitelaar, Paul (MDPI, 2018-05-11)
      While the recognition of positive/negative sentiment in text is an established task with many standard data sets and well developed methodologies, the recognition of a more nuanced affect has received less attention: there ...
    • A comparison of emotion annotation schemes and a new annotated data set 

      Wood, Ian D.; McCrae, John P.; Andryushechkin, Vladimir; Buitelaar, Paul (European Languages Resources Association (ELRA), 2018-05-07)
      While the recognition of positive/negative sentiment in text is an established task with many standard data sets and well developed methodologies, the recognition of more nuanced affect has received less attention, and ...
    • Contextualised sentiment analysis in the financial domain 

      Daudert, Tobias (NUI Galway, 2021-07-02)
      Sentiments and beliefs play an important role in actions and decisions in a market environment; for example, people disliking a brand will tend to avoid products of it or people aiming to reduce their carbon-dioxide footprint ...
    • CURED4NLG: A dataset for table-to-text generation 

      Pasricha, Nivranshu; Arcan, Mihael; Buitelaar, Paul (University of Galway, 2023)
      We introduce CURED4NLG, a dataset for the task of table-to-text generation focusing on the public health domain. The dataset consists of 280 pairs of tables and documents extracted from weekly epidemiological reports ...
    • A dataset for troll classification of Tamil memes 

      Chakravarthi, Bharathi Raja; Varma, Pranav; Arcan, Mihael; McCrae, John P.; Buitelaar, Paul; Shardul, Suryawanshi (European Language Resources Association (ELRA), 2020-05-11)
      Social media are interactive platforms that facilitate the creation or sharing of information, ideas or other forms of expression among people. This exchange is not free from offensive, trolling or malicious contents ...
    • A decade of Semantic Web research through the lenses of a mixed methods approach 

      Kirrane, Sabrina; Sabou, Marta; Fernandez, Javier D.; Osborne, Francesco; Robin, Cécile; Buitelaar, Paul; Motta, Enrico; Polleres, Axel (IOS Press, 2019-06-20)
      The identification of research topics and trends is an important scientometric activity, as it can help guide the direction of future research. In the Semantic Web area, initially topic and trend detection was primarily ...
    • Developing a Dataset for Technology Structure Mining 

      QasemiZadeh, Behrang; Buitelaar, Paul; Monaghan, Fergal (IEEE, 2010)
      This paper describes steps that have been taken to construct a development dataset for the task of Technology Structure Mining. We have defined the proposed task as the process of mapping a scientific corpus into a ...
    • Domain adaptive extraction of topical hierarchies for Expertise Mining 

      Bordea, Georgeta (2013-09-11)
      In this age of pervasive internet access we have become accustomed to rely on web search for our most basic information needs. But complex queries in knowledge-intensive organisations, as well as in the academic environment, ...
    • Domain-independent term extraction through domain modelling 

      Bordea, Georgeta; Buitelaar, Paul; Polajnar, Tamara (10th International Conference on Terminology and Artificial Intelligence, 2013-09-11)
      Extracting general or intermediate level terms is a relevant problem that has not received much attention in literature. Current approaches for term extraction rely on contrastive corpora to identify domain-specific terms, ...
    • Enhancing multiple-choice question answering with causal knowledge 

      Dalal, Dhairya; Arcan, Mihael; Buitelaar, Paul (Association for Computational Linguistics, 2021-06-10)
      The task of causal question answering aims to reason about causes and effects over a provided real or hypothetical premise. Recent approaches have converged on using transformer-based language models to solve question ...
    • Enhancing statistical machine translation with bilingual terminology in a CAT environment 

      Arcan, Mihael; Turchi, Marco; Tonelli, Sara; Buitelaar, Paul (Association for Machine Translation in the Americas, 2014-10-22)
      In this paper, we address the problem of extracting and integrating bilingual terminology into a Statistical Machine Translation (SMT) system for a Computer Aided Translation (CAT) tool scenario. We develop a framework ...