Recent Submissions

  • CoFiF: A corpus of financial reports in French language 

    Ahmadi, Sina; Daudert, Tobias (NUI Galway, 2019-08-12)
    In an era when machine learning and artificial intelligence have huge momentum, the data demand to train and test models is steadily growing. We introduce CoFiF, the first corpus comprising company reports in the French ...
  • Creating a fine-grained corpus for a less-resourced language: the case of Kurdish 

    Omer Abdulrahman, Roshna; Hassani, Hossein; Ahmadi, Sina (NUI Galway, 2019-07-28)
    Kurdish is a less-resourced language consisting of different dialects written in various scripts. Approximately 30 million people in different countries speak the language. The lack of corpora is one of the main obstacles ...
  • NUIG at the FinSBD Task: Sentence boundary detection for noisy financial PDFs in English and French 

    Daudert, Tobias; Ahmadi, Sina (NUI Galway, 2019-08-12)
    Portable Document Format (PDF) has become the industry-standard document as it is independent of the software, hardware or operating system. Publicly listed companies annually publish a variety of reports and too take ...
  • Passive diagnosis incorporating the PHQ-4 for depression and anxiety 

    Delahunty, Fionn; Johansson, Robert; Mihael, Arcan (NUI Galway, 2019)
    Depression and anxiety are the two most prevalent mental health disorders worldwide, impacting the lives of millions of people each year. In this work, we develop and evaluate a multilabel, multidimensional deep neural ...
  • On lexicographical networks 

    Ahmadi, Sina; Arcan, Mihael; McCrae, John (NUI Galway, 2018-12-06)
    In this study, we analyze various aspects of lexicographical networks. We would like to answer our research questions of what are the characteristics of the lexicographical networks? In addition to the existing notions of ...
  • Open social data crime analytics 

    Ihsan, Ullah,; Lane, Caoilfhionn; Drury, Brett; Mellotte, Marc; Madden, Michael G. (IJCAI 17 Melbourne, 2017-07-20)
    Crime is under-reported. Reporting crime requires the victim to complete a number of administrative obligations. These obligations, as well as the nature of the crime, may create an inertia that discourages the reporting ...
  • Enabling case-based reasoning on the web of data 

    Heitmann, Benjamin; Hayes, Conor (2010-07-20)
    While Case-based reasoning (CBR) has successfully been deployed on the Web, its data models are typically inconsistent with existing information infrastructure and standards. In this paper, we examine how CBR can operate ...
  • The role of negative results for choosing an evaluation approach - a recommender systems case study 

    Heitmann, Benjamin; Hayes, Conor (CEUR Workshop Proceedings, 2015-06-01)
    We describe a case study, which shows how important negative results are in uncovering biased evaluation methodologies. Our re- search question is how to compare a recommender algorithm that uses an RDF graph to a ...
  • Semantic relation classification: task formalisation and refinement 

    Silva, Vivian S.; Hürliman, Manuela; Davis, Brian; Handschuh, Siegfried; Freitas, André (Association for Computational Linguistics, 2016-12-12)
    The identification of semantic relations between terms within texts is a fundamental task in Natural Language Processing which can support applications requiring a lightweight semantic interpretation model. Currently, ...
  • Dublin City University and Partners' participation in the INS and VTT Tracks at TRECVid 2016 

    Marsden, Mark; Mohedano, Eva; McGuinness, Kevin; Calafell, Andrea; Giro-i-Nieto, Xavier; O'Connor, Noel E.; Zhou, Jiang; Azavedo, Lucas; Daudert, Tobias; Davis, Brian; Hürlimann, Manuela; Afli, Haithem; Du, Jinhua; Ganguly, Debasis; Li, Wei; Way, Andy; Smeaton, Alan F. (2016-11-14)
    Dublin City University participated with a consortium of colleagues from NUI Galway and Universitat Polit`ecnica de Catalunya in two tasks in TRECVid 2016, Instance Search (INS) and Video to Text (VTT). For the INS task ...
  • A Twitter sentiment gold standard for the Brexit referendum 

    Hürlimann, Manuela; Davis, Brian; Cortis, Keith; Freitas, André; Handschuh, Siegfried; Fernández, Sergio (CEUR Workshop Proceedings, 2016-09-12)
    A Twitter Sentiment Gold Standard for the Brexit Referendum Manuela Hürlimann, Brian Davis Insight Centre for Data Analytics National University of Ireland Galway, Ireland {first.last}@insight-centre.org Keith Cortis, André ...
  • In or out? Real-time monitoring of BREXIT sentiment on Twitter 

    Vasiliu, Laurentiu; Freitas, André; Caroli, Frederico; McDermott, Ross; Zarrouk, Manel; Hürlimann, Manuela; Davis, Brian; Daudert, Tobias; Khaled, Malek Ben; Byrne, David; Fernández, Sergio; Cavallini, Angelo (CEUR Workshop Proceedings, 2016-09-12)
    The SSIX (Social Sentiment analysis financial IndeXes) project is a European Innovation Project sponsored by the European Commission under the Horizon 2020 framework. SSIX aims to provide European SMEs with a collection ...
  • Combining lexical and spatial knowledge to predict spatial relations between objects in images 

    Hürlimann, Manuela; Bos, Johan (ACL Anthology, 2016-08-11)
    Explicit representations of images are useful for linguistic applications related to images. We design a representation based on first-order models that capture the objects present in an image as well as their spatial ...
  • A hybrid method for rating prediction using linked data features and text reviews 

    Yumusak, Semih; Muñoz, Emir; Minervini, Pasquale; Dogdu, Erdogan; Kodaz, Halife (CEUR-WS.org, 2016)
    This paper describes our entry for the Linked Data Mining Challenge 2016, which poses the problem of classifying music albums as good or bad by mining Linked Data. The original labels are assigned according to aggregated ...
  • A linked data-based decision tree classifier to review movies 

    Aldarra, Suad; Muñoz, Emir (CEUR-WS.org, 2015)
    In this paper, we describe our contribution to the 2015 Linked Data Mining Challenge. The proposed task is concerned with the prediction of review of movies as good or bad , as does Metacritic website based on critics ...
  • Learning content patterns from linked data 

    Muñoz, Emir (CEUR-WS.org, 2014)
    Linked Data (LD) datasets (e.g., DBpedia, Freebase) are used in many knowledge extraction tasks due to the high variety of domains they cover. Unfortunately, many of these datasets do not provide a description for their ...
  • µRaptor: A DOM-based system with appetite for hCard elements 

    Muñoz, Emir; Costabello, Luca; Vandenbussche, Pierre-Yves (CEUR-WS.org, 2014)
    This paper describes µRaptor, a DOM-based method to extract hCard microformats from HTML pages stripped of microformat markup. µRaptor extracts DOM sub-trees, converts them into rules, and uses them to extract hCard ...
  • Triplifying Wikipedia's tables 

    Muñoz, Emir; Hogan, Aidan; Mileo, Alessandra (CEUR-WS.org, 2013)
    We are currently investigating methods to triplify the content of Wikipedia's tables. We propose that existing knowledge-bases can be leveraged to semi-automatically extract high-quality facts (in the form of RDF triples) ...
  • Using social media data for online television recommendation services at RTÉ Ireland 

    Barraza-Urbina, Andrea; Hromic, Hugo; Heitmann, Benjamin; Hayes, Conor; Hulpus, Ioana (2015-09)
    Raidió Teilifís Éireann (RTÉ) is the public service television and radio broadcaster in Ireland. Through on demand video services, RTÉ allows their users to catch up on television broadcasts via the RTÉ Player. The company ...
  • Robot-assisted care for elderly with dementia: is there a potential for genuine end-user empowerment? 

    Felzmann, Heike; Murphy, Kathy; Casey, Dympna; Beyan, Oya (2015)

View more