Now showing items 46-65 of 153

  • Generating linked-data based domain-specific sentiment lexicons from legacy language and semantic resources 

    Vulcu, Gabriela; Buitelaar, Paul; Negi, Sapna; Pereira, Bianca; Arcan, Mihael; Coughlan, Barry; Sanchez, J. Fernando; Iglesias, Carlos A. (European Language Resources Association, 2014)
    We present a methodology for legacy language resource adaptation that generates domain-specific sentiment lexicons organized around domain entities described with lexical information and sentiment words described in the ...
  • GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer research 

    Kamdar, M; Iqbal, A; Saleem, M; Deus, H; Decker, S (2014)
    Cancer genomics researchers have greatly benefited from high-throughput technologies for the characterization of genomic alterations in patients. These voluminous genomics datasets when supplemented with the appropriate ...
  • Grand challenge: Automatic anomaly detection over sliding windows 

    Zaarour, Tarek; Pavlopoulou, Niki; Hasan, Souleiman; ul Hassan, Umair; Curry, Edward (Association for Computing Machinery ACM, 2017-06-19)
    With the advances in the Internet of Things and rapid generation of vast amounts of data, there is an ever growing need for leveraging and evaluating event-based systems as a basis for building realtime data analytics ...
  • Historical data preservation and interpretation pipeline for Irish civil registration records 

    Beyan, Oya; Mealy, P. J.; Grant, Dolores; Grant, Rebecca; Harrower, Natalie; Breathnach, Ciara; Collins, Sandra; Decker, Stefan (Springer Verlag, 2015-10-28)
    Semantic Web technologies give us the opportunity to understand today's data-rich society and provide novel means to explore our past. Civil registration records such as birth, death, and marriage registers contain a vast ...
  • Hot Topics and Schisms in NLP: Community and Trend Analysis with Saffron on ACL and LREC Proceedings 

    Buitelaar, Paul; Bordea, Georgeta; Coughlan, Barry (2014)
    In this paper we present a comparative analysis of two series of conferences in the field of Computational Linguistics, the LREC conference and the ACL conference. Conference proceedings were ...
  • Identification of bilingual terms from monolingual documents for statistical machine translation 

    Arcan, Mihael; Giuliano, Claudio; Turchi, Marco; Buitelaar, Paul (Association for Computational Linguistics, 2014-08-23)
    The automatic translation of domain-specific documents is often a hard task for generic Statistical Machine Translation (SMT) systems, which are not able to correctly translate the large number of terms encountered in the ...
  • Identifying equivalent relation paths in knowledge graphs 

    Mohamed, Sameh K.; Muñoz, Emir; Nováček, Vít; Vandenbussche, Pierre-Yves (Springer Verlag, 2017-06-19)
    Relation paths are sequences of relations with inverse that allow for complete exploration of knowledge graphs in a two-way unconstrained manner. They are powerful enough to encode complex relationships between entities ...
  • Identifying main obstacles for statistical machine translation of morphologically rich South Slavic languages 

    Popovic, Maja; Arcan, Mihael (European Association for Machine Translation, 2015-05-11)
    The best way to improve a statistical machine translation system is to identify concrete problems causing translation errors and address them. Many of these problems are related to the characteristics of the involved ...
  • Implicit and explicit aspect extraction in financial microblogs 

    Gaillat, Thomas; Stearns, Bernardo; McDermott, Ross; Sridhar, Gopal; Zarrouk, Manel; Davis, Brian (Association for Computational Linguistics, 2018-07-15)
    This paper focuses on aspect extraction which is a sub-task of Aspect-based Sentiment Analysis. The goal is to report an extraction method of financial aspects in microblog messages. Our approach uses a stock-investment ...
  • Improving wordnets for under-resourced languages using machine translation 

    Chakravarthi, Bharathi Raja; Arcan, Mihael; McCrae, John P. (Global Wordnet Association, 2018-01-08)
    Wordnets are extensively used in natural language processing, but the current approaches for manually building a wordnet from scratch involves large research groups for a long period of time, which are typically not ...
  • Instance selection for online automatic post-editing in a multi-domain scenario 

    Chatterjee, Rajen; Arcan, Mihael; Negri, Matteo; Turchi, Marco (Association for Machine Translation in the Americas, 2016)
    In recent years, several end-to-end online translation systems have been proposed to successfully incorporate human post-editing feedback in the translation workflow. The performance of these systems in a multi-domain ...
  • Integrating Open Source Software Repositories on the Web through Linked Data 

    Iqbal, A; Decker S (2015)
    In this paper, we propose a novel approach to the problem of integrating code forges based on metadata (e.g., programming language, database, intended audience, operating system), similar software projects and software ...
  • Investigating Context Parameters in Technology Term Recognition 

    QasemiZadeh, Behrang; Handschuh, siegfried (2014)
    We propose and evaluate the task of technology term recognition: a method to extract technology terms at a synchronic level from a corpus of scientific publications. The proposed method is built on the principles of ...
  • IRIS: English-Irish machine translation system 

    Arcan, Mihael; Lane, Caoilfhionn; Ó Droighneáin, Eoin; Buitelaar, Paul (European Language Resources Association, 2016-05-23)
    We describe IRIS, a statistical machine translation (SMT) system for translating from English into Irish and vice versa. Since Irish is considered an under-resourced language with a limited amount of machine-readable text, ...
  • Kanopy: Analysing the Semantic Network around Document Topics 

    Hulpus, Ioana; Hayes, Conor; Karnstedt, Marcel; Greene, Derek; Jozwowicz, Marek (Springer, 2013)
    External knowledge bases, both generic and domain specific, available on the Web of Data have the potential of enriching the content of text documents with structured information. We present the Kanopy system that makes ...
  • Knowledge portability with semantic expansion of ontology labels 

    Arcan, Mihael; Turchi, Marco; Buitelaar, Paul (Association for Computational Linguistics, 2015-07-26)
    Our research focuses on the multilingual enhancement of ontologies that, often represented only in English, need to be translated in different languages to enable knowledge access across languages. Ontology translation ...
  • Language related issues for machine translation between closely related south Slavic languages 

    Popovic, Maja; Arcan, Mihael; Klubicka, Filip (The COLING 2016 Organizing Committee, 2016-12-12)
    Machine translation between closely related languages is less challenging and exhibits a smaller number of translation errors than translation between distant languages, but there are still obstacles which should be ...
  • Linguistic linked data for sentiment analysis 

    Buitelaar, Paul; Arcan, Mihael; Iglesias, Carlos A.; Sánchez-Rada, Juan Fernando; Strapparava, Carlo (Association for Computational Linguistics, 2013-09-23)
    In this paper we describe the specification of a model for the semantically interoperable representation of language resources for sentiment analysis. The model integrates ‘lemon’, an RDF-based model for the specification ...
  • Linked Biomedical Dataspace: Lessons Learned integrating Data for Drug Discovery 

    Hasnain, Ali; Kamdar, Maulik; Deus, Helena; Mehdi, Muntazir; Decker, Stefan (Springer, 2014)
    The increase in the volume and heterogeneity of biomedical data sources has motivated researchers to embrace Linked Data (LD) technologies to solve the ensuing integration challenges and enhance information discovery. As ...
  • Linked cancer genome atlas database 

    Padmanabhuni, Shanmukha S.; Decker, Stefan (2013)