    • A comparative study of different state-of-the-art hate speech detection methods in Hindi-English code-mixed data 

      Rani, Priya; Suryawanshi, Shardul; Goswami, Koustava; Chakravarthi, Bharathi Raja; Fransen, Theodorus; McCrae, John P. (European Language Resources Association (ELRA), 2020-05-11)
      Hate speech detection in social media communication has become one of the primary concerns to avoid conflicts and curb undesired activities. In an environment where multilingual speakers switch among multiple languages, ...
    • A comparison of emotion annotation approaches for text 

      Wood, Ian D.; McCrae, John P.; Andryushechkin, Vladimir; Buitelaar, Paul (MDPI, 2018-05-11)
      While the recognition of positive/negative sentiment in text is an established task with many standard data sets and well developed methodologies, the recognition of a more nuanced affect has received less attention: there ...
    • A comparison of emotion annotation schemes and a new annotated data set 

      Wood, Ian D.; McCrae, John P.; Andryushechkin, Vladimir; Buitelaar, Paul (European Language Resources Association (ELRA), 2018-05-07)
      While the recognition of positive/negative sentiment in text is an established task with many standard data sets and well developed methodologies, the recognition of more nuanced affect has received less attention, and ...
    • A comparison of statistical and neural machine translation for Slovene, Serbian and Croatian 

      Arcan, Mihael (Language Technologies and Digital Humanities 2018, 2018-09-20)
      In this paper we present a comparison of translation quality using Statistical Machine Translation (SMT) and Neural Machine Translation (NMT), considering translation directions between English, Slovene, Serbian and ...
    • Constructing Twitter Datasets using Signals for Event Detection Evaluation 

      Hromic, Hugo; Hayes, Conor (22nd International Conference on Case-Based Reasoning, 2014-09-29)
      Twitter is a very attractive real-time platform for research on event detection. However, despite the great amount of interest, datasets suitable for evaluating such methods are not easily available. The two most important ...
    • A Content Analysis: How Wikipedia Talk Pages Are Used 

      Schneider, Jodi; Passant, Alexandre; Breslin, John G. (2010)
    • A Context Lifecycle For Web-Based Context Management Services 

      Hynes, Gearoid; Reynolds, Vinny; Hauswirth, Manfred (2009)
      During the development of context aware applications a context management component must traditionally be created. This task requires specialist context lifecycle management expertise and hence can be a significant ...
    • Converging Web and Desktop Data with Konduit 

      Dragan, Laura; Möller, Knud; Handschuh, Siegfried; Ambrus, Oszkar (2009)
      In this paper we present Konduit, a desktop-based platform for visual scripting with RDF data. Based on the idea of the semantic desktop, non-technical users can create, manipulate and mash-up RDF data with Konduit, and ...
    • A Conversation-oriented language for B2B integration based on Semantic Web Services 

      Gomez, Juan Miguel; Haller, Armin; Bussler, Christoph (2005)
      Establishing conversations in a B2B environment has significantly eased since the advent of standards such as RosettaNet and ebXML. These standardisation efforts have maintained some flexibility in defining interactions ...
    • CORAAL - Towards Deep Exploitation of Textual Resources in Life Sciences 

      Nováček, Vít; Groza, Tudor; Handschuh, Siegfried (Springer Verlag, 2009)
      Prominent biomedical literature search tools like ScienceDirect, PubMed Central or MEDLINE allow for efficient retrieval of resources based on key words. Due to vast amounts of data available in life sciences, key word ...
    • Corpus creation for sentiment analysis in code-mixed Tamil-English text 

      Chakravarthi, Bharathi Raja; Muralidaran, Vigneshwaran; Priyadharshini, Ruba; McCrae, John P. (European Language Resources Association (ELRA), 2020-05-11)
      Understanding the sentiment of a comment from a video or an image is an essential task in many applications. Sentiment analysis of a text can be useful for various decision-making processes. One such application is to ...
    • A corpus of the Sorani Kurdish folkloric lyrics 

      Ahmadi, Sina; Hassani, Hossein; Abedi, Kamaladdin (National University of Ireland Galway, 2020-05-16)
      Kurdish poetry and prose narratives were historically transmitted orally and less in a written form. Being an essential medium of oral narration and literature, Kurdish lyrics have had a unique attribute in becoming a ...
    • Cost-Aware Processing of Similarity Queries in Structured Overlays 

      Karnstedt, Marcel; Hauswirth, Manfred (2006)
      Large-scale distributed data management with P2P systems requires the existence of similarity operators for queries as we cannot assume that all users will agree on exactly the same schema and value representations and ...
    • Creating a fine-grained corpus for a less-resourced language: the case of Kurdish 

      Omer Abdulrahman, Roshna; Hassani, Hossein; Ahmadi, Sina (NUI Galway, 2019-07-28)
      Kurdish is a less-resourced language consisting of different dialects written in various scripts. Approximately 30 million people in different countries speak the language. The lack of corpora is one of the main obstacles ...
    • Creating a multilingual terminological resource using linked data: the case of archaeological domain in the Italian language 

      Carlino, Carola; Ahmadi, Sina; Speranza, Giulia (CEUR Workshop Proceedings, 2019-11-13)
      The lack of multilingual terminological resources in specialized domains constitutes an obstacle to the access and reuse of information. In the technical domain of cultural heritage and, in particular, archaeology, such ...
    • Cross-lingual sentence embedding using multi-task learning 

      Goswami, Koustava; Dutta, Sourav; Assem, Haytham; Fransen, Theodorus; McCrae, John P. (Association for Computational Linguistics, 2021-11-07)
      Multilingual sentence embeddings capture rich semantic information not only for measuring similarity between texts but also for catering to a broad range of downstream cross-lingual NLP tasks. State-of-the-art multilingual ...
    • CURED4NLG: A dataset for table-to-text generation 

      Pasricha, Nivranshu; Arcan, Mihael; Buitelaar, Paul (University of Galway, 2023)
      We introduce CURED4NLG, a dataset for the task of table-to-text generation focusing on the public health domain. The dataset consists of 280 pairs of tables and documents extracted from weekly epidemiological reports ...
    • D-FOAF - Distributed Identity Management based on Social Networks 

      Kruk, Sebastian Ryszard; Gzella, Adam; Grzonkowski, Slawomir (2006)
      Contemporary Web consists of more than just information, it provides a large number of services, which often require identification of its users. Since distributed or shared identification systems are not yet widely adopted ...
    • D-FOAF - Security Aspects in Distributed User Management System 

      Grzonkowski, Slawomir; Gzella, Adam; Kruk, Sebastian Ryszard; Woroniecki, Tomasz (IEEE, 2005)
      The contemporary Internet offers various services ranging from electronic newspapers to on-line social networks. To authorize themselves, users have to register to on-line services. However, most of the ...
    • D-FOAF: Distributed Identity Management with Access Rights Delegation 

      Kruk, Sebastian Ryszard; Grzonkowski, Slawomir; Gzella, Adam; Woroniecki, Tomasz; Choi, Hee Chul (2006)
      WWW provides a large number of services, which often require identification of its users. This has led to the fact that today users have to maintain a large number of different credentials for different websites - ...