Browsing Data Science Institute (Workshop Papers) by Author "http://dx.doi.org/10.13039/501100008530"

Now showing items 1-17 of 17

BEARS: Towards an evaluation framework for bandit-based interactive recommender systems

Barraza-Urbina, Andrea; Koutrika, Georgia; d'Aquin, Mathieu; Hayes, Conor (NUI Galway, 2018-10-06)

Recommender Systems (RS) deployed in fast-paced dynamic scenarios must quickly learn to adapt in response to user evaluative feedback. In these settings, the RS faces an online learning problem where each decision should ...
CoFiF: A corpus of financial reports in French language

Ahmadi, Sina; Daudert, Tobias (NUI Galway, 2019-08-12)

In an era when machine learning and artificial intelligence have huge momentum, the data demand to train and test models is steadily growing. We introduce CoFiF, the first corpus comprising company reports in the French ...
A comparative study of different state-of-the-art hate speech detection methods in Hindi-English code-mixed data

Rani, Priya; Suryawanshi, Shardul; Goswami, Koustava; Chakravarthi, Bharathi Raja; Fransen, Theodorus; McCrae, John P. (European Language Resources Association (ELRA), 2020-05-11)

Hate speech detection in social media communication has become one of the primary concerns to avoid conflicts and curb undesired activities. In an environment where multilingual speakers switch among multiple languages, ...
Corpus creation for sentiment analysis in code-mixed Tamil-English text

Chakravarthi, Bharathi Raja; Muralidaran, Vigneshwaran; Priyadharshini, Ruba; McCrae, John P. (European Language Resources Association (ELRA), 2020-05-11)

Understanding the sentiment of a comment from a video or an image is an essential task in many applications. Sentiment analysis of a text can be useful for various decision-making processes. One such application is to ...
A dataset for troll classification of Tamil memes

Chakravarthi, Bharathi Raja; Varma, Pranav; Arcan, Mihael; McCrae, John P.; Buitelaar, Paul; Shardul, Suryawanshi (European Language Resources Association (ELRA), 2020-05-11)

Social media are interactive platforms that facilitate the creation or sharing of information, ideas or other forms of expression among people. This exchange is not free from offensive, trolling or malicious contents ...
Edge2Guard: Botnet attacks detecting offline models for resource-constrained IoT devices

Sudharsan, Bharath; Sundaram, Dineshkumar; Patel, Pankesh; Breslin, John G.; Ali, Muhammad Intizar (National University of Ireland Galway, 2021-03-22)

In today's IoT smart environments, dozens of MCU-based connected device types exist such as HVAC controllers, smart meters, smoke detectors, etc. The security conditions for these essential IoT devices remain unsatisfactory ...
Enhancing multiple-choice question answering with causal knowledge

Dalal, Dhairya; Arcan, Mihael; Buitelaar, Paul (Association for Computational Linguistics, 2021-06-10)

The task of causal question answering aims to reason about causes and effects over a provided real or hypothetical premise. Recent approaches have converged on using transformer-based language models to solve question ...
Multilingual multimodal machine translation for Dravidian languages utilizing phonetic transcription

Chakravarthi, Bharathi Raja; Priyadharshini, Ruba; Stearns, Bernardo; Jayapal, Arun; Sridevy, S.; Arcan, Mihael; Zarrouk, Manel; McCrae, John P. (European Association for Machine Translation, 2019-08-19)

Multimodal machine translation is the task of translating from a source text into the target language using information from other modalities. Existing multimodal datasets have been restricted to only highly resourced ...
Multimodal meme dataset (MultiOFF) for identifying offensive content in image and text

Suryawanshi, Shardul; Chakravarthi, Bharathi Raja; Arcan, Mihael; Buitelaar, Paul (European Language Resources Association (ELRA), 2020-05-11)

A meme is a form of media that spreads an idea or emotion across the internet. As posting memes has become a new form of communication on the web, due to the multimodal nature of memes, postings of hateful memes or related ...
Neural machine translation of literary texts from English to Slovene

Kuzman, Taja; Vintar, Špela; Arčan, Mihael (Machine Translation Summit 2019, 2019-08-19)

Neural Machine Translation has shown promising performance in literary texts. Since literary machine translation has not yet been researched for the English-to-Slovene translation direction, this paper aims to fulfill ...
NUIG at the FinSBD Task: Sentence boundary detection for noisy financial PDFs in English and French

Daudert, Tobias; Ahmadi, Sina (NUI Galway, 2019-08-12)

Portable Document Format (PDF) has become the industry-standard document as it is independent of the software, hardware or operating system. Publicly listed companies annually publish a variety of reports and to take ...
NUIG at TIAD: Combining unsupervised NLP and graph metrics for translation inference

McCrae, John P.; Arcan, Mihael (European Language Resources Association (ELRA), 2020-05-11)

In this paper, we present the NUIG system at the TIAD shared task. This system includes graph-based metrics calculated using novel algorithms, with an unsupervised document embedding tool called ONETA and an unsupervised ...
NUIG-DSI at the WebNLG+ challenge: Leveraging transfer learning for RDF-to-text generation

Pasricha, Nivranshu; Arcan, Mihael; Buitelaar, Paul (Association for Computational Linguistics, 2020-12-18)

This paper describes the system submitted by NUIG-DSI to the WebNLG+ challenge 2020 in the RDF-to-text generation task for the English language. For this challenge, we leverage transfer learning by adopting the T5 model ...
NUIG-DSI’s submission to the GEM Benchmark 2021

Pasricha, Nivranshu; Arcan, Mihael; Buitelaar, Paul (Association for Computational Linguistics, 2021-08-05)

This paper describes the submission by NUIG-DSI to the GEM benchmark 2021. We participate in the modeling shared task where we submit outputs on four datasets for data-to-text generation, namely, DART, WebNLG (en), E2E and ...
A sentiment analysis dataset for code-mixed Malayalam-English

Chakravarthi, Bharathi Raja; Jose, Navya; Suryawanshi, Shardul; Sherly, Elizabeth; McCrae, John P. (European Language Resources Association (ELRA), 2020-05-11)

There is an increasing demand for sentiment analysis of text from social media which are mostly code-mixed. Systems trained on monolingual data fail for code-mixed data due to the complexity of mixing at different levels ...
Towards sharing task environments to support reproducible evaluations of interactive recommender systems

Barraza-Urbina, Andrea; d'Aquin, Mathieu (NUI Galway, 2019-09-20)

Beyond sharing datasets or simulations, we believe the Recommender Systems (RS) community should share Task Environments. In this work, we propose a high-level logical architecture that will help to reason about the core ...
Utilising knowledge graph embeddings for data-to-text generation

Pasricha, Nivranshu; Arcan, Mihael; Buitelaar, Paul (Association for Computational Linguistics, 2020-12-18)

Data-to-text generation has recently seen a move away from modular and pipeline architectures towards end-to-end architectures based on neural networks. In this work, we employ knowledge graph embeddings and explore their ...
