Now showing items 1-13 of 13

    • A comparative study of different state-of-the-art hate speech detection methods in Hindi-English code-mixed data 

      Rani, Priya; Suryawanshi, Shardul; Goswami, Koustava; Chakravarthi, Bharathi Raja; Fransen, Theodorus; McCrae, John P. (European Language Resources Association (ELRA), 2020-05-11)
      Hate speech detection in social media communication has become one of the primary concerns to avoid conflicts and curb undesired activities. In an environment where multilingual speakers switch among multiple languages, ...
    • Corpus creation for sentiment analysis in code-mixed Tamil-English text 

      Chakravarthi, Bharathi Raja; Muralidaran, Vigneshwaran; Priyadharshini, Ruba; McCrae, John P. (European Language Resources Association (ELRA), 2020-05-11)
      Understanding the sentiment of a comment from a video or an image is an essential task in many applications. Sentiment analysis of a text can be useful for various decision-making processes. One such application is to ...
    • A dataset for troll classification of Tamil memes 

      Chakravarthi, Bharathi Raja; Varma, Pranav; Arcan, Mihael; McCrae, John P.; Buitelaar, Paul; Shardul, Suryawanshi (European Language Resources Association (ELRA), 2020-05-11)
      Social media are interactive platforms that facilitate the creation or sharing of information, ideas or other forms of expression among people. This exchange is not free from offensive, trolling or malicious contents ...
    • Improving wordnets for under-resourced languages using machine translation 

      Chakravarthi, Bharathi Raja; Arcan, Mihael; McCrae, John P. (Global Wordnet Association, 2018-01-08)
      Wordnets are extensively used in natural language processing, but the current approaches for manually building a wordnet from scratch involves large research groups for a long period of time, which are typically not ...
    • Multilingual multimodal machine translation for Dravidian languages utilizing phonetic transcription 

      Chakravarthi, Bharathi Raja; Priyadharshini, Ruba; Stearns, Bernardo; Jayapal, Arun; Sridevy, S.; Arcan, Mihael; Zarrouk, Manel; McCrae, John P. (European Association for Machine Translation, 2019-08-19)
      Multimodal machine translation is the task of translating from a source text into the target language using information from other modalities. Existing multimodal datasets have been restricted to only highly resourced ...
    • Multimodal meme dataset (MultiOFF) for identifying offensive content in image and text 

      Suryawanshi, Shardul; Chakravarthi, Bharathi Raja; Arcan, Mihael; Buitelaar, Paul (European Language Resources Association (ELRA), 2020-05-11)
      A meme is a form of media that spreads an idea or emotion across the internet. As posting meme has become a new form of communication of the web, due to the multimodal nature of memes, postings of hateful memes or related ...
    • NUIG-Panlingua-KMI Hindi-Marathi MT Systems for Similar Language Translation Task @ WMT 2020 

      Ojha, Atul Kr.; Rani, Priya; Bansal, Akanksha; Chakravarthi, Bharathi Raja; Kumar, Ritesh; McCrae, John P. (Association for Computational Linguistics, 2020-11-19)
      NUIG-Panlingua-KMI submission to WMT 2020 seeks to push the state-of-the-art in the Similar language translation task for the Hindi ↔ Marathi language pair. As part of these efforts, we conducted a series of experiments to ...
    • A sentiment analysis dataset for code-mixed Malayalam-English 

      Chakravarthi, Bharathi Raja; Jose, Navya; Suryawanshi, Shardul; Sherly, Elizabeth; McCrae, John P. (European Language Resources Association (ELRA), 2020-05-11)
      There is an increasing demand for sentiment analysis of text from social media which are mostly code-mixed. Systems trained on monolingual data fail for code-mixed data due to the complexity of mixing at different levels ...
    • A survey of current datasets for code-switching research 

      Jose, Navya; Chakravarthi, Bharathi Raja; Suryawanshi, Shardul; Sherly, Elizabeth; McCrae, John P. (IEEE, 2020-03-06)
      Code switching is a prevalent phenomenon in the multilingual community and social media interaction. In the past ten years, we have witnessed an explosion of code switched data in the social media that brings together ...
    • ULD@NUIG at SemEval-2020 Task 9: Generative morphemes with an attention model for sentiment analysis in code-mixed text 

      Goswami, Koustava; Rani, Priya; Chakravarthi, Bharathi Raja; Fransen, Theodorus; McCrae, John P. (International Committee for Computational Linguistics, 2020)
      Code mixing is a common phenomena in multilingual societies where people switch from one language to another for various reasons. Recent advances in public communication over different social media sites have led to an ...
    • Unsupervised deep language and dialect identification for short texts 

      Goswami, Koustava; Sarkar, Rajdeep; Chakravarthi, Bharathi Raja; Fransen, Theodorus; McCrae, John P. (International Committee on Computational Linguistics, 2020-12)
      Automatic Language Identification (LI) or Dialect Identification (DI) of short texts of closely related languages or dialects, is one of the primary steps in many natural language processing pipelines. Language identification ...
    • Unsupervised method to analyze playing styles of EPL teams using ball possession-position data 

      Verma, Pranav; Sudharsan, Bharath; Chakravarthi, Bharathi Raja; O'Riordan, Colm; Hill, Seamus (IEEE, 2020-03-06)
      In the English Premier League (EPL) matches, a network of advanced systems gathers sports data in real-time to build a possession-position dataset. In this work, data fields from the sophisticated raw possession-position ...
    • WordNet gloss translation for under-resourced languages using multilingual neural machine translation 

      Chakravarthi, Bharathi Raja; Arcan, Mihael; McCrae, John P. (European Association for Machine Translation, 2019-08-19)
      In this paper, we translate the glosses in the English WordNet based on the expand approach for improving and generating wordnets with the help of multilingual neural machine translation. Neural Machine Translation (NMT) ...