Show simple item record

dc.contributor.authorRani, Priya
dc.contributor.authorSuryawanshi, Shardul
dc.contributor.authorGoswami, Koustava
dc.contributor.authorChakravarthi, Bharathi Raja
dc.contributor.authorFransen, Theodorus
dc.contributor.authorMcCrae, John P.
dc.identifier.citationRani, Priya, Suryawanshi, Shardul, Goswami, Koustava, Chakravarthi, Bharathi Raja, Fransen, Theodorus, & McCrae, John P. (2020). A comparative study of different state-of-the-art hate speech detection methods in Hindi-English code-mixed data. Paper presented at the Language Resources and Evaluation Conference (LREC 2020) Second Workshop on Trolling, Aggression and Cyberbullying, Marseille, France, 11-16 May.en_IE
dc.description.abstractHate speech detection in social media communication has become one of the primary concerns to avoid conflicts and curb undesired activities. In an environment where multilingual speakers switch among multiple languages, hate speech detection becomes a challenging task using methods that are designed for monolingual corpora. In our work, we attempt to analyze, detect and provide a comparative study of hate speech in a code-mixed social media text. We also provide a Hindi-English code-mixed data set consisting of Facebook and Twitter posts and comments. Our experiments show that deep learning models trained on this code-mixed corpus perform better.en_IE
dc.description.sponsorshipThis publication has emanated from research supported in part by a research grant from Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289 (Insight), SFI/12/RC/2289 P2 (Insight 2), & SFI/18/CRT/6223 (CRT-Centre for Research Training in Artficial Intelligence) co-funded by the European Regional Development Fund as well as by the EU H2020 programme under grant agreements 731015 (ELEXIS-European Lexical Infrastructure), 825182 (Pret- ˆ a-LLOD), and Irish Research Council ` grant IRCLA/2017/129 (CARDAMOM-Comparative Deep Models of Language for Minority and Historical Languages). The authors are grateful to Ajay Bohra and his team for sharing their data set and for their support. We would also like to thank our annotators for their contribution and lending us their precious time.en_IE
dc.publisherEuropean Language Resources Association (ELRA)en_IE
dc.relation.ispartofProceedings of the Second Workshop on Trolling, Aggression and Cyberbullyingen
dc.subjectHate Speechen_IE
dc.subjectCode mixingen_IE
dc.subjectConvolutional Neural Networksen_IE
dc.titleA comparative study of different state-of-the-art hate speech detection methods in Hindi-English code-mixed dataen_IE
dc.typeWorkshop paperen_IE
dc.contributor.funderScience Foundation Irelanden_IE
dc.contributor.funderEuropean Regional Development Funden_IE
dc.contributor.funderHorizon 2020en_IE
dc.contributor.funderIrish Research Councilen_IE
dc.local.contactShardul Suryawanshi, Ida Business Park, Lower Dangan, Galway. Email:
dcterms.projectinfo:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/en_IE
dcterms.projectinfo:eu-repo/grantAgreement/EC/H2020::RIA/731015/EU/European Lexicographic Infrastructure/ELEXISen_IE
dcterms.projectinfo:eu-repo/grantAgreement/EC/H2020::RIA/825182/EU/Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors/Pret-a-LLODen_IE

Files in this item

Attribution-NonCommercial-NoDerivs 3.0 Ireland
This item is available under the Attribution-NonCommercial-NoDerivs 3.0 Ireland. No item may be reproduced for commercial purposes. Please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.

The following license files are associated with this item:


This item appears in the following Collection(s)

Show simple item record