Show simple item record

dc.contributor.authorSuryawanshi, Shardul
dc.contributor.authorChakravarthi, Bharathi Raja
dc.contributor.authorArcan, Mihael
dc.contributor.authorBuitelaar, Paul
dc.identifier.citationSuryawanshi, Shardul, Chakravarthi, Bharathi Raja, Arcan, Mihael, & Buitelaar, Paul. (2020). Multimodal meme dataset (MultiOFF) for identifying offensive content in image and text. Paper presented at the Language Resources and Evaluation Conference (LREC 2020) Second Workshop on Trolling, Aggression and Cyberbullying, Marseille, France, 11-16 May.en_IE
dc.description.abstractA meme is a form of media that spreads an idea or emotion across the internet. As posting meme has become a new form of communication of the web, due to the multimodal nature of memes, postings of hateful memes or related events like trolling, cyberbullying are increasing day by day. Hate speech, offensive content and aggression content detection have been extensively explored in a single modality such as text or image. However, combining two modalities to detect offensive content is still a developing area. Memes make it even more challenging since they express humour and sarcasm in an implicit way, because of which the meme may not be offensive if we only consider the text or the image. Therefore, it is necessary to combine both modalities to identify whether a given meme is offensive or not. Since there was no publicly available dataset for multimodal offensive meme content detection, we leveraged the memes related to the 2016 U.S. presidential election and created the MultiOFF multimodal meme dataset for offensive content detection dataset. We subsequently developed a classifier for this task using the MultiOFF dataset. We use an early fusion technique to combine the image and text modality and compare it with a text- and an image-only baseline to investigate its effectiveness. Our results show improvements in terms of Precision, Recall, and F-Score. The code and dataset for this paper is published in Multimodal-Meme-Classification-Identifying-Offensive-Content-in-Image-and-Texten_IE
dc.description.sponsorshipThis publication has emanated from research supported in part by a research grant from Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289 P2, co-funded by the European Regional Development Fund, as well as by the H2020 project Pret- ˆ a-LLOD under Grant Agreement ` number 825182.en_IE
dc.publisherEuropean Language Resources Association (ELRA)en_IE
dc.relation.ispartofProceedings of the Second Workshop on Trolling, Aggression and Cyberbullyingen
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland
dc.subjectmultimodal dataen_IE
dc.subjectoffensive contenten_IE
dc.subjectopinion miningen_IE
dc.titleMultimodal meme dataset (MultiOFF) for identifying offensive content in image and texten_IE
dc.typeWorkshop paperen_IE
dc.contributor.funderScience Foundation Irelanden_IE
dc.contributor.funderEuropean Regional Development Funden_IE
dc.contributor.funderHorizon 2020en_IE
dc.local.contactShardul Suryawanshi, Ida Business Park, Lower Dangan, Galway. Email:
dcterms.projectinfo:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/en_IE
dcterms.projectinfo:eu-repo/grantAgreement/EC/H2020::RIA/825182/EU/Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors/Pret-a-LLODen_IE

Files in this item


This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland