dc.contributor.author | Wood, Ian D. | |
dc.date.accessioned | 2016-02-01T10:12:53Z | |
dc.date.available | 2016-02-01T10:12:53Z | |
dc.date.issued | 2015-10 | |
dc.identifier.citation | Wood, Ian D. (2015, October 19-23). Community topic usage in social networks. Proceedings of the 2015 Workshop on Topic Models: Post-Processing and Applications. Paper presented at the CIKM'15 24th ACM International Conference on Information and Knowledge Management, Melbourne, VIC, Australia. | en_IE |
dc.identifier.isbn | 978-1-4503-3784-7 | |
dc.identifier.uri | http://hdl.handle.net/10379/5514 | |
dc.description.abstract | When studying large social media data sets, it is useful to reduce the dimensionality of both the network (e.g. by finding communities) and user-generated data such as text (e.g. using topic models). Algorithms exist for both these tasks; however, their combination has received little attention, and the models proposed to date are not scalable (e.g. [4]). One approach to such combined modelling is to perform community and topic modelling independently and later combine the results. In the case of overlapping communities, this combination requires a method for attributing each user's topic usage to the communities in which she participates. This paper presents a Bayesian model for attributing individual documents to communities which balances the user's proportional community membership with community topic coherence. Community topic usage is modelled with a Dirichlet distribution with fixed concentration parameter, leading to a well-defined conjugate prior. Though the prior is computationally expensive, the already reduced dimensionality in both topics and communities makes a tractable algorithm feasible, even for large data sets. The model is applied to a corpus of tweets and Twitter follower relations collected on hashtags used by people with eating disorders [14]. | en_IE |
dc.description.sponsorship | Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289 (INSIGHT), European Union supported projects LIDER (ICT-2013.4.1-610782), MixedEmotions (H2020-644632). | en_IE |
dc.format | application/pdf | en_IE |
dc.language.iso | en | en_IE |
dc.publisher | ACM | en_IE |
dc.relation.ispartof | 2015 Workshop on Topic Models: Post-Processing and Applications (CIKM) | en |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Ireland | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/3.0/ie/ | |
dc.subject | Topic models | en_IE |
dc.subject | Community detection | en_IE |
dc.subject | Bayesian inference | en_IE |
dc.subject | Conjugate prior | en_IE |
dc.subject | Dirichlet distribution | en_IE |
dc.subject | Author community membership | en_IE |
dc.title | Community topic usage in social networks | en_IE |
dc.type | Conference Paper | en_IE |
dc.date.updated | 2016-01-08T12:39:29Z | |
dc.identifier.doi | 10.1145/2809936.2809937 | |
dc.local.publishedsource | http://dl.acm.org/citation.cfm?id=2809937 | en_IE |
dc.description.peer-reviewed | peer-reviewed | |
dc.internal.rssid | 10020525 | |
dc.local.contact | Ian Wood. Email: ian.wood@nuigalway.ie | |
dc.local.copyrightchecked | No | |
dc.local.version | ACCEPTED | |
nui.item.downloads | 390 | |