Show simple item record

dc.contributor.advisorHayes, Conor
dc.contributor.authorTorres-Tramón, Pablo
dc.date.accessioned2020-07-30T07:32:25Z
dc.date.available2020-07-30T07:32:25Z
dc.date.issued2020-05-14
dc.identifier.urihttp://hdl.handle.net/10379/16109
dc.description.abstractThe assessment of semantic relatedness for a given pair of entities in a knowledge graph has become a critical step in a wide variety of artificial intelligence tasks, including but not restricted to fields such as machine learning, natural language processing, and information retrieval. Semantic relatedness is a generalisation of semantic similarity; entities are semantically assessed by virtue of their relationships in the knowledge graph rather than by their inherent similarity. Semantic relatedness measures have been widely addressed in the research literature, producing a wide range of relatedness functions. These functions require to enumerate paths between entities, using a simple and yet powerful intuition: the more the number of paths connecting the entities, the more related. However, extracting paths from knowledge graphs is computationally expensive. Since the number of paths increases exponentially with the number of edges, a denser graph affects the tractability of the assessment. This issue becomes critical in online services where potential computational bottlenecks can be a point of failure and delays. In this thesis, we introduce an approach to semantic relatedness based on diffusion processes over knowledge graphs. We argue that diffusion processes can replace paths as the source of semantics without affecting the performance of these measures in real-world applications. We formalise this form of relatedness, and we compare them against their path-based cousins. We also study the methods to compute diffusion in large knowledge graphs. Our findings show that diffusion-based models behave similarly to path-based ones in terms of ranking of entity pairs and have a better computational performance. We evaluate the computational cost of our models and give recommendations to build real-world applications from them. To this end, we tested and evaluated our model in two relevant applications: entity retrieval in knowledge graphs and entity linking over streams of text. In the former, we introduced a two-stage retrieval model that combined a standard information retrieval model with a re-rank function based on our form of relatedness. In the latter, our model was able to overcome the computational bottleneck of linking entities in microblog posts using diffusion-based relatedness models and producing annotations in the text.en_IE
dc.publisherNUI Galway
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/3.0/ie/
dc.subjectSemantic Relatednessen_IE
dc.subjectDiffusion Processesen_IE
dc.subjectKnowledge Graphsen_IE
dc.subjectInformation Retrievalen_IE
dc.subjectData scienceen_IE
dc.subjectComputer Scienceen_IE
dc.subjectEngineering and Informaticsen_IE
dc.titleDiffusion-based models for semantic relatednessen_IE
dc.typeThesisen
dc.contributor.funderScience Foundation Irelanden_IE
dc.local.noteSemantic relatedness between entities is a critical step in Artificial Intelligence. Current approaches employ the connections between entities in knowledge graphs to determine relatedness. However, obtaining these connections can be very slow. We introduce a novel approach based on diffusion processes that can reduce computational performance significantly.en_IE
dc.local.finalYesen_IE
dcterms.projectinfo:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/en_IE
nui.item.downloads281


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland