Browsing Data Science Institute (Workshop Papers) by Issue Date
Now showing items 61-80 of 118
-
Fast and Scalable Pattern Mining for Media-Type Focused Crawling
(2009)Search engines targeting content other than hypertext documents require a crawler that discovers resources identifying files of certain media types. Naive crawling approaches do not guarantee a sufficient supply of new ... -
Linked Open Data in Sensor Data Mashups
(CEUR, 2009)Sensors and the real-time data they produce are novel sources of information which need to be integrated into the Semantic Web at very large scale. Most of the time such data is locked inside specific applications and only ... -
Towards Cross-Community Effects in Scientific Communities
(2009)Community effects on the behaviour of individuals, the community itself and other communities can be observed in a wide range of applications. This is true in scientific research, where communities of researchers have ... -
Towards a Practical Emergent Knowledge Exploitation
(AAAI Press, 2009) -
Describing Linked Datasets - On the Design and Usage of voiD, the 'Vocabulary of Interlinked Datasets'
(2009)In this paper we discuss the design and implementation of voiD, the \Vocabulary Of Interlinked Datasets", a vocabulary that allows to formally describe linked RDF datasets. We report on use cases for voiD, the current state ... -
Converging Web and Desktop Data with Konduit
(2009)In this paper we present Konduit, a desktop-based platform for visual scripting with RDF data. Based on the idea of the semantic desktop, non-technical users can create, manipulate and mash-up RDF data with Konduit, and ... -
DING! Dataset Ranking using Formal Descriptions
(2009)Considering that thousands if not millions of linked datasets will be published soon, we motivate in this paper the need for an efficient and effective way to rank interlinked datasets based on formal descriptions of their ... -
SIOC: Content Exchange and Semantic Interoperability Between Social Networks
(2009)This paper describes work performed during the last few years in the context of the SIOC project in order to model social data on the Web using Semantic Web technologies. We will give an overview of the SIOC model, ... -
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia Fragments
(2009)In this paper, we introduce interlinking multimedia (iM), a pragmatic way to apply the linked data principles to fragments of multimedia items. We report on use cases showing the need for retrieving and describing multimedia ... -
Visual Abstraction and Ordering in Faceted Browsing of Text Collections
(2010)While faceted navigation interfaces can assist users in exploring an information collection, there is yet little support for users in choosing a relevant item from the set of items returned from a filtering process. In ... -
Weaving the Pedantic Web
(CEUR, 2010)Over a decade after RDF has been published as a W3C recommendation, publishing open and machine-readable content on the Web has recently received a lot more attention, including from corporate and governmental bodies; ... -
Towards Dataset Dynamics: Change Frequency of Linked Open Data Sources
(CEUR, 2010)Datasets in the LOD cloud are far from being static in their nature and how they are exposed. As resources are added and new links are set, applications consuming the data should be able to deal with these changes. In ... -
Enabling case-based reasoning on the web of data
(2010-07-20)While Case-based reasoning (CBR) has successfully been deployed on the Web, its data models are typically inconsistent with existing information infrastructure and standards. In this paper, we examine how CBR can operate ... -
Triplifying Wikipedia's tables
(CEUR-WS.org, 2013)We are currently investigating methods to triplify the content of Wikipedia's tables. We propose that existing knowledge-bases can be leveraged to semi-automatically extract high-quality facts (in the form of RDF triples) ... -
Learning content patterns from linked data
(CEUR-WS.org, 2014)Linked Data (LD) datasets (e.g., DBpedia, Freebase) are used in many knowledge extraction tasks due to the high variety of domains they cover. Unfortunately, many of these datasets do not provide a description for their ... -
µRaptor: A DOM-based system with appetite for hCard elements
(CEUR-WS.org, 2014)This paper describes µRaptor, a DOM-based method to extract hCard microformats from HTML pages stripped of microformat markup. µRaptor extracts DOM sub-trees, converts them into rules, and uses them to extract hCard ... -
Constructing Twitter Datasets using Signals for Event Detection Evaluation
(22nd International Conference on Case-Based Reasoning, 2014-09-29)Twitter is a very attractive real-time platform for research on event detection. However, despite the great amount of interest, datasets suitable for evaluating such methods are not easily available. The two most important ... -
A linked data-based decision tree classifier to review movies
(CEUR-WS.org, 2015)In this paper, we describe our contribution to the 2015 Linked Data Mining Challenge. The proposed task is concerned with the prediction of review of movies as good or bad , as does Metacritic website based on critics ...