Browsing Data Science Institute (Workshop Papers) by Issue Date

Fast and Scalable Pattern Mining for Media-Type Focused Crawling

Umbrich, Jürgen; Karnstedt, Marcel; Harth, Andreas (2009)

Search engines targeting content other than hypertext documents require a crawler that discovers resources identifying files of certain media types. Naive crawling approaches do not guarantee a sufficient supply of new ...

Linked Open Data in Sensor Data Mashups

Phuoc, Danh Le; Hauswirth, Manfred (CEUR, 2009)

Sensors and the real-time data they produce are novel sources of information which need to be integrated into the Semantic Web at very large scale. Most of the time such data is locked inside specific applications and only ...

Towards Cross-Community Effects in Scientific Communities

Karnstedt, Marcel; Hayes, Conor (2009)

Community effects on the behaviour of individuals, the community itself and other communities can be observed in a wide range of applications. This is true in scientific research, where communities of researchers have ...

Towards a Practical Emergent Knowledge Exploitation

Nováček, Vít (AAAI Press, 2009)

Describing Linked Datasets - On the Design and Usage of voiD, the 'Vocabulary of Interlinked Datasets'

Cyganiak, Richard; Hausenblas, Michael (2009)

In this paper we discuss the design and implementation of voiD, the \Vocabulary Of Interlinked Datasets", a vocabulary that allows to formally describe linked RDF datasets. We report on use cases for voiD, the current state ...

Converging Web and Desktop Data with Konduit

Dragan, Laura; Möller, Knud; Handschuh, Siegfried; Ambrus, Oszkar (2009)

In this paper we present Konduit, a desktop-based platform for visual scripting with RDF data. Based on the idea of the semantic desktop, non-technical users can create, manipulate and mash-up RDF data with Konduit, and ...

DING! Dataset Ranking using Formal Descriptions

Toupikov, Nickolai; Umbrich, Jürgen; Delbru, Renaud; Hausenblas, Michael; Tummarello, Giovanni (2009)

Considering that thousands if not millions of linked datasets will be published soon, we motivate in this paper the need for an efficient and effective way to rank interlinked datasets based on formal descriptions of their ...

Integrating Social Networks and Sensor Networks

Breslin, John; Decker, Stefan; Hauswirth, Manfred; Hynes, Gearoid; Phuoc, Danh Le; Passant, Alexandre; Polleres, Axel; Rabsch, Cornelius; Reynolds, Vinny (2009)

SIOC: Content Exchange and Semantic Interoperability Between Social Networks

Breslin, John; Bojars, Uldis; Passant, Alexandre; Fernández, Sergio; Decker, Stefan (2009)

This paper describes work performed during the last few years in the context of the SIOC project in order to model social data on the Web using Semantic Web technologies. We will give an overview of the SIOC model, ...

Enabling Trust and Privacy on the Social Web

Passant, Alexandre; Hausenblas, Michael; Polleres, Axel; Decker, Stefan (2009)

Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia Fragments

Hausenblas, Michael (2009)

In this paper, we introduce interlinking multimedia (iM), a pragmatic way to apply the linked data principles to fragments of multimedia items. We report on use cases showing the need for retrieving and describing multimedia ...

Visual Abstraction and Ordering in Faceted Browsing of Text Collections

Thai, VinhTuan; Handschuh, Siegfried (2010)

While faceted navigation interfaces can assist users in exploring an information collection, there is yet little support for users in choosing a relevant item from the set of items returned from a filtering process. In ...

Weaving the Pedantic Web

Hogan, Aidan; Harth, Andreas; Passant, Alexandre; Decker, Stefan; Polleres, Axel (CEUR, 2010)

Over a decade after RDF has been published as a W3C recommendation, publishing open and machine-readable content on the Web has recently received a lot more attention, including from corporate and governmental bodies; ...

Towards Dataset Dynamics: Change Frequency of Linked Open Data Sources

Umbrich, Jürgen; Hausenblas, Michael; Hogan, Aidan; Polleres, Axel; Decker, Stefan (CEUR, 2010)

Datasets in the LOD cloud are far from being static in their nature and how they are exposed. As resources are added and new links are set, applications consuming the data should be able to deal with these changes. In ...

Enabling case-based reasoning on the web of data

Heitmann, Benjamin; Hayes, Conor (2010-07-20)

While Case-based reasoning (CBR) has successfully been deployed on the Web, its data models are typically inconsistent with existing information infrastructure and standards. In this paper, we examine how CBR can operate ...

Triplifying Wikipedia's tables

Muñoz, Emir; Hogan, Aidan; Mileo, Alessandra (CEUR-WS.org, 2013)

We are currently investigating methods to triplify the content of Wikipedia's tables. We propose that existing knowledge-bases can be leveraged to semi-automatically extract high-quality facts (in the form of RDF triples) ...

Learning content patterns from linked data

Muñoz, Emir (CEUR-WS.org, 2014)

Linked Data (LD) datasets (e.g., DBpedia, Freebase) are used in many knowledge extraction tasks due to the high variety of domains they cover. Unfortunately, many of these datasets do not provide a description for their ...

µRaptor: A DOM-based system with appetite for hCard elements

Muñoz, Emir; Costabello, Luca; Vandenbussche, Pierre-Yves (CEUR-WS.org, 2014)

This paper describes µRaptor, a DOM-based method to extract hCard microformats from HTML pages stripped of microformat markup. µRaptor extracts DOM sub-trees, converts them into rules, and uses them to extract hCard ...

Constructing Twitter Datasets using Signals for Event Detection Evaluation

Hromic, Hugo; Hayes, Conor (22nd International Conference on Case-Based Reasoning, 2014-09-29)

Twitter is a very attractive real-time platform for research on event detection. However, despite the great amount of interest, datasets suitable for evaluating such methods are not easily available. The two most important ...

A linked data-based decision tree classifier to review movies

Aldarra, Suad; Muñoz, Emir (CEUR-WS.org, 2015)

In this paper, we describe our contribution to the 2015 Linked Data Mining Challenge. The proposed task is concerned with the prediction of review of movies as good or bad , as does Metacritic website based on critics ...