The role of negative results for choosing an evaluation approach - a recommender systems case study

Heitmann, Benjamin; Hayes, Conor

View/Open

NoISE2015_paper_4.pdf (316.0Kb)

Date

2015-06-01

Author

Heitmann, Benjamin

Hayes, Conor

Metadata

Show full item record

Usage

This item's downloads: 255 (view details)

Recommended Citation

Heitmann, Benjamin, & Hayes, Conor, (2015). The Role of Negative Results for Choosing an Evaluation Approach - A Recommender Systems Case Study Paper presented at the NoISE 2015 : Workshop on Negative or Inconclusive Results in Semantic Web, Portoroz, Slovenia, 01 June.

Published Version

http://ceur-ws.org/Vol-1435/

Abstract

We describe a case study, which shows how important negative results are in uncovering biased evaluation methodologies. Our re- search question is how to compare a recommender algorithm that uses an RDF graph to a recommendation algorithm that uses rating data. Our case study uses DBpedia 3.8 and the MovieLens 100k data set. We show that the most popular evaluation protocol in the recommender sys- tems literature is biased towards evaluating collaborative filtering (CF) algorithms, as it uses the rating prediction task. Based on the negative results of this first experiment, we find an alternative evaluation task, the top-k recommendation task. While this task is harder to perform, our positive results show that it is a much better fit, which is not biased to- wards either CF or our graph-based algorithm. The second set of results are statistically significant (Wilcoxon rank sum test, p

URI

http://hdl.handle.net/10379/6561

Collections

Data Science Institute (Workshop Papers)

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland