The Role of Community-Driven Data Curation for Enterprises

View/ Open
Date
2010Author
Curry, Edward
Freitas, Andre
O'Ríain, Seán
Metadata
Show full item recordUsage
This item's downloads: 6346 (view details)
Recommended Citation
Curry, Edward and Freitas, Andre and O'Ri\'ain, Sean (2010) 'The Role of Community-Driven Data Curation for Enterprises' In: Wood, David(Eds.). Linking Enterprise Data. New York : Springer US.
Published Version
Abstract
With increased utilization of data within their operational
and strategic processes, enterprises need to ensure data quality and accuracy.
Data curation is a process that can ensure the quality of data and its fitness
for use. Traditional approaches to curation are struggling with increased data
volumes, and near real-time demands for curated data. In response, curation
teams have turned to community crowd-sourcing and semi-automated metadata tools
for assistance. This chapter provides an overview of data curation, discusses
the business motivations for curating data and investigates the role of
community-based data curation, focusing on internal communities and
pre-competitive data collaborations. The chapter is supported by case studies
from Wikipedia, The New York Times, Thomson Reuters, Protein Data Bank and
ChemSpider upon which best practices for both social and technical aspects of
community-driven data curation are described.
Description
Book chapter