Show simple item record

dc.contributor.authorZappa, Achille
dc.contributor.authorSplendiani, Andrea
dc.contributor.authorRomano, Paolo
dc.date.accessioned2018-09-20T16:28:57Z
dc.date.available2018-09-20T16:28:57Z
dc.date.issued2012-01-01
dc.identifier.citationZappa, Achille; Splendiani, Andrea; Romano, Paolo (2012). Towards linked open gene mutations data. BMC Bioinformatics 13 ,
dc.identifier.issn1471-2105
dc.identifier.urihttp://hdl.handle.net/10379/14515
dc.description.abstractBackground: With the advent of high-throughput technologies, a great wealth of variation data is being produced. Such information may constitute the basis for correlation analyses between genotypes and phenotypes and, in the future, for personalized medicine. Several databases on gene variation exist, but this kind of information is still scarce in the Semantic Web framework. In this paper, we discuss issues related to the integration of mutation data in the Linked Open Data infrastructure, part of the Semantic Web framework. We present the development of a mapping from the IARC TP53 Mutation database to RDF and the implementation of servers publishing this data. Methods: A version of the IARC TP53 Mutation database implemented in a relational database was used as first test set. Automatic mappings to RDF were first created by using D2RQ and later manually refined by introducing concepts and properties from domain vocabularies and ontologies, as well as links to Linked Open Data implementations of various systems of biomedical interest. Since D2RQ query performances are lower than those that can be achieved by using an RDF archive, generated data was also loaded into a dedicated system based on tools from the Jena software suite. Results: We have implemented a D2RQ Server for TP53 mutation data, providing data on a subset of the IARC database, including gene variations, somatic mutations, and bibliographic references. The server allows to browse the RDF graph by using links both between classes and to external systems. An alternative interface offers improved performances for SPARQL queries. The resulting data can be explored by using any Semantic Web browser or application. Conclusions: This has been the first case of a mutation database exposed as Linked Data. A revised version of our prototype, including further concepts and IARC TP53 Mutation database data sets, is under development. The publication of variation information as Linked Data opens new perspectives: the exploitation of SPARQL searches on mutation data and other biological databases may support data retrieval which is presently not possible. Moreover, reasoning on integrated variation data may support discoveries towards personalized medicine.
dc.publisherSpringer Nature
dc.relation.ispartofBMC Bioinformatics
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/3.0/ie/
dc.subjectsemantic-web
dc.subjectnetwork
dc.subjectdisease
dc.subjectimpact
dc.subjectmodel
dc.titleTowards linked open gene mutations data
dc.typeArticle
dc.identifier.doi10.1186/1471-2105-13-s4-s7
dc.local.publishedsourcehttps://bmcbioinformatics.biomedcentral.com/track/pdf/10.1186/1471-2105-13-S4-S7?site=bmcbioinformatics.biomedcentral.com
nui.item.downloads0


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland