Semantic web data warehousing for caGrid.

Authors: McCusker JP; Phillips JA; González Beltrán A; Finkelstein A; Krauthammer M

Abstract: The National Cancer Institute (NCI) is developing caGrid as a means for sharing cancer-related data and services. As more data sets become available on caGrid, we need effective ways of accessing and integrating this information. Although the data models exposed on caGrid are semantically well annotated, it is currently up to the caGrid client to infer relationships between the different models and their classes. In this paper, we present a Semantic Web-based data warehouse (Corvus) for creating relationships among caGrid models. This is accomplished through the transformation of semantically-annotated caBIG Unified Modeling Language (UML) information models into Web Ontology Language (OWL) ontologies that preserve those semantics. We demonstrate the validity of the approach by Semantic Extraction, Transformation and Loading (SETL) of data from two caGrid data sources, caTissue and caArray, as well as alignment and query of those sources in Corvus. We argue that semantic integration is necessary for integration of data from distributed web services and that Corvus is a useful way of accomplishing this. Our approach is generalizable and of broad utility to researchers facing similar integration challenges.

Keywords: Computational Biology/*methods; Databases, Protein; Humans; Information Storage and Retrieval/*methods; *Internet; Neoplasm Proteins/chemistry; Neoplasms/*metabolism; Semantics; Tumor Markers, Biological/chemistry; User-Computer Interface
Journal: BMC bioinformatics
Volume: 10 Suppl 10
Pages: S2
Date: Oct. 7, 2009
PMID: 19796399


Categories: Ontology, Cancer
Citation:

McCusker JP, Phillips JA, González Beltrán A, Finkelstein A, Krauthammer M (2009) Semantic web data warehousing for caGrid. BMC bioinformatics 10 Suppl 10: S2.


