help button home button JAMIA Hate scrolling?
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH

First published April 24, 2008 as JAMIA PrePrint; doi:10.1197/jamia.M2732
Journal of the American Medical Informatics Association 2008;15(4):559-568
© 2008 American Medical Informatics Association


A more recent version of this article appeared on July 1, 2008
This Article
Right arrow Full Text (PDF)
Right arrow Data Supplement
Right arrow All Versions of this Article:
M2732v1
15/4/559    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Shironoshita, E. P.
Right arrow Articles by Kabuka, M. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shironoshita, E. P.
Right arrow Articles by Kabuka, M. R.

Submitted on January 28, 2008
Accepted on April 16, 2008

semCDI: A Query Formulation for Semantic Data Integration in caBIG

E. Patrick Shironoshita MS1, Yves R. Jean-Mary MS1, Ray M. Bradley1, and Mansur R. Kabuka PhD2*

Affiliation of the authors: 1 INFOTECH Soft, Inc., Miami, FL; 2 INFOTECH Soft, Inc., Miami, FL; University of Miami, Coral Gables, FL

* To whom correspondence should be addressed.

Objective

To develop mechanisms to formulate queries over the semantic representation of cancer-related data services available through the cancer Biomedical Informatics Grid (caBIG).

Design The semCDI query formulation uses a view of caBIG semantic concepts, metadata, and data as an ontology, and defines a methodology to specify queries using the SPARQL query language, extended with Horn rules. semCDI enables the joining of data that represent different concepts through associations modeled as object properties, and the merging of data representing the same concept in different sources through Common Data Elements (CDE) modeled as datatype properties, using Horn rules to specify additional semantics indicating conditions for merging data.

Validation In order to validate this formulation, a prototype has been constructed, and two queries have been executed against currently available caBIG data services.

Discussion The semCDI query formulation uses the rich semantic metadata available in caBIG to build queries and integrate data from multiple sources. Its promise will be further enhanced as more data services are registered in caBIG, and as more linkages can be achieved between the knowledge contained within caBIG's NCI Thesaurus and the data contained in the Data Services. The intellectual property rights for the semCDI query formulation presented in this paper are held by INFOTECHSoft, Inc.

Conclusion semCDI provides a formulation for the creation of queries on the semantic representation of caBIG. This constitutes the foundation to build a semantic data integration system for more efficient and effective querying and exploratory searching of cancer-related data.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
Copyright © 1994 by the American Medical Informatics Association.