| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The practice of informatics |
a Department of Computer Science, Yale University, New Haven, CT
b Center for Medical Informatics, Yale University, New Haven, CT
c Department of Anesthesiology, Yale University, New Haven, CT
d Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT
* Correspondence and reprints: John Corwin, Department of Computer Science, Yale University, P.O. Box 208285, New Haven, CT 06520-8285; Tel: (203) 432-1246; Fax: (203) 432-0593. (Email: john.corwin{at}yale.edu).
Received for publication: 06/27/06; accepted for publication: 10/05/06.
Data sparsity and schema evolution issues affecting clinical informatics and bioinformatics communities have led to the adoption of vertical or object-attribute–value-based database schemas to overcome limitations posed when using conventional relational database technology. This paper explores these issues and discusses why biomedical data are difficult to model using conventional relational techniques. The authors propose a solution to these obstacles based on a relational database engine using a sparse, column-store architecture. The authors provide benchmarks comparing the performance of queries and schema-modification operations using three different strategies: (1) the standard conventional relational design; (2) past approaches used by biomedical informatics researchers; and (3) their sparse, column-store architecture. The performance results show that their architecture is a promising technique for storing and processing many types of data that are not handled well by the other two semantic data models.
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |