help button home button JAMIA Hate scrolling?
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH

First published September 23, 2002 as JAMIA PrePrint; doi:10.1197/jamia.M1133
Journal of the American Medical Informatics Association 2003;10(1):21-38
© 2003 American Medical Informatics Association


A more recent version of this article appeared on January 1, 2003
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M1133v1
10/1/21    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Fisk, J. M.
Right arrow Articles by Nadkarni, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Fisk, J. M.
Right arrow Articles by Nadkarni, P.

Submitted on April 8, 2002
Accepted on August 5, 2002

Integrating Query of Relational and Textual Data in Clinical Databases: a Case Study

John M. Fisk MD1*, Pradeep Mutalik MD2, Forrest W. Levin MS3, Joseph Erdos MD, PhD4, Caroline Taylor MD5, and Prakash Nadkarni MD2

Affiliation of the authors: 1 Center of Medical Informatics, Yale University School of Medicine, New Haven, CT; 2 Center for Medical Informatics, Yale University School of Medicine, New Haven, CT; 3 Information Technology Office, Veterans Administration Medical Center, West Haven, CT; 4 Information Technology Office, Veterans Administraton Medical Center, West Haven, CT; 5 Department of Radiology, Veterans Administration Medical Center, West Haven, CT

* To whom correspondence should be addressed.

Objective The authors designed and implemented a clinical data mart composed of an integrated information retrieval (IR) and relational database management system (RDBMS).

Design Using commodity software, which supports interactive, attribute-centric text and relational searches, the mart houses 2.8 million documents that span a five-year period, and supports basic IR features such as Boolean searches, stemming, and proximity and fuzzy searching.

Measurements Results are relevance-ranked using either "total documents per patient" or "report type weighting".

Results Non-curated medical text has a significant degree of malformation with respect to spelling and punctuation, which creates difficulties for text indexing and searching. Presently, the IR facilities of RDBMS packages lack the features necessary to adequately handle such malformed text.

Conclusion A robust IR+RDBMS system can be developed, but it requires integrating RDBMSs with third party IR software: RDBMS vendors need to make their IR offerings more accessible to non-programmers.




This article has been cited by other articles:


Home page
J. Am. Med. Inform. Assoc.Home page
Q. T. Zeng, J. Crowell, R. M. Plovnick, E. Kim, L. Ngo, and E. Dibble
Assisting Consumer Health Information Retrieval with Query Recommendations
J. Am. Med. Inform. Assoc., January 1, 2006; 13(1): 80 - 90.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
Copyright © 1994 by the American Medical Informatics Association.