help button home button JAMIA Hate scrolling?
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Stein, H. D.
Right arrow Articles by Miller, P. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Stein, H. D.
Right arrow Articles by Miller, P. L.
J Am Med Inform Assoc. 2000;7:42-54. DOI .
© 2000 American Medical Informatics Association


Research Paper

Exploring the Degree of Concordance of Coded and Textual Data in Answering Clinical Queries from a Clinical Data Repository

H. David Stein, MD, Prakash Nadkarni, MD, Joseph Erdos, MD, PhD and Perry L. Miller, MD, PhD

Yale University School of Medicine, New Haven, Connecticut.

Corresdpondence and reprints: H. David Stein, MD, Yale University School of Medicine, Center for Medical Informatics, 333 Decar Street, P.O. Box 208009, New Haven, CT 06520-8009; e-mail: hdavid.stein{at}yale.edu.

Received for publication: 03/31/99; accepted for publication: 08/16/99.

Objective: To query a clinical data repository (CDR) for answers to clinical questions to determine whether different types of fields (coded and free text) would yield confirmatory, complementary, or conflicting information and to discuss the issues involved in producing the discrepancies between the fields.

Methods: The appropriate data fields in a subset of a CDR (5,135 patient records) were searched for the answers to three questions related to surgical procedures. Each search included at least one coded data field and at least one free-text field. The identified free-text records were then searched manually to ensure correct interpretation. The fields were then compared to determine whether they agreed with each other, were supportive of each other, contained no entry (absence of data), or were contradictory.

Results: The degree of concordance varied greatly according to the field and the question asked. Some fields were not granular enough to answer the question. The free-text fields often gave an answer that was not definitive. Absence of data was most logically interpreted in some cases as lack of completion of data and in others as a negative answer. Even with a question as specific as which side a hernia was on, contradictory data were found in 5 to 8 percent of the records.

Conclusions: Using the data in the CDR to answer clinical questions can yield significantly disparate results depending on the question and which data fields are searched. A database cannot just be queried in automated fashion and the results reported. Both coded and textual fields must be searched to obtain the fullest assessment. This can be expected to result in information that may be confirmatory, complementary, or conflicting. To yield the most accurate information possible, final answers to questions require human judgment and may require the gathering of additional information.




This article has been cited by other articles:


Home page
J. Am. Med. Inform. Assoc.Home page
S. B. Johnson, S. Bakken, D. Dine, S. Hyun, E. Mendonca, F. Morrison, T. Bright, T. Van Vleck, J. Wrenn, and P. Stetson
An Electronic Health Record Based on Structured Narrative
J. Am. Med. Inform. Assoc., January 1, 2008; 15(1): 54 - 64.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
J. A. Linder, D. W. Bates, D. H. Williams, M. A. Connolly, and B. Middleton
Acute Infections in Primary Care: Accuracy of Electronic Diagnoses and Electronic Antibiotic Prescribing
J. Am. Med. Inform. Assoc., January 1, 2006; 13(1): 61 - 66.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
D. G. T. Arts, N. F. de Keizer, and G.-J. Scheffer
Defining and Improving Data Quality in Medical Registries: A Literature Review, Case Study, and Generic Framework
J. Am. Med. Inform. Assoc., November 1, 2002; 9(6): 600 - 611.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
C. Lovis and R. H. Baud
Fast Exact String Pattern-matching Algorithms Adapted to the Characteristics of the Medical Language
J. Am. Med. Inform. Assoc., July 1, 2000; 7(4): 378 - 391.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
BRENNAN and STEAD
Assessing Data Quality: From Concordance, through Correctness and Completeness, to ValidManipulatable Representations
J. Am. Med. Inform. Assoc., January 1, 2000; 7(1): 106 - 107.
[Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2000 by the American Medical Informatics Association.