help button home button JAMIA Bigger figures
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

First published March 28, 2003 as JAMIA PrePrint; doi:10.1197/jamia.M1157
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M1157v1
10/4/330    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wilcox, A. B.
Right arrow Articles by Hripcsak, G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wilcox, A. B.
Right arrow Articles by Hripcsak, G.
Journal of the American Medical Informatics Association 10:330-338 (2003)
© 2003 American Medical Informatics Association


Research Paper

The Role of Domain Knowledge in Automating Medical Text Report Classification

Adam B. Wilcox, PhD and George Hripcsak, MD, MS

Affiliations: of the authors: Department of Medical Informatics, University of Utah, Salt Lake City, Utah (ABW); Medical Informatics, Intermountain Health Care, Salt Lake City, Utah (ABW); Department of Medical Informatics, Columbia University, New York, New York (GH), USA

Correspondence and reprints: Adam B. Wilcox, PhD, Medical Informatics, Intermountain Health Care, 4646 West Lake Park Blvd., Salt Lake City, UT 84120; e-mail: <lpawilco{at}ihc.com>

Received for publication: 05/14/02; accepted for publication: 03/03/03.

Objective: To analyze the effect of expert knowledge on the inductive learning process in creating classifiers for medical text reports.

Design: The authors converted medical text reports to a structured form through natural language processing. They then inductively created classifiers for medical text reports using varying degrees and types of expert knowledge and different inductive learning algorithms. The authors measured performance of the different classifiers as well as the costs to induce classifiers and acquire expert knowledge.

Measurements: The measurements used were classifier performance, training-set size efficiency, and classifier creation cost.

Results: Expert knowledge was shown to be the most significant factor affecting inductive learning performance, outweighing differences in learning algorithms. The use of expert knowledge can affect comparisons between learning algorithms. This expert knowledge may be obtained and represented separately as knowledge about the clinical task or about the data representation used. The benefit of the expert knowledge is more than that of inductive learning itself, with less cost to obtain.

Conclusion: For medical text report classification, expert knowledge acquisition is more significant to performance and more cost-effective to obtain than knowledge discovery. Building classifiers should therefore focus more on acquiring knowledge from experts than trying to learn this knowledge inductively.




This article has been cited by other articles:


Home page
J. Am. Med. Inform. Assoc.Home page
N. K. Mishra, D. M. Cummo, J. J. Arnzen, and J. Bonander
A Rule-based Approach for Identifying Obesity and Its Comorbidities in Medical Discharge Summaries
J. Am. Med. Inform. Assoc., July 1, 2009; 16(4): 576 - 579.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
S. V.S. Pakhomov, J. D. Buntrock, and C. G. Chute
Automating the Assignment of Diagnosis Codes to Patient Encounters Using Example-based and Machine Learning Techniques
J. Am. Med. Inform. Assoc., September 1, 2006; 13(5): 516 - 525.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
B. Hazlehurst, H. R. Frost, D. F. Sittig, and V. J. Stevens
MediClass: A System for Detecting and Classifying Encounter-based Clinical Events in Any Electronic Medical Record
J. Am. Med. Inform. Assoc., September 1, 2005; 12(5): 517 - 529.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2003 by the American Medical Informatics Association.