help button home button JAMIA Hate scrolling?
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

First published October 24, 2008 as JAMIA PrePrint; doi:10.1197/jamia.M2950
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M2950v1
16/1/109    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Uzuner, O.
Right arrow Articles by Sibanda, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Uzuner, O.
Right arrow Articles by Sibanda, T.
J Am Med Inform Assoc. 2009;16:109-115. DOI 10.1197/jamia.M2950.
© 2009 American Medical Informatics Association


Research Paper

Machine Learning and Rule-based Approaches to Assertion Classification

Özlem Uzuner, PhDa,b,c,*, Xiaoran Zhangb and Tawanda Sibanda, MEngb

a Information Studies, State University of New York, Albany, NY
b MIT CSAIL, Cambridge, MA
c Computer Engineering, Middle East Technical University, Northern Cyprus Campus, Kalkanli, Guzelyurt, Cyprus

* Correspondence: Özlem Uzuner Draper 114A, 135 Western Ave, Albany NY 12222 (Email: ouzuner{at}albany.edu).

Received for publication: 08/06/08; accepted for publication: 09/28/08.

Objectives: The authors study two approaches to assertion classification. One of these approaches, Extended NegEx (ENegEx), extends the rule-based NegEx algorithm to cover alter-association assertions; the other, Statistical Assertion Classifier (StAC), presents a machine learning solution to assertion classification.

Design: For each mention of each medical problem, both approaches determine whether the problem, as asserted by the context of that mention, is present, absent, or uncertain in the patient, or associated with someone other than the patient. The authors use these two systems to (1) extend negation and uncertainty extraction to recognition of alter-association assertions, (2) determine the contribution of lexical and syntactic context to assertion classification, and (3) test if a machine learning approach to assertion classification can be as generally applicable and useful as its rule-based counterparts.

Measurements: The authors evaluated assertion classification approaches with precision, recall, and F-measure.

Results: The ENegEx algorithm is a general algorithm that can be directly applied to new corpora. Despite being based on machine learning, StAC can also be applied out-of-the-box to new corpora and achieve similar generality.

Conclusion: The StAC models that are developed on discharge summaries can be successfully applied to radiology reports. These models benefit the most from words found in the ± 4 word window of the target and can outperform ENegEx.




This article has been cited by other articles:


Home page
J. Am. Med. Inform. Assoc.Home page
L. Ohno-Machado and R. A. Miller
Outstanding submissions to the AMIA annual symposium now featured in JAMIA.
J. Am. Med. Inform. Assoc., January 1, 2009; 16(1): 143 - 144.
[Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2009 by the American Medical Informatics Association.