help button home button JAMIA Hate scrolling?
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

First published August 21, 2007 as JAMIA PrePrint; doi:10.1197/jamia.M2130
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M2130v1
14/6/736    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by McCowan, I. A.
Right arrow Articles by Fry, M.-J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by McCowan, I. A.
Right arrow Articles by Fry, M.-J.
J Am Med Inform Assoc. 2007;14:736-745. DOI 10.1197/jamia.M2130.
© 2007 American Medical Informatics Association


Application of Information Technology

Collection of Cancer Stage Data by Classifying Free-text Medical Reports

Iain A. McCowan, PhDa,*, Darren C. Moore, MEnga, Anthony N. Nguyen, PhDa, Rayleen V. Bowman, PhDb, Belinda E. Clarke, PhDc, Edwina E. Duhigc and Mary-Jane Fryd

a CSIRO e-Health Research Centre, Brisbane, Australia
b Department of Medicine, University of Queensland, Brisbane, Australia
c Department of Anatomical Pathology, The Prince Charles Hospital, Brisbane, Australia
d Queensland Cancer Control Analysis Team, Queensland Health, Brisbane, Australia

* Correspondence: Iain McCowan, PhD, CSIRO e-Health Research Centre, PO Box 10842 Adelaide Street, Brisbane QLD 4000, Australia (Email: iain.mccowan{at}csiro.au).

Received for publication: 04/19/06; accepted for publication: 08/02/07.

Cancer staging provides a basis for planning clinical management, but also allows for meaningful analysis of cancer outcomes and evaluation of cancer care services. Despite this, stage data in cancer registries is often incomplete, inaccurate, or simply not collected. This article describes a prototype software system (Cancer Stage Interpretation System, CSIS) that automatically extracts cancer staging information from medical reports. The system uses text classification techniques to train support vector machines (SVMs) to extract elements of stage listed in cancer staging guidelines. When processing new reports, CSIS identifies sentences relevant to the staging decision, and subsequently assigns the most likely stage. The system was developed using a database of staging data and pathology reports for 710 lung cancer patients, then validated in an independent set of 179 patients against pathologic stage assigned by two independent pathologists. CSIS achieved overall accuracy of 74% for tumor (T) staging and 87% for node (N) staging, and errors were observed to mirror disagreements between human experts.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2007 by the American Medical Informatics Association.