help button home button JAMIA Hate scrolling?
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

First published December 20, 2007 as JAMIA PrePrint; doi:10.1197/jamia.M2544
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M2544v1
15/2/150    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Freifeld, C. C.
Right arrow Articles by Brownstein, J. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Freifeld, C. C.
Right arrow Articles by Brownstein, J. S.
J Am Med Inform Assoc. 2008;15:150-157. DOI 10.1197/jamia.M2544.
© 2008 American Medical Informatics Association


Model Formulation

HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports

Clark C. Freifelda,*, Kenneth D. Mandla,b,c, Ben Y. Reisa,b,c and John S. Brownsteina,b,c

a Children’s Hospital Informatics Program at the Harvard-MIT Division of Health Sciences and Technology, Boston, MA
b Division of Emergency Medicine, Children’s Hospital Boston, Boston, MA
c Department of Pediatrics, Harvard Medical School, Boston, MA.

* Correspondence: Clark C. Freifeld, Children’s Hospital Informatics Program at the Harvard-MIT Division of Health Sciences and Technology, 300 Longwood Ave., Boston, MA 02115 (Email: clark.freifeld{at}childrens.harvard.edu).

Received for publication: 06/29/07; accepted for publication: 11/29/07.

Objective: Unstructured electronic information sources, such as news reports, are proving to be valuable inputs for public health surveillance. However, staying abreast of current disease outbreaks requires scouring a continually growing number of disparate news sources and alert services, resulting in information overload. Our objective is to address this challenge through the HealthMap.org Web application, an automated system for querying, filtering, integrating and visualizing unstructured reports on disease outbreaks.

Design: This report describes the design principles, software architecture and implementation of HealthMap and discusses key challenges and future plans.

Measurements: We describe the process by which HealthMap collects and integrates outbreak data from a variety of sources, including news media (e.g., Google News), expert-curated accounts (e.g., ProMED Mail), and validated official alerts. Through the use of text processing algorithms, the system classifies alerts by location and disease and then overlays them on an interactive geographic map. We measure the accuracy of the classification algorithms based on the level of human curation necessary to correct misclassifications, and examine geographic coverage.

Results: As part of the evaluation of the system, we analyzed 778 reports with HealthMap, representing 87 disease categories and 89 countries. The automated classifier performed with 84% accuracy, demonstrating significant usefulness in managing the large volume of information processed by the system. Accuracy for ProMED alerts is 91% compared to Google News reports at 81%, as ProMED messages follow a more regular structure.

Conclusion: HealthMap is a useful free and open resource employing text-processing algorithms to identify important disease outbreak information through a user-friendly interface.




This article has been cited by other articles:


Home page
CMAJHome page
K. Wilson and J. S. Brownstein
Early detection of disease outbreaks using the Internet
Can. Med. Assoc. J., April 14, 2009; 180(8): 829 - 831.
[Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
C. G. Chute
Biosurveillance, classification, and semantic health technologies.
J. Am. Med. Inform. Assoc., March 1, 2008; 15(2): 172 - 173.
[Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2008 by the American Medical Informatics Association.