
First published December 20, 2007 as JAMIA PrePrint; doi:10.1197/jamia.M2544
Journal of the American Medical Informatics Association 2008;15(2):150-157
© 2008 American Medical Informatics Association


A more recent version of this article appeared on March 1, 2008

Submitted on June 29, 2007
Accepted on November 29, 2007

HealthMap: Global infectious disease monitoring through automated classification and visualization of Internet media reports

Clark C. Freifeld1*, Kenneth D. Mandl2, Ben Y. Reis2, and John S. Brownstein2

Affiliation of the authors: 1 Children's Hospital Informatics Program at the Harvard-MIT Division of Health Sciences and Technology, Boston, MA; 2 Children's Hospital Informatics Program at the Harvard-MIT Division of Health Sciences and Technology, Boston, MA; Division of Emergency Medicine, Children's Hospital Boston, Boston, MA; Department of Pediatrics, Harvard Medical School, Boston, MA

* To whom correspondence should be addressed.

Objective Unstructured electronic information sources, such as news reports, are proving to be valuable inputs for public health surveillance. However, staying abreast of current disease outbreaks requires scouring a continually growing number of disparate news sources and alert services, resulting in information overload. Our objective is to address this challenge through the HealthMap.org Web application, an automated system for querying, filtering, integrating and visualizing unstructured reports on disease outbreaks.

Design This report describes the design principles, software architecture and implementation of HealthMap and discusses key challenges and future plans.

Measurements We describe the process by which HealthMap collects and integrates outbreak data from a variety of sources, including news media (e.g., Google News), expert-curated accounts (e.g., ProMED Mail), and validated official alerts. Through the use of text processing algorithms, the system classifies alerts by location and disease and then overlays them on an interactive geographic map. We measure the accuracy of the classification algorithms based on the level of human curation necessary to correct misclassifications, and examine geographic coverage.
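The curation-based accuracy measure described above can be illustrated with a minimal sketch. It assumes each report is recorded as a (source, was_corrected) pair, where was_corrected marks that a human curator had to fix the automated classification. The function name, the record layout and the sample data are hypothetical and are not taken from the HealthMap implementation.

```python
from collections import defaultdict


def accuracy_by_source(reports):
    """Return (per-source accuracy, overall accuracy).

    reports: iterable of (source, was_corrected) pairs. A report counts as
    correctly classified when no human correction was needed. This record
    layout is an assumption made for illustration only.
    """
    totals = defaultdict(int)
    correct = defaultdict(int)
    for source, was_corrected in reports:
        totals[source] += 1
        if not was_corrected:
            correct[source] += 1
    per_source = {s: correct[s] / totals[s] for s in totals}
    overall = sum(correct.values()) / sum(totals.values())
    return per_source, overall


# Made-up sample, not data from the study.
sample = [
    ("ProMED", False),
    ("ProMED", False),
    ("Google News", True),
    ("Google News", False),
]
per_source, overall = accuracy_by_source(sample)
```

Under this definition, overall accuracy is the report-weighted average of the per-source accuracies, so a source that contributes more reports has more influence on the overall figure.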

Results As part of the evaluation of the system, we analyzed 778 reports with HealthMap, representing 87 disease categories and 89 countries. The automated classifier performed with 84% accuracy, demonstrating considerable usefulness in managing the large volume of information processed by the system. Accuracy was 91% for ProMED alerts, compared with 81% for Google News reports, as ProMED messages follow a more regular structure.

Conclusion HealthMap is a useful, free and open resource that employs text-processing algorithms to identify important disease outbreak information and presents it through a user-friendly interface.







