help button home button JAMIA Hate scrolling?
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH

First published December 15, 2005 as JAMIA PrePrint; doi:10.1197/jamia.M1920
Journal of the American Medical Informatics Association 2006;13(2):160-165
© 2006 American Medical Informatics Association


A more recent version of this article appeared on March 1, 2006
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M1920v1
13/2/160    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Cassa, C. A.
Right arrow Articles by Mandl, K. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cassa, C. A.
Right arrow Articles by Mandl, K. D.

Submitted on July 27, 2005
Accepted on November 28, 2005

A Novel, Context-Sensitive Approach to Anonymizing Spatial Surveillance Data: Impact on Outbreak Detection

Christopher A. Cassa MEng1*, Shaun J. Grannis MD, MS2, J. Marc Overhage MD, PhD2, and Kenneth D. Mandl MD, MPH3

Affiliation of the authors: 1 Children's Hospital Informatics Program, Children's Hospital Boston, Boston, MA; Harvard-MIT Division of Health Sciences and Technology, Cambridge, MA; Clinical Decision Making Group, Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA; 2 Indiana University School of Medicine, Indianapolis, IN; The Regenstrief Institute, Inc., Indianapolis, IN; 3 Children's Hospital Informatics Program, Children's Hospital Boston, Boston, MA; Harvard-MIT Division of Health Sciences and Technology, Cambridge, MA; Harvard Medical School, Boston, MA

* To whom correspondence should be addressed.

Objective The use of spatially-based methods and algorithms in epidemiology and surveillance presents privacy challenges for researchers and public health agencies. We describe a novel method for anonymizing individuals in public health datasets, by transposing their spatial locations through a process informed by the underlying population density. Further, we measure the impact of the skew on detection of spatial clustering as measured by a spatial scanning statistic.

Design Cases were emergency department (ED) visits for respiratory illness. Baseline ED visit data were injected with artificially-created clusters ranging in magnitude, shape, and location. The geocoded locations were then transformed using a de-identification algorithm that accounts for the local underlying population density.

Measurements 12,600 separate weeks of case data with artificially created clusters were combined with control data and the impact on detection of spatial clustering identified by a spatial scan statistic was measured.

Results The anonymization algorithm produced an expected skew of cases which resulted in high values of dataset k-anonymity. De-identification that moves points an average distance of 0.25km lowers the spatial cluster detection sensitivity by less than 4%, and lowers the detection specificity less than 1%.

Conclusion A population-density based Gaussian spatial blurring markedly decreases the ability to identify individuals in a dataset while only slightly decreasing the performance of a standardly used outbreak detection tool. These findings suggest new approaches to anonymizing data spatial epidemiology and surveillance.




This article has been cited by other articles:


Home page
J. Am. Med. Inform. Assoc.Home page
S. Sengupta, N. S. Calman, and G. Hripcsak
A Model for Expanded Public Health Reporting in the Context of HIPAA
J. Am. Med. Inform. Assoc., September 1, 2008; 15(5): 569 - 574.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
B. Y. Reis, C. Kirby, L. E. Hadden, K. Olson, A. J. McMurry, J. B. Daniel, and K. D. Mandl
AEGIS: A Robust and Scalable Real-time Public Health Surveillance System
J. Am. Med. Inform. Assoc., September 1, 2007; 14(5): 581 - 588.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
A. J. McMurry, C. A. Gilbert, B. Y. Reis, H. C. Chueh, I. S. Kohane, and K. D. Mandl
A Self-scaling, Distributed Information Architecture for Public Health, Research, and Clinical Care
J. Am. Med. Inform. Assoc., July 1, 2007; 14(4): 527 - 533.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. C. Wieland, J. S. Brownstein, B. Berger, and K. D. Mandl
Density-equalizing Euclidean minimum spanning trees for the detection of all disease cluster shapes
PNAS, May 29, 2007; 104(22): 9404 - 9409.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Public HealthHome page
K. L. Olson, S. J. Grannis, and K. D. Mandl
Privacy Protection Versus Cluster Detection in Spatial Epidemiology
Am J Public Health, November 1, 2006; 96(11): 2002 - 2008.
[Abstract] [Full Text] [PDF]


Home page
NEJMHome page
J. S. Brownstein, C. A. Cassa, and K. D. Mandl
No Place to Hide -- Reverse Identification of Patients from Published Maps
N. Engl. J. Med., October 19, 2006; 355(16): 1741 - 1742.
[Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
Copyright © 1994 by the American Medical Informatics Association.