help button home button JAMIA Hate scrolling?
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH

First published February 5, 2004 as JAMIA PrePrint; doi:10.1197/jamia.M1474
Journal of the American Medical Informatics Association 2004;11(3):179-185
© 2004 American Medical Informatics Association


A more recent version of this article appeared on May 1, 2004
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M1474v1
11/3/179    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Crowell, J.
Right arrow Articles by Lacroix, E.-M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Crowell, J.
Right arrow Articles by Lacroix, E.-M.

Submitted on October 28, 2003
Accepted on December 21, 2003

A Frequency-based Technique to Improve the Spelling Suggestion Rank in Medical Queries

Jonathan Crowell MS1*, Qing Zeng PhD1, Long Ngo PhD2, and Eve-Marie Lacroix MS3

Affiliation of the authors: 1 Decision Systems Group, Brigham & Women's Hospital, Harvard Medical School, Boston, MA; 2 Department of Biostatistics, Harvard School of Public Health, Boston, MA; 3 Public Services Division, National Library of Medicine, Bethesda, MD

* To whom correspondence should be addressed.

Objective There is an abundance of health-related information online, and millions of consumers search for such information. Spell checking is of crucial importance in returning pertinent results, so we propose a technique for increasing the effectiveness of spell-checking tools used for health-related information retrieval.

Design A sample of incorrectly-spelled medical terms was submitted to two different spell checking tool, and the resulting suggestions, derived under two different dictionary configurations, were re-sorted according to how frequently each term appeared in log data from a medical search engine.

Measurements Univariable analysis was carried out to assess the effect of each factor (spell checking tool, dictionary type, re-sort or no re-sort) on the probability of success. The factors that were statistically significant in the univariable analysis were then used in multivariable analysis to evaluate the independent effect of each of the factors.

Results The re-sorted suggestions proved to be significantly more accurate than the original list returned by the spell-checking tool. The odds of finding the correct suggestion in the number one rank were increased by 63% after re-sorting using our method. This effect was independent of both the dictionary and the spell checking tool that were used.

Conclusion Using knowledge about the frequency of a given word's occurrence in the medical domain can significantly improve spelling correction for medical queries.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
Copyright © 1994 by the American Medical Informatics Association.