
First published February 5, 2004 as JAMIA PrePrint; doi:10.1197/jamia.M1474
J Am Med Inform Assoc. 2004;11:179-185. DOI 10.1197/jamia.M1474.
© 2004 American Medical Informatics Association


Research Paper

A Frequency-based Technique to Improve the Spelling Suggestion Rank in Medical Queries

Jonathan Crowell, MS, Qing Zeng, PhD, Long Ngo, PhD and Eve-Marie Lacroix, MS

Affiliations of the authors: Decision Systems Group, Brigham & Women's Hospital, Harvard Medical School, Boston, MA (JC, QZ); Department of Biostatistics, Harvard School of Public Health, Boston, MA (LN); Public Services Division, National Library of Medicine, Bethesda, MD (E-ML).

Correspondence and reprints: Jonathan Crowell, MS, Decision Systems Group, Brigham & Women's Hospital, Harvard Medical School, Boston, MA 02115; e-mail: <jcrowell{at}dsg.bwh.harvard.edu>.

Received for publication: 10/28/03; accepted for publication: 12/21/03.

Objective: There is an abundance of health-related information online, and millions of consumers search for such information. Spell checking is of crucial importance in returning pertinent results, so the authors propose a technique for increasing the effectiveness of spell-checking tools used for health-related information retrieval.

Design: A sample of incorrectly spelled medical terms was submitted to two different spell-checking tools, and the resulting suggestions, derived under two different dictionary configurations, were re-sorted according to how frequently each term appeared in log data from a medical search engine.
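The re-sorting step described in the Design can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the function name `resort_by_frequency`, the sample suggestions, and the counts in `term_counts` are all hypothetical, and the abstract does not specify how ties or unseen terms were handled. Here, terms missing from the log data are given a count of zero, and ties keep the spell checker's original order.

```python
from typing import Dict, List


def resort_by_frequency(suggestions: List[str], term_counts: Dict[str, int]) -> List[str]:
    """Re-sort spelling suggestions so that more frequent terms come first.

    Assumption: terms absent from the log data count as zero.
    sorted() is stable, so suggestions with equal counts keep the
    order in which the spell checker returned them.
    """
    return sorted(suggestions, key=lambda s: -term_counts.get(s.lower(), 0))


# Hypothetical suggestion list for a misspelled query, in the spell checker's order.
suggestions = ["diabolic", "diabetes", "diabetic"]

# Hypothetical term frequencies, standing in for counts from search engine log data.
term_counts = {"diabetes": 5200, "diabetic": 1300}

reranked = resort_by_frequency(suggestions, term_counts)
print(reranked)
```

In this toy case the medically frequent term moves to the first rank, while the term with no log occurrences drops to the end of the list.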

Measurements: Univariable analysis was carried out to assess the effect of each factor (spell-checking tool, dictionary type, and re-sorting versus no re-sorting) on the probability of success. The factors that were statistically significant in the univariable analysis were then entered into a multivariable analysis to evaluate the independent effect of each.

Results: The re-sorted suggestions proved to be significantly more accurate than the original list returned by the spell-checking tool. The odds of finding the correct suggestion in the number one rank were increased by 63% after re-sorting using the authors' method. This effect was independent of both the dictionary and the spell-checking tools that were used.
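To make the reported effect size concrete, note that a 63% increase in odds corresponds to an odds ratio of 1.63, which is not the same as a 63% increase in probability. The sketch below converts a baseline probability into the probability implied by a given odds ratio. The function name `apply_odds_ratio` and the baseline value of 0.50 are hypothetical and chosen only for illustration; the abstract does not report baseline success rates.

```python
def apply_odds_ratio(p_before: float, odds_ratio: float) -> float:
    """Return the probability obtained by multiplying the odds of p_before by odds_ratio.

    p_before must lie strictly between 0 and 1.
    """
    if not 0.0 < p_before < 1.0:
        raise ValueError("p_before must be strictly between 0 and 1")
    odds_before = p_before / (1.0 - p_before)  # probability -> odds
    odds_after = odds_before * odds_ratio      # apply the odds ratio
    return odds_after / (1.0 + odds_after)     # odds -> probability


# Hypothetical baseline: the correct suggestion is ranked first half of the time.
p_before = 0.50
p_after = apply_odds_ratio(p_before, 1.63)  # 1.63 = odds increased by 63%
print(round(p_after, 3))
```

Under this assumed baseline, the probability of a first-rank hit rises to roughly 0.62, an absolute gain of about 12 percentage points; a different baseline would give a different absolute gain for the same odds ratio.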

Conclusion: Using knowledge about the frequency of a given word's occurrence in the medical domain can significantly improve spelling correction for medical queries.






