help button home button JAMIA Bigger figures
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Zafar, A.
Right arrow Articles by McDonald, C. J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zafar, A.
Right arrow Articles by McDonald, C. J.
Journal of the American Medical Informatics Association 6:195-204 (1999)
© 1999 American Medical Informatics Association


Technology Evaluation

Continuous Speech Recognition for Clinicians

Atif Zafar, MD, J. Marc Overhage, MD and Clement J. McDonald, MD

Indiana University, Regenstrief Institute for Health Care, Indianapolis, Indiana.

Correspondence and reprints: Atif Zafar, MD, Regenstrief Institute for Health Care, 1001 West 10th Street, RHC 5th Floor, Indianapolis, IN 46202-2859.e-mail: <zafar_a{at}regenstrief.iupui.edu >.

The current generation of continuous speech recognition systems claims to offer high accuracy (greater than 95 percent) speech recognition at natural speech rates (150 words per minute) on low-cost (under $2000) platforms. This paper presents a state-of-the-technology summary, along with insights the authors have gained through testing one such product extensively and other products superficially.

The authors have identified a number of issues that are important in managing accuracy and usability. First, for efficient recognition users must start with a dictionary containing the phonetic spellings of all words they anticipate using. The authors dictated 50 discharge summaries using one inexpensive internal medicine dictionary ($30) and found that they needed to add an additional 400 terms to get recognition rates of 98 percent. However, if they used either of two more expensive and extensive commercial medical vocabularies ($349 and $695), they did not need to add terms to get a 98 percent recognition rate. Second, users must speak clearly and continuously, distinctly pronouncing all syllables. Users must also correct errors as they occur, because accuracy improves with error correction by at least 5 percent over two weeks. Users may find it difficult to train the system to recognize certain terms, regardless of the amount of training, and appropriate substitutions must be created. For example, the authors had to substitute "twice a day" for "bid" when using the less expensive dictionary, but not when using the other two dictionaries. From trials they conducted in settings ranging from an emergency room to hospital wards and clinicians' offices, they learned that ambient noise has minimal effect. Finally, they found that a minimal "usable" hardware configuration (which keeps up with dictation) comprises a 300-MHz Pentium processor with 128 MB of RAM and a "speech quality" sound card (e.g., SoundBlaster, $99). Anything less powerful will result in the system lagging behind the speaking rate.

The authors obtained 97 percent accuracy with just 30 minutes of training when using the latest edition of one of the speech recognition systems supplemented by a commercial medical dictionary. This technology has advanced considerably in recent years and is now a serious contender to replace some or all of the increasingly expensive alternative methods of dictation with human transcription.




This article has been cited by other articles:


Home page
J. Am. Med. Inform. Assoc.Home page
T. K.L. Schleyer, T. P. Thyvalikakath, H. Spallek, M. H. Torres-Urquidy, P. Hernandez, and J. Yuhaniak
Clinical Computing in General Dentistry
J. Am. Med. Inform. Assoc., May 1, 2006; 13(3): 344 - 352.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
P. J. Embi, T. R. Yackel, J. R. Logan, J. L. Bowen, T. G. Cooney, and P. N. Gorman
Impacts of Computerized Physician Documentation in a Teaching Hospital: Perceptions of Faculty and Resident Physicians
J. Am. Med. Inform. Assoc., July 1, 2004; 11(4): 300 - 309.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
D. N. Mohr, D. W. Turner, G. R. Pond, J. S. Kamath, C. B. De Vos, and P. C. Carpenter
Speech Recognition as a Transcription Aid: A Randomized Comparison With Standard Transcription
J. Am. Med. Inform. Assoc., January 1, 2003; 10(1): 85 - 93.
[Abstract] [Full Text] [PDF]


Home page
Arch OphthalmolHome page
P. W. DeBry
Considerations for Choosing an Electronic Medical Record for an Ophthalmology Practice
Arch Ophthalmol, April 1, 2001; 119(4): 590 - 596.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Med. Inform. Assoc.Home page
S. M. Borowitz
Computer-based Speech Recognition as an Alternative to Medical Transcription
J. Am. Med. Inform. Assoc., January 1, 2001; 8(1): 101 - 102.
[Abstract] [Full Text]


Home page
J. Am. Med. Inform. Assoc.Home page
E. G. Devine, S. A. Gaehde, and A. C. Curtis
Comparative Evaluation of Three Continuous Speech Recognition Software Packages in the Generation of Medical Reports
J. Am. Med. Inform. Assoc., September 1, 2000; 7(5): 462 - 468.
[Abstract] [Full Text]


Home page
AAP Grand RoundsHome page
N. Ackerman Jr
Computer Recognition of Medical Record Voice Dictations
AAP Grand Rounds, August 1, 1999; 2(2): 22 - 22.
[Full Text] [PDF]


Home page
CMAJHome page
R. Patterson
"I need more power, Scotty"
Can. Med. Assoc. J., August 1, 1999; 161(3): 246 - 246.
[Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 1999 by the American Medical Informatics Association.