| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Submitted on December 2, 2004
Accepted on April 23, 2005
Affiliation of the authors: 1 Department of Computational Biology, University of Tokyo, Chiba, Japan; Basic Research Laboratory, Kanebo, COSMETICS, INC., Kanagawa, Japan; 2 Department of Computational Biology, University of Tokyo, Chiba, Japan
* To whom correspondence should be addressed.
Objective To help biomedical researchers recognize dynamically introduced abbreviations in biomedical literature, such as gene and protein names, we have constructed a support system called ALICE (Abbreviation LIfter using Corpus-based Extraction). ALICE aims to extract all types of abbreviations with their expansions from a target paper on the fly.
Methods ALICE extracts an abbreviation and its expansion from the literature by using heuristic pattern-matching rules. This system consists of three phases and potentially identifies valid 320 abbreviation-expansion patterns as combinations of the rules.
Results It achieved 95% recall and 97% precision on randomly selected titles and abstracts from the MEDLINE database.
Conclusion ALICE extracted abbreviations and their expansions from the literature efficiently. The subtly compiled heuristics enabled it to extract abbreviations with high recall without significantly reducing precision. ALICE does not only facilitate recognition of an undefined abbreviation in a paper by constructing an abbreviation database or a dictionary, but also makes biomedical literature retrieval more accurate. This system is freely available at http://uvdb3.hgc.jp/ALICE/ALICE_index.html.
This article has been cited by other articles:
![]() |
N. Okazaki and S. Ananiadou Building an abbreviation dictionary using a term recognition approach Bioinformatics, December 15, 2006; 22(24): 3089 - 3095. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Zhou, V. I. Torvik, and N. R. Smalheiser ADAM: another database of abbreviations in MEDLINE Bioinformatics, November 15, 2006; 22(22): 2813 - 2818. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH |