| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Research Paper |
Affiliation of the author: Department of Medical Informatics, Columbia University College of Physicians and Surgeons, New York, NY.
Correspondence and reprint requests: James J. Cimino, Associate Professor of Medical Informatics in Medicine, Atchley Pavilion Room 1310, 161 Fort Washington Avenue, New York, NY 10032. e-mail: <James.Cimino{at}columbia.edu>.
Abstract Objective: The National Library of Medicine's (NLM) Unified Medical Language System (UMLS) includes a Metathesaurus (Meta), which is a compilation of medical terms drawn from over 30 controlled vocabularies, and a Semantic Net, which contains the semantic types used to categorize Meta concepts and the semantic relations to connect them. Meta has been constructed through lexical matching techniques and human review. The purpose of this study was to audit the Meta using semantic techniques to identify possible inconsistencies.
Methods: Five different techniques were applied: (1) detection of ambiguity in Meta concepts with two or more semantic types, (2) detection of interchangeable keyword synonyms, (3) detection of redundant pairs of Meta concepts (using lexical matching combined with keyword synonyms), (4) detection of inconsistent parent-child relationships in Meta (based on the semantic type information), and (5) discovery of pairs of semantic types for which relations could be added to the Semantic Net, based on "other" relationships between Meta concepts.
Results: Of 57,592 concepts with multiple semantic types, 1817 (3.2%) were judged to be ambiguous. Keyword analysis showed 7121 pairs of interchangeable words. Using the keyword pairs, 5031 pairs of potentially redundant concepts were suggested, of which 3274 (65.1%) were judged to actually be redundant. Review of the 100,586 parent-child relationships revealed 544 (0.54%) that were incorrect. Review of the 219,664 "Other" relationships suggested 1299 places in the Semantic Net where relations between pairs of semantic types could be added.
Conclusion: Semantic techniques, alone or in combination, can be used to audit the UMLS to detect inconsistencies that are not detectable through lexical techniques alone. Use of these methods to augment the UMLS maintenance process will lead to improvement in the UMLS.
This article has been cited by other articles:
![]() |
M. Bada and L. Hunter Identification of OBO nonalignments and its implications for OBO enrichment Bioinformatics, June 15, 2008; 24(12): 1448 - 1455. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-W. Fan and C. Friedman Semantic Classification of Biomedical Concepts Using Distributional Similarity J. Am. Med. Inform. Assoc., July 1, 2007; 14(4): 467 - 477. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Min, Y. Perl, Y. Chen, M. Halper, J. Geller, and Y. Wang Auditing as Part of the Terminology Design Life Cycle J. Am. Med. Inform. Assoc., November 1, 2006; 13(6): 676 - 690. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. E. Westberg and R. A. Miller The Basis for Using the Internet to Support the Information Needs of Primary Care J. Am. Med. Inform. Assoc., January 1, 1999; 6(1): 6 - 25. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. E. Campbell, D. E. Oliver, and E. H. Shortliffe The Unified Medical Language System: Toward a Collaborative Approach for Solving Terminologic Problems J. Am. Med. Inform. Assoc., January 1, 1998; 5(1): 12 - 16. [Abstract] [Full Text] |
||||
![]() |
A. T. McCray and R. A. Miller Making the Conceptual Connections: The UMLS after a Decade of Research and Development J. Am. Med. Inform. Assoc., January 1, 1998; 5(1): 129 - 130. [Full Text] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |