help button home button JAMIA Bigger figures
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

First published January 9, 2007 as JAMIA PrePrint; doi:10.1197/jamia.M2191
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
M2191v1
14/2/212    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Herskovic, J. R.
Right arrow Articles by Bernstam, E. V.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Herskovic, J. R.
Right arrow Articles by Bernstam, E. V.
J Am Med Inform Assoc. 2007;14:212-220. DOI 10.1197/jamia.M2191.
© 2007 American Medical Informatics Association


Research paper

A Day in the Life of PubMed: Analysis of a Typical Day’s Query Log

Jorge R. Herskovic, MD, MSa, Len Y. Tanaka, MDa,b, William Hersh, MDc and Elmer V. Bernstam, MD, MS, MSEa,d,*

a University of Texas School of Health Information Sciences at Houston, Houston, TX
b Department of Pediatrics, Division of Pediatric Critical Care, University of Texas School of Medicine at Houston, Houston, TX
c Department of Medical Informatics & Clinical Epidemiology, Oregon Health & Science University, Portland, OR
d Department of Internal Medicine, Division of General Internal Medicine, University of Texas School of Medicine at Houston, Houston, TX.

* Correspondence and reprints: Dr. Elmer V. Bernstam, University of Texas School of Health Information Sciences at Houston, 7000 Fannin St. Suite 600, Houston, TX 77030. (Email: Elmer.V.Bernstam{at}uth.tmc.edu).

Received for publication: 07/01/06; accepted for publication: 12/06/06.

Objective: To characterize PubMed usage over a typical day and compare it to previous studies of user behavior on Web search engines.

Design: We performed a lexical and semantic analysis of 2,689,166 queries issued on PubMed over 24 consecutive hours on a typical day.

Measurements: We measured the number of queries, number of distinct users, queries per user, terms per query, common terms, Boolean operator use, common phrases, result set size, MeSH categories, used semantic measurements to group queries into sessions, and studied the addition and removal of terms from consecutive queries to gauge search strategies.

Results: The size of the result sets from a sample of queries showed a bimodal distribution, with peaks at approximately 3 and 100 results, suggesting that a large group of queries was tightly focused and another was broad. Like Web search engine sessions, most PubMed sessions consisted of a single query. However, PubMed queries contained more terms.

Conclusion: PubMed’s usage profile should be considered when educating users, building user interfaces, and developing future biomedical information retrieval systems.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2007 by the American Medical Informatics Association.