Using argumentation to retrieve articles with similar citations: an inquiry into improving related articles search in the MEDLINE digital library.

TitleUsing argumentation to retrieve articles with similar citations: an inquiry into improving related articles search in the MEDLINE digital library.
Publication TypeJournal Article
Year of Publication2006
AuthorsTbahriti, I, Chichester, C, Lisacek, F, Ruch, P
JournalInternational journal of medical informatics
Volume75
Issue6
Pagination488-95
Date Published2006 Jun
ISSN1386-5056
KeywordsAbstracting and Indexing as Topic, Algorithms, Artificial Intelligence, Benchmarking, Bibliometrics, Database Management Systems, Information Storage and Retrieval, Libraries, Digital, MEDLINE, Natural Language Processing, Periodicals as Topic, Terminology as Topic, Vocabulary, Controlled
Abstract

The aim of this study is to investigate the relationships between citations and the scientific argumentation found abstracts. We design a related article search task and observe how the argumentation can affect the search results. We extracted citation lists from a set of 3200 full-text papers originating from a narrow domain. In parallel, we recovered the corresponding MEDLINE records for analysis of the argumentative moves. Our argumentative model is founded on four classes: PURPOSE, METHODS, RESULTS and CONCLUSION. A Bayesian classifier trained on explicitly structured MEDLINE abstracts generates these argumentative categories. The categories are used to generate four different argumentative indexes. A fifth index contains the complete abstract, together with the title and the list of Medical Subject Headings (MeSH) terms. To appraise the relationship of the moves to the citations, the citation lists were used as the criteria for determining relatedness of articles, establishing a benchmark; it means that two articles are considered as "related" if they share a significant set of co-citations. Our results show that the average precision of queries with the PURPOSE and CONCLUSION features is the highest, while the precision of the RESULTS and METHODS features was relatively low. A linear weighting combination of the moves is proposed, which significantly improves retrieval of related articles.

DOI10.1016/j.ijmedinf.2005.06.007
Alternate JournalInt J Med Inform
PubMed ID16165395