Using argumentation to extract key sentences from biomedical abstracts.

TitleUsing argumentation to extract key sentences from biomedical abstracts.
Publication TypeJournal Article
Year of Publication2007
AuthorsRuch, P, Boyer, C, Chichester, C, Tbahriti, I, Geissb├╝hler, A, Fabry, P, Gobeill, J, Pillet, V, Rebholz-Schuhmann, D, Lovis, C, Veuthey, A-L
JournalInternational journal of medical informatics
Volume76
Issue2-3
Pagination195-200
Date Published2007 Feb-Mar
ISSN1386-5056
KeywordsAbstracting and Indexing as Topic, Artificial Intelligence, Bayes Theorem, Bibliometrics, Information Storage and Retrieval, Libraries, Digital, MEDLINE, Natural Language Processing, Periodicals as Topic, Terminology as Topic, Vocabulary, Controlled
Abstract

PROBLEM: key word assignment has been largely used in MEDLINE to provide an indicative "gist" of the content of articles and to help retrieving biomedical articles. Abstracts are also used for this purpose. However with usually more than 300 words, MEDLINE abstracts can still be regarded as long documents; therefore we design a system to select a unique key sentence. This key sentence must be indicative of the article's content and we assume that abstract's conclusions are good candidates. We design and assess the performance of an automatic key sentence selector, which classifies sentences into four argumentative moves: PURPOSE, METHODS, RESULTS and

CONCLUSION:

METHODS: we rely on Bayesian classifiers trained on automatically acquired data. Features representation, selection and weighting are reported and classification effectiveness is evaluated on the four classes using confusion matrices. We also explore the use of simple heuristics to take the position of sentences into account. Recall, precision and F-scores are computed for the CONCLUSION class. For the CONCLUSION class, the F-score reaches 84%. Automatic argumentative classification using Bayesian learners is feasible on MEDLINE abstracts and should help user navigation in such repositories.

DOI10.1016/j.ijmedinf.2006.05.002
Alternate JournalInt J Med Inform
PubMed ID16815739