Experiments in newswire summarisation

Mackie, S., McCreadie, R. , Macdonald, C. and Ounis, I. (2016) Experiments in newswire summarisation. Lecture Notes in Computer Science, 9626, pp. 421-435. (doi: 10.1007/978-3-319-30671-1_31)

[img]
Preview
Text
116771.pdf - Accepted Version

392kB

Abstract

In this paper, we investigate extractive multi-document summarisation algorithms over newswire corpora. Examining recent findings, baseline algorithms, and state-of-the-art systems is pertinent given the current research interest in event tracking and summarisation. We first reproduce previous findings from the literature, validating that automatic summarisation evaluation is a useful proxy for manual evaluation, and validating that several state-of-the-art systems with similar automatic evaluation scores create different summaries from one another. Following this verification of previous findings, we then reimplement various baseline and state-of-the-art summarisation algorithms, and make several observations from our experiments. Our findings include: an optimised Lead baseline; indication that several standard baselines may be weak; evidence that the standard baselines can be improved; results showing that the most effective improved baselines are not statistically significantly less effective than the current state-of-the-art systems; and finally, observations that manually optimising the choice of anti-redundancy components, per topic, can lead to improvements in summarisation effectiveness.

Item Type:Articles
Additional Information:38th European Conference on IR Research, ECIR 2016, Padua, Italy, 20-23 March 2016.
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Mccreadie, Dr Richard and Macdonald, Professor Craig and Ounis, Professor Iadh
Authors: Mackie, S., McCreadie, R., Macdonald, C., and Ounis, I.
College/School:College of Science and Engineering > School of Computing Science
Journal Name:Lecture Notes in Computer Science
Publisher:Springer International Publishing
ISSN:0302-9743
ISBN:9783319306704
Copyright Holders:Copyright © 2016 Springer International Publishing
First Published:First published in Lecture Notes in Computer Science 9626:421-435
Publisher Policy:Reproduced in accordance with the copyright policy of the publisher
Related URLs:

University Staff: Request a correction | Enlighten Editors: Update this record