Wilkie, C. and Azzopardi, L. (2014) Efficiently estimating retrievability bias. Lecture Notes in Computer Science, 8416, pp. 720-726. (doi: 10.1007/978-3-319-06028-6_82)
Full text not currently available from Enlighten.
Abstract
Retrievability is the measure of how easily a document can be retrieved using a particular retrieval system. The extent to which a retrieval system favours certain documents over others (as expressed by their retrievability scores) determines the level of bias the system imposes on a collection. Recently it has been shown that it is possible to tune a retrieval system by minimising the retrievability bias. However, to perform such a retrievability analysis often requires posing millions upon millions of queries. In this paper, we examine how many queries are needed to obtain a reliable and useful approximation of the retrievability bias imposed by the system, and an estimate of the individual retrievability of documents in the collection. We find that a reliable estimate of retrievability bias can be obtained, in some cases, with 90% less queries than are typically used while estimating document retrievability can be done with up to 60% less queries.
Item Type: | Articles |
---|---|
Status: | Published |
Refereed: | Yes |
Glasgow Author(s) Enlighten ID: | Azzopardi, Dr Leif and Wilkie, Mr Colin |
Authors: | Wilkie, C., and Azzopardi, L. |
College/School: | College of Science and Engineering > School of Computing Science |
Journal Name: | Lecture Notes in Computer Science |
Publisher: | Springer |
ISSN: | 0302-9743 |
ISSN (Online): | 0302-9743 |
University Staff: Request a correction | Enlighten Editors: Update this record