Performance comparison of clustered and replicated information retrieval systems

Cacheda, F., Carneiro, V., Plachouras, V. and Ounis, I. (2007) Performance comparison of clustered and replicated information retrieval systems. Lecture Notes in Computer Science, 4425, pp. 124-135. (doi:10.1007/978-3-540-71496-5_14)

[img]
Preview
Text
cacheda3760.pdf

273kB

Publisher's URL: http://dx.doi.org/10.1007/978-3-540-71496-5_14

Abstract

The amount of information available over the Internet is increasing daily as well as the importance and magnitude of Web search engines. Systems based on a single centralised index present several problems (such as lack of scalability), which lead to the use of distributed information retrieval systems to effectively search for and locate the required information. A distributed retrieval system can be clustered and/or replicated. In this paper, using simulations, we present a detailed performance analysis, both in terms of throughput and response time, of a clustered system compared to a replicated system. In addition, we consider the effect of changes in the query topics over time. We show that the performance obtained for a clustered system does not improve the performance obtained by the best replicated system. Indeed, the main advantage of a clustered system is the reduction of network traffic. However, the use of a switched network eliminates the bottleneck in the network, markedly improving the performance of the replicated systems. Moreover, we illustrate the negative performance effect of the changes over time in the query topics when a distributed clustered system is used. On the contrary, the performance of a distributed replicated system is query independent.

Item Type:Articles
Keywords:Distributed information retrieval, performance, simulation.
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Ounis, Professor Iadh
Authors: Cacheda, F., Carneiro, V., Plachouras, V., and Ounis, I.
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
College/School:College of Science and Engineering > School of Computing Science
Journal Name:Lecture Notes in Computer Science
Publisher:Springer
ISSN:1611-3349
Copyright Holders:Copyright © 2007 Springer
First Published:First published in Lecture Notes in Computer Science 4425:124-135
Publisher Policy:Reproduced in accordance with the copyright policy of the publisher.

University Staff: Request a correction | Enlighten Editors: Update this record