Detecting missing content queries in an SMS-Based HIV/AIDS FAQ retrieval system

Thuma, E., Rogers, S. and Ounis, I. (2014) Detecting missing content queries in an SMS-Based HIV/AIDS FAQ retrieval system. In: 36th European Conference on Information Retrieval, Amsterdam, The Netherlands, 13-16 April 2014, pp. 247-259. (doi: 10.1007/978-3-319-06028-6_21)

89123.pdf - Accepted Version



Automated Frequently Asked Question (FAQ) answering systems use pre-stored sets of question-answer pairs as an information source to answer natural language questions posed by the users. The main problem with this kind of information source is that there is no guarantee that there will be a relevant question-answer pair for all user queries. In this paper, we propose to deploy a binary classifier in an existing SMS-Based HIV/AIDS FAQ retrieval system to detect user queries that do not have the relevant question-answer pair in the FAQ document collection. Before deploying such a classifier, we first evaluate different feature sets for training in order to determine the sets of features that can build a model that yields the best classification accuracy. We carry out our evaluation using seven different feature sets generated from a query log before and after retrieval by the FAQ retrieval system. Our results suggest that, combining different feature sets markedly improves the classification accuracy.

Item Type:Conference Proceedings
Glasgow Author(s) Enlighten ID:Ounis, Professor Iadh and Rogers, Dr Simon and Thuma, Mr Edwin
Authors: Thuma, E., Rogers, S., and Ounis, I.
College/School:College of Science and Engineering > School of Computing Science
Copyright Holders:Copyright © 2014 Springer
Publisher Policy:Reproduced in accordance with the copyright policy of the publisher

University Staff: Request a correction | Enlighten Editors: Update this record