Thuma, E., Rogers, S. and Ounis, I. (2014) Detecting missing content queries in an SMS-Based HIV/AIDS FAQ retrieval system. In: 36th European Conference on Information Retrieval, Amsterdam, The Netherlands, 13-16 April 2014, pp. 247-259. (doi: 10.1007/978-3-319-06028-6_21)
|
Text
89123.pdf - Accepted Version 325kB |
Abstract
Automated Frequently Asked Question (FAQ) answering systems use pre-stored sets of question-answer pairs as an information source to answer natural language questions posed by the users. The main problem with this kind of information source is that there is no guarantee that there will be a relevant question-answer pair for all user queries. In this paper, we propose to deploy a binary classifier in an existing SMS-Based HIV/AIDS FAQ retrieval system to detect user queries that do not have the relevant question-answer pair in the FAQ document collection. Before deploying such a classifier, we first evaluate different feature sets for training in order to determine the sets of features that can build a model that yields the best classification accuracy. We carry out our evaluation using seven different feature sets generated from a query log before and after retrieval by the FAQ retrieval system. Our results suggest that, combining different feature sets markedly improves the classification accuracy.
Item Type: | Conference Proceedings |
---|---|
Status: | Published |
Refereed: | Yes |
Glasgow Author(s) Enlighten ID: | Ounis, Professor Iadh and Rogers, Dr Simon and Thuma, Mr Edwin |
Authors: | Thuma, E., Rogers, S., and Ounis, I. |
College/School: | College of Science and Engineering > School of Computing Science |
Copyright Holders: | Copyright © 2014 Springer |
Publisher Policy: | Reproduced in accordance with the copyright policy of the publisher |
University Staff: Request a correction | Enlighten Editors: Update this record