Salamin, H., Polychroniou, A. and Vinciarelli, A. (2013) Automatic detection of laughter and fillers in spontaneous mobile phone conversations. In: IEEE SMC 2013: IEEE International Conference on Systems, Man, and Cybernetics, Manchester, UK, 13-16 Oct 2013, pp. 4282-4287. ISBN 9781479906529 (doi: 10.1109/SMC.2013.730)
|
Text
100495.pdf - Accepted Version 453kB |
Publisher's URL: http://dx.doi.org/10.1109/SMC.2013.730
Abstract
This article presents experiments on automatic detection of laughter and fillers, two of the most important nonverbal behavioral cues observed in spoken conversations. The proposed approach is fully automatic and segments audio recordings captured with mobile phones into four types of interval: laughter, filler, speech and silence. The segmentation methods rely not only on probabilistic sequential models (in particular Hidden Markov Models), but also on Statistical Language Models aimed at estimating the a-priori probability of observing a given sequence of the four classes above. The experiments are speaker independent and performed over a total of 8 hours and 25 minutes of data (120 people in total). The results show that F1 scores up to 0.64 for laughter and 0.58 for fillers can be achieved.
Item Type: | Conference Proceedings |
---|---|
Status: | Published |
Refereed: | Yes |
Glasgow Author(s) Enlighten ID: | Vinciarelli, Professor Alessandro |
Authors: | Salamin, H., Polychroniou, A., and Vinciarelli, A. |
College/School: | College of Science and Engineering > School of Computing Science |
ISBN: | 9781479906529 |
Copyright Holders: | Copyright © 2013 IEEE |
Publisher Policy: | Reproduced with the permission of the authors |
Related URLs: |
University Staff: Request a correction | Enlighten Editors: Update this record