Fusion of Learned Multi-Modal Representations and Dense Trajectories for Emotional Analysis in Videos

Acar, E., Hopfgartner, F. and Albayrak, S. (2015) Fusion of Learned Multi-Modal Representations and Dense Trajectories for Emotional Analysis in Videos. In: CBMI 2015: 13th International Workshop on Content-Based Multimedia Indexing, Prague, Czech Republic, 10-12 June 2015, pp. 1-6. ISBN 9781467368704 (doi:10.1109/CBMI.2015.7153603)

Abstract

When designing a video affective content analysis algorithm, one of the most important steps is the selection of discriminative features for the effective representation of video segments. The majority of existing affective content analysis methods either use low-level audio-visual features or generate handcrafted higher-level representations based on these low-level features. We propose in this work to use deep learning methods, in particular convolutional neural networks (CNNs), to automatically learn and extract mid-level representations from raw data. To this end, we exploit the audio and visual modalities of videos by employing Mel-Frequency Cepstral Coefficients (MFCC) and color values in the HSV color space. We also incorporate dense-trajectory-based motion features to further enhance the performance of the analysis. By means of multi-class support vector machines (SVMs) and fusion mechanisms, music video clips are classified into one of four affective categories representing the four quadrants of the Valence-Arousal (VA) space. Results obtained on a subset of the DEAP dataset show (1) that higher-level representations perform better than low-level features, and (2) that incorporating motion information leads to a notable performance gain, independently of the chosen representation.
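
The classification target and the fusion step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not state the rating scale, the quadrant threshold, the quadrant labels, or the fusion rule, so the 1 to 9 scale with midpoint 5, the label strings, the weighted-sum fusion, and the names `va_quadrant`, `fuse_scores`, and `predict_quadrant` are all assumptions made for illustration.

```python
from typing import List, Optional, Sequence

# Assumption: valence and arousal are rated on a 1 to 9 scale,
# so 5.0 is used as the boundary between "low" and "high".
MIDPOINT = 5.0

# Hypothetical label names for the four VA quadrants.
QUADRANTS = ["HVHA", "LVHA", "LVLA", "HVLA"]


def va_quadrant(valence: float, arousal: float, midpoint: float = MIDPOINT) -> str:
    """Map a (valence, arousal) pair to one of four quadrant labels."""
    high_v = valence >= midpoint
    high_a = arousal >= midpoint
    if high_v and high_a:
        return "HVHA"  # high valence, high arousal
    if not high_v and high_a:
        return "LVHA"  # low valence, high arousal
    if not high_v and not high_a:
        return "LVLA"  # low valence, low arousal
    return "HVLA"      # high valence, low arousal


def fuse_scores(
    per_modality_scores: Sequence[Sequence[float]],
    weights: Optional[Sequence[float]] = None,
) -> List[float]:
    """Weighted-sum late fusion of per-class scores from several classifiers.

    Each inner sequence holds one classifier's scores for the four classes
    (for example audio, visual, and motion classifiers). Equal weights are
    used when none are given. This is one possible fusion rule, chosen here
    only as an example.
    """
    n_models = len(per_modality_scores)
    if weights is None:
        weights = [1.0 / n_models] * n_models
    if len(weights) != n_models:
        raise ValueError("one weight per modality is required")
    n_classes = len(per_modality_scores[0])
    fused = [0.0] * n_classes
    for w, scores in zip(weights, per_modality_scores):
        if len(scores) != n_classes:
            raise ValueError("all score vectors must have the same length")
        for c, s in enumerate(scores):
            fused[c] += w * s
    return fused


def predict_quadrant(fused: Sequence[float]) -> str:
    """Return the quadrant label with the highest fused score."""
    best = max(range(len(fused)), key=lambda c: fused[c])
    return QUADRANTS[best]
```

For instance, `va_quadrant(7.0, 8.0)` yields the high-valence, high-arousal label, and `predict_quadrant(fuse_scores([...]))` picks the class whose combined score is largest. Fusing at the score level keeps each modality's classifier independent, which is one common way to add a motion-based classifier to existing audio and visual ones.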

Item Type: Conference Proceedings
Status: Published
Refereed: Yes
Glasgow Author(s) Enlighten ID: Hopfgartner, Dr Frank
Authors: Acar, E., Hopfgartner, F., and Albayrak, S.
Subjects: Z Bibliography. Library Science. Information Resources > ZA Information resources
College/School: College of Arts > School of Humanities > Humanities Advanced Technology and Information Institute (HATII)
ISBN: 9781467368704
Copyright Holders: Copyright © 2015 Institute of Electrical and Electronics Engineers
Publisher Policy: Reproduced in accordance with the copyright policy of the publisher.