Commands for autonomous vehicles by progressively stacking visual-linguistic representations

Dai, H., Luo, S., Ding, Y. and Shao, L. (2021) Commands for autonomous vehicles by progressively stacking visual-linguistic representations. In: Bartoli, A. and Fusiello, A. (eds.) Computer Vision – ECCV 2020 Workshops. Series: Lecture Notes in Computer Science, 12536. Springer, pp. 27-32. ISBN 9783030660963 (doi: 10.1007/978-3-030-66096-3_2)

Full text not currently available from Enlighten.

Abstract

In this work, we focus on the object referral problem in the autonomous driving setting. We use a stacked visual-linguistic BERT model to learn a generic visual-linguistic representation. Each element of the input is either a word or a region of interest from the input image. To train the deep model efficiently, we use a stacking algorithm to transfer knowledge from a shallow BERT model to a deep BERT model.
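
The abstract does not spell out the stacking procedure, so the following is only a minimal sketch of one common reading: a trained shallow encoder's layers are copied to initialize an encoder twice as deep, which is then trained further. The names `stack_layers`, `progressive_stack`, and `train_fn` are hypothetical, and the layers are toy dictionaries standing in for real BERT layers; nothing here is taken from the authors' implementation.

```python
import copy


def stack_layers(shallow_layers):
    """Return a list twice as deep as the input.

    The first half is a copy of the trained layers and the second half is
    another independent copy, so the deeper model starts from the shallow
    model's parameters instead of a random initialization.
    """
    lower = [copy.deepcopy(layer) for layer in shallow_layers]
    upper = [copy.deepcopy(layer) for layer in shallow_layers]
    return lower + upper


def progressive_stack(initial_layers, target_depth, train_fn):
    """Alternate training and stacking until target_depth is reached.

    train_fn is a hypothetical callable that takes a list of layers and
    returns the trained list of layers (same length).
    """
    layers = train_fn(initial_layers)
    while len(layers) < target_depth:
        # Double the depth, truncating if doubling would overshoot the target.
        layers = stack_layers(layers)[:target_depth]
        layers = train_fn(layers)
    return layers


if __name__ == "__main__":
    # Toy stand-in for training: nudge every weight by 1.0.
    def toy_train(layers):
        return [{"w": layer["w"] + 1.0} for layer in layers]

    deep = progressive_stack([{"w": 0.0}], target_depth=4, train_fn=toy_train)
    print(len(deep), deep)
```

In this sketch each copy is independent (`copy.deepcopy`), so later training of the deep model does not modify the shallow model it was built from.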

Item Type: Book Sections
Additional Information: Print ISBN: 9783030660956
Status: Published
Glasgow Author(s) Enlighten ID: Dai, Dr Hang
Authors: Dai, H., Luo, S., Ding, Y., and Shao, L.
College/School: College of Science and Engineering > School of Computing Science
Publisher: Springer
ISSN: 0302-9743
ISBN: 9783030660963