Dai, H., Luo, S., Ding, Y. and Shao, L. (2021) Commands for autonomous vehicles by progressively stacking visual-linguistic representations. In: Bartoli, A. and Fusiello, A. (eds.) Computer Vision – ECCV 2020 Workshops. Series: Lecture Notes in Computer Science, 12536. Springer, pp. 27-32. ISBN 9783030660963 (doi: 10.1007/978-3-030-66096-3_2)
Abstract
In this work, we focus on the object referral problem in the autonomous driving setting. We use a stacked visual-linguistic BERT model to learn a generic visual-linguistic representation. Each element of the input is either a word or a region of interest from the input image. To train the deep model efficiently, we use a stacking algorithm to transfer knowledge from a shallow BERT model to a deep BERT model.
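The abstract does not spell out the stacking algorithm, so the following is only a minimal sketch of one common form of progressive stacking, assuming the deep model is initialised by placing a copy of the trained shallow layers on top of themselves before training continues. The `stack_layers` helper and the dictionary-based "layers" are hypothetical stand-ins for illustration, not the authors' implementation.

```python
import copy


def stack_layers(shallow_layers):
    """Build a deep layer list of twice the depth.

    Assumption: layer i + L of the deep model starts from the
    parameters of layer i of the trained L-layer shallow model.
    """
    lower = [copy.deepcopy(layer) for layer in shallow_layers]
    upper = [copy.deepcopy(layer) for layer in shallow_layers]
    return lower + upper


# Hypothetical stand-in for trained encoder layers: one dict of
# parameters per layer (a real model would hold weight tensors).
shallow = [{"name": f"layer{i}", "weights": [0.1 * i, 0.2 * i]} for i in range(3)]

# The deep model has six layers; each copy is independent, so further
# training of the deep model leaves the shallow model untouched.
deep = stack_layers(shallow)
```

Deep copies are used so that the two halves of the deep model can diverge during later training rather than sharing parameters.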
Item Type: | Book Sections |
---|---|
Additional Information: | Print ISBN: 9783030660956 |
Status: | Published |
Glasgow Author(s) Enlighten ID: | Dai, Dr Hang |
Authors: | Dai, H., Luo, S., Ding, Y., and Shao, L. |
College/School: | College of Science and Engineering > School of Computing Science |
Publisher: | Springer |
ISSN: | 0302-9743 |
ISBN: | 9783030660963 |