SeDAR: reading floorplans like a human—using deep learning to enable human-inspired localisation

Mendez, O., Hadfield, S., Pugeault, N. and Bowden, R. (2020) SeDAR: reading floorplans like a human—using deep learning to enable human-inspired localisation. International Journal of Computer Vision, 128(5), pp. 1286-1310. (doi: 10.1007/s11263-019-01239-4)

[img]
Preview
Text
218210.pdf - Published Version
Available under License Creative Commons Attribution.

4MB

Abstract

The use of human-level semantic information to aid robotic tasks has recently become an important area for both Computer Vision and Robotics. This has been enabled by advances in Deep Learning that allow consistent and robust semantic understanding. Leveraging this semantic vision of the world has allowed human-level understanding to naturally emerge from many different approaches. Particularly, the use of semantic information to aid in localisation and reconstruction has been at the forefront of both fields. Like robots, humans also require the ability to localise within a structure. To aid this, humans have designed high-level semantic maps of our structures called floorplans. We are extremely good at localising in them, even with limited access to the depth information used by robots. This is because we focus on the distribution of semantic elements, rather than geometric ones. Evidence of this is that humans are normally able to localise in a floorplan that has not been scaled properly. In order to grant this ability to robots, it is necessary to use localisation approaches that leverage the same semantic information humans use. In this paper, we present a novel method for semantically enabled global localisation. Our approach relies on the semantic labels present in the floorplan. Deep Learning is leveraged to extract semantic labels from RGB images, which are compared to the floorplan for localisation. While our approach is able to use range measurements if available, we demonstrate that they are unnecessary as we can achieve results comparable to state-of-the-art without them.

Item Type:Articles
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Pugeault, Dr Nicolas
Authors: Mendez, O., Hadfield, S., Pugeault, N., and Bowden, R.
College/School:College of Science and Engineering > School of Computing Science
Journal Name:International Journal of Computer Vision
Publisher:Springer
ISSN:0920-5691
ISSN (Online):1573-1405
Published Online:28 September 2019
Copyright Holders:Copyright © 2019 The Authors
First Published:First published in International Journal of Computer Vision 128(5): 1286-1310
Publisher Policy:Reproduced under a Creative Commons License

University Staff: Request a correction | Enlighten Editors: Update this record