Abstract:
There are many examples in nature of animals using acoustics to understand and navigate the world around them. Inspired by this, we train an image-to-image translation ne...Show MoreMetadata
Abstract:
There are many examples in nature of animals using acoustics to understand and navigate the world around them. Inspired by this, we train an image-to-image translation network to learn the mapping from recorded chirps and echos from an environment to a 360° depth map of the environment. This work is focused on expanding on the capabilities of BatVision in a number of ways. We first propose various methods for data augmentation to help the model generalise on less data. We also propose changes to the model architecture to improve performance and training stability. Finally, we investigate the feasibility of 360° scene reconstruction by using more microphones and lidar based 3D SLAM data as ground truth for training the model.
Published in: IEEE Robotics and Automation Letters ( Volume: 6, Issue: 4, October 2021)