Interactive Media Systems, TU Wien

SLAMANTIC - Leveraging Semantics to Improve VSLAM in Dynamic Environments

By Matthias Schörghuber, Daniel Steininger, Yohan Carbon, and Margrit Gelautz


In this paper, we tackle the challenge of handling non-static environments in VSLAM. We propose to include semantic information obtained by deep learning methods in the traditional geometric pipeline. Specifically, we compute a confidence measure for each map point as a function of its semantic class (car, person, building, etc.) and its detection consistency over time. The confidence then guides how each point is used in the mapping and localization stages. Points with high confidence are used to verify points with low confidence in order to select the final set of points for pose computation and mapping. Furthermore, we can handle map points whose state may change between static and dynamic (a car can be parked or in motion). Evaluating our method on public datasets, we show that it successfully solves challenging situations in dynamic environments that cause state-of-the-art baseline VSLAM algorithms to fail, and that it maintains performance on static scenes. Code is available at
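To make the idea of a per-point confidence more concrete, the following is a minimal sketch and not the authors' implementation. It assumes that confidence is the product of a per-class prior and the fraction of frames in which the point's class was detected consistently, and that low-confidence points are verified by their reprojection error under a pose estimated from the high-confidence points alone. The names STATIC_PRIOR, MapPoint, confidence, and select_points, as well as all thresholds and prior values, are hypothetical.

```python
from dataclasses import dataclass, field

# Hypothetical priors: how likely a point of each semantic class is static.
STATIC_PRIOR = {"building": 0.9, "car": 0.5, "person": 0.1}


@dataclass
class MapPoint:
    name: str
    label: str  # semantic class assigned to the map point
    observations: list = field(default_factory=list)  # class detected per frame


def confidence(point: MapPoint) -> float:
    """Class prior scaled by how consistently the class was detected over time."""
    prior = STATIC_PRIOR.get(point.label, 0.5)
    if not point.observations:
        return 0.0  # never observed, so nothing supports the point yet
    hits = sum(1 for seen in point.observations if seen == point.label)
    return prior * hits / len(point.observations)


def select_points(points, residuals, high=0.6, max_residual=2.0):
    """Keep high-confidence points, plus low-confidence points they verify.

    residuals maps a point name to its reprojection error in pixels under a
    pose estimated from the high-confidence points only (assumed to be given).
    """
    reliable = [p for p in points if confidence(p) >= high]
    verified = [
        p for p in points
        if confidence(p) < high
        and residuals.get(p.name, float("inf")) <= max_residual
    ]
    return reliable + verified


points = [
    MapPoint("wall", "building", ["building"] * 4),
    MapPoint("parked_car", "car", ["car", "car", "road", "car"]),
    MapPoint("walker", "person", ["person", "person"]),
]
residuals = {"parked_car": 0.8, "walker": 5.0}
selected = [p.name for p in select_points(points, residuals)]
```

In this toy example the wall is selected on confidence alone, the parked car is admitted because it agrees with the pose supported by the reliable points, and the walking person is rejected. If the car started to move, its residual would grow and it would drop out of the selection, which mirrors the static/dynamic state change described above.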

