This work combines two active areas of research in computer vision: unsupervised object extraction from a single image, and depth estimation from a stereo image pair. A recent, successful trend in unsupervised object extraction is to exploit so-called ``3D scene-consistency", that is enforcing that objects obey underlying physical constraints of the 3D scene, such as occupancy of 3D space and gravity of objects. Our main contribution is to introduce the concept of 3D scene-consistency into stereo matching. We show that this concept is beneficial for both tasks, object extraction and depth estimation. In particular, we demonstrate that our approach is able to create a large set of 3D scene-consistent object proposals, by varying e.g. the prior on the number of objects. After automatically ranking the proposals we show experimentally that our results are considerably closer to ground truth than state-of-the-art techniques which either use stereo or monocular images. We envision that our method will build the front-end of a future object recognition system for stereo images.
M. Bleyer, C. Rhemann, C. Rother: "Extracting 3D Scene-consistent Object Proposals and Depth from Stereo Images"; Poster: European Conference on Computer Vision (ECCV) 2012, Florenz, Italien; 10-07-2012 - 10-13-2012; in: "European Conference on Computer Vision (ECCV) 2012", (2012), 1 - 14.
Supplementary Material: You can find the supplementary material here
This work was supported by the Vienna Science and Technology Fund (WWTF) under project ICT08-019.
|Supplementary Material||2.98 MB||PDF document||Download|
Click into the text area and press Ctrl+A/Ctrl+C or ⌘+A/⌘+C to copy the BibTeX into your clipboard… or download the BibTeX.