In this talk I will present how semantic information, as extracted by object and region detectors, can be used to perform classical 3D Computer Vision tasks in a robust way. In particular, object priors can be used to simply calibrate camera motion and to identify the position and orientation of objects in 3D. Strikingly, this solution has a closed form that can be solved very efficiently. Moreover, the objects position can be used to initialise efficiently standard methods for 3D reconstruction and for higher order reasoning on the 3D scene. Practical applications show that this computational framework can efficiently inject semantic information in Structure from Motion problems.
Alessio Del Bue is a Tenure-Track Researcher leading the Visual Geometry and Modelling (VGM) Lab of the PAVIS Department at the Istituto Italiano di Tecnologia (IIT). Previously (2006-2009), he was a senior researcher at the Institute for Systems and Robotics (ISR) at the Instituto Superior Técnico (IST) in Lisbon, Portugal. Before that, he obtained his Ph.D. in 2006 under the supervision of Prof. Lourdes Agapito in the Department of Computer Science at Queen Mary University of London. His main area of research is computer vision with a particular focus on dynamic 3D scene understanding and analysis. He is also active in several interdisciplinary projects at IIT by scientifically supporting life science and robotic departments on computer vision and image processing related issues.