|
This research is based on a new and accurate technique for Visual 3D Odometry using a stereo-pair of cameras. It investigates a new image-based approach to tracking the 6dof trajectory of a stereo camera pair using a corresponding reference image pairs instead of explicit 3D feature reconstruction of the scene. A dense minimisation approach is employed which directly uses all grey-scale information available within the stereo pair (or stereo region) leading to very robust and precise results. Metric 3D structure constraints are imposed by consistently warping corresponding stereo images to generate novel viewpoints at each stereo acquisition. An iterative non-linear trajectory estimation approach is formulated based on a quadrifocal relationship between the image intensities within adjacent views of the stereo pair. A robust M-estimation technique is used to reject outliers corresponding to moving objects within the scene or other outliers such as occlusions and illumination changes. The technique is applied to recovering the trajectory of a moving vehicle in long and difficult sequences of images.
The 3D visual odometry algorithm was tested on real full-scale sequences as can be seen in the videos below. Several test sequences from different streets in Versailles, France, were used to validate the results. The estimated 3D trajectory from a stereo-pair of cameras was superimposed on Google Earth satelite images of the area.
This first sequence is that of a relatively straight road. The distance travelled by the car has been measured using road markings in the images and satellite views with a precision of 2.9 cm/pixel for the Versailles region. The path length measured by both Google earth and the tracker was about 440m. It is difficult to register the satellite image with the projection of the trajectory since no three non-collinear points were available and the best that can be said is that they have approximately the same absolute length (ignoring tilt of the cameras and the incline of the road). Throughout the sequence several moving vehicles pass in front of the cameras and at one stage a car is overtaken.

This second sequence is particularly illustrative since a full loop of the round-about was performed. In particular this enables the drift to be measured at the crossing point in the trajectory. In the case of this round-about the drift at the crossing point was approximately 20cm in the vertical direction to the road-plane. Considering that the trajectory around the round-about is approximately 200m long (measured using Google earth), this makes a drift of 0.01% on the measurable axis. In the case of large scale scenes such as this one it was necessary to detect and update the reference image periodically when it was no longer visible or too approximate. Due to the highly redundant amount of data, the robust estimator was able to successfully reject pedestrians and moving cars from the estimation process. It can be noted, however, that all static information available was used to estimate the pose (including the parked cars) therefore leading to a very precise result with minimal drift over large displacements.

(Click on image or here to play)
Recently a publication was presented at the International Conference on Robotics and automation (ICRA'07 ) in Rome where we were selected as finalist for the best vision paper prize. The publication is now available here:
- A.I. Comport, E. Malis, P. Rives. Accurate Quadri-focal Tracking for Robust 3D Visual Odometry. In IEEE Int. Conf. on Robotics and Automation, ICRA'07, Rome, Italy, April 2007.

Some of the new results for the LAAS blimp sequence were presented at the conference. To see the sequence and the estimated trajectory click here (approximately 12M).
The full round about sequence showing the reference images as well as robust outlier rejection can be downloaded here (approximately 6.8M).
|
|
Last Updated on Friday, 10 February 2012 08:43 |
|
Since 2005 I have been working on new techniques for dense localisation and mapping techniques that are "Direct" in the sense that no features are extracted or matched from subsequent images in a time sequence. Some of the recent work on this has involved using RGB-D sensors, both stereo and kinect along with monocular 3D tracking. On this subject I was invited to do a talk at LDRMC 2011. Our presentation is available here: comport_LDRMC_talk.pdf This was the First International Workshop on Live Dense Reconstruction from Moving Cameras in conjunction with the ICCV 2011, November 12, Barcelona, Spain. The workshop was highly successful and brought together researchers in robotics and vision to synthesise state-of-the-art theory and practice in live dense reconstruction from moving cameras including passive and active RGB-D devices.
Some videos of this work can be found here:
1. Kinect dense RGB-D SLAM - video
Related publication: Audras, C., Comport, A. I., Meilland, M. & Rives, P (2011). Real-time dense RGB-D localisation and mapping. In Australian Conference on Robotics and Automation. Monash University, Australia.

2. A symmetric dense mapping and localisation using and combining different sensor modalities including stereo, mono and kinect - video

Related publication: Comport, A. I., Meilland, M. & Rives, P (2011). An asymmetric real-time dense visual localisation and mapping system. In First International Workshop on Live Dense Reconstruction from Moving Cameras In conjunction with the International Conference on Computer Vision (ICCV). Barcelona, Spain.
|
|
Last Updated on Friday, 10 February 2012 08:38 |
|
|
Visual SLAM for Space robots |
|
|
|
|
Multi-view SLAM for the control of space robots

This project aims at developing new techniques in real-time Simultaneous Localisation and Mapping (SLAM) using multi-view computer vision. This project focuses on applying these techniques to Space Exploration in collaboration with Thales Alenia Space. This will involve the study of autonomous navigation of space robots in unknown environments and safe landing. We have also participated in stage 2 of the ProVIS mars challenge.
Some videos of this work are provided below:
ITOKAWA Asteroid:

Video 1: With only intensity minimisation.
Video 2: With both intensity and depth minimisation.
Bunny test sequence

Video: Trajectory estimation with intensity and depth minimisation
Mars sequences

The primary part of this study concerns developing a visual perception system that localises the spacecraft with respect to reference bodies and performs visual servoing to achieve environment mapping and safe landing. The interest here will be to use the theory of multi-view differential geometry to perform closed-loop visual servoing control. It will be necessary to develop an analytic relationship between the movement of the space vehicle and the movement of external bodies such as asteroids and planets perceived by the visual sensors. This will also include identifying pertinent control parameters that adhere to a rigorous set of criterion (robustness, stability, precision, efficiency) along with the choice of technical constraints such as the optimal placement of the cameras. In particular, it will be of interest to develop a global task function which allows the aircraft to behave correctly in accordance with the online dynamics (for example, the movement of an asteroid), changing illumination conditions (according to the luminosity and the visibility), perturbations (such as varying temperatures) or large differences in perceived scale of the environment.
The second part will involve developing algorithms for mapping the environment both in an off-line learning phase and an online update phase. The former case will allow to reduce the computational complexity of the online problem by performing the bulk of the computational effort off-line. This will involve creation of a navigational framework that allows the creation of image-based maps within which a robot is to execute a global task. Given a set of training measurements, task planning consists in constructing optimal representations of these maps in such a way that localisation and control of the aircraft may be perform precisely whilst remaining invariant to dynamic changes in the environment. The online map will be updated and integrated into the control objective when new and better quality measurements become available such as during a landing phase where higher resolution images are available closer to the ground.
Some associated publications are:
Tykkälä, T. M. & Comport, A. I (2011). Direct Iterative Closest Point for Real-time Visual Odometry. In The Second international Workshop on Computer Vision in Vehicle Technology: From Earth to Mars in conjunction with the International Conference on Computer Vision. Barcelona, Spain.
Tykkala, T. M. & Comport, A. I (2011). A Dense Structure Model for Image Based Stereo SLAM. In IEEE International Conference on Robotics and Automation, ICRA'11. Shanghai, China.
Tykkala, T. & Comport, A. I (2010). A Dense Structure Model for Image Based Stereo SLAM. In Journee des Jeunes Chercheurs en Robotique - Journees Nationales du GDR Robotique. Paris, France.
|
|
Last Updated on Friday, 10 February 2012 08:31 |
|
This research is carried out within the French National Project ANR PREDIT CityVIP. My work is aimed at real-time localisation and mapping for autonomous navigation of an urban vehicule. Some associated publications are:
Some first results on autonomous navigation in real-world urban environments are available in the following videos (please click on the images):
Autonomous Navigation:

Learning Phase:

Comport, A. I., Meilland, M. & Rives, P (2011). A Real-Time Dense Visual Localisation and Mapping System. In Live Dense Reconstruction with Moving Cameras Workshop (LDRMC/ICCV), Barcelona, Spain. Meilland, M., Comport, A. I. & Rives, P (2011). Dense visual mapping of large scale environments for real-time localisation. In IEEE/RSJ International Conference on Intelligent Robots and Systems. San Francisco, California. Meilland, M., Comport, A. I. & Rives, P (2011). Real-time Dense Visual Tracking under Large Lighting Variations. In British Machine Vision Conference. University of Dundee. Meilland, M., Comport, A. I. & Rives, P (2010). A Spherical Robot-Centered Representation for Urban Navigation. In IEEE/RSJ International Conference on Intelligent Robots and Systems. Taipei, Taiwan. Gallegos, G., Meilland, M., Comport, A. I. & Rives, P (2010). Appearance-Based SLAM Relying on a Hybrid Laser/Omnidirectional Sensor. In IEEE/RSJ International Conference on Intelligent Robots and Systems. Taipei, Taiwan. |
|
Last Updated on Friday, 02 December 2011 07:59 |
|