
Tightly-coupled visual-inertial odometry with robust feature association in dynamic illumination environments

Published online by Cambridge University Press: 04 June 2025

Jie Zhang
Affiliation:
Institute of Advanced Technology, University of Science and Technology of China, Hefei 230031, China
Cong Zhang
Affiliation:
Department of Automation, University of Science and Technology of China, Hefei 230027, China
Qingchen Liu
Affiliation:
Department of Automation, University of Science and Technology of China, Hefei 230027, China
Qichao Ma
Affiliation:
Department of Automation, University of Science and Technology of China, Hefei 230027, China
Jiahu Qin*
Affiliation:
Department of Automation, University of Science and Technology of China, Hefei 230027, China
*Corresponding author: Jiahu Qin; Email: jhqin@ustc.edu.cn

Abstract

This paper focuses on feature-based visual-inertial odometry (VIO) in dynamic illumination environments. Because dynamic illumination leads to unstable feature association and thereby degrades the performance of most existing feature-based VIO methods, we propose a tightly-coupled VIO algorithm, termed RAFT-VINS, which integrates a Lite-RAFT tracker into a visual-inertial navigation system (VINS). The key module of this odometry algorithm is a lightweight optical flow network designed for accurate feature tracking in real time. It guarantees robust feature association in dynamic illumination environments and thereby ensures the performance of the odometry. In addition, to further improve the accuracy of pose estimation, a moving consistency check strategy is developed in RAFT-VINS to identify and remove outlier feature points. Meanwhile, a tightly-coupled optimization-based framework fuses IMU and visual measurements within a sliding window for efficient and accurate pose estimation. Through comprehensive experiments on public datasets and in real-world scenarios, the proposed RAFT-VINS is validated in its capacity to provide reliable pose estimates in challenging dynamic illumination environments. Our code is open-sourced at https://github.com/USTC-AIS-Lab/RAFT-VINS.
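The abstract only names the moving consistency check; the paper's exact formulation is not reproduced here. As a rough illustration of what such an outlier-rejection step can look like, the following Python sketch flags tracked features whose image motion disagrees with the camera ego-motion between two frames (for example, a relative pose predicted by IMU propagation). The function names, the Sampson-distance criterion, and the pixel threshold are illustrative assumptions, not details taken from RAFT-VINS.

```python
import numpy as np

def skew(v):
    """Skew-symmetric matrix of a 3-vector."""
    return np.array([[0.0, -v[2], v[1]],
                     [v[2], 0.0, -v[0]],
                     [-v[1], v[0], 0.0]])

def motion_consistency_mask(pts_prev, pts_curr, R, t, K, thresh_px=1.0):
    """Keep feature tracks consistent with the ego-motion (R, t).

    pts_prev, pts_curr : (N, 2) pixel coordinates of tracked features
    R, t               : relative rotation / translation, frame 1 -> frame 2
    K                  : 3x3 camera intrinsic matrix
    Returns a boolean mask: True = consistent track, False = likely outlier
    (e.g. a point on a moving object or a bad optical-flow match).
    """
    # Essential and fundamental matrices from the predicted relative pose.
    E = skew(t) @ R
    K_inv = np.linalg.inv(K)
    F = K_inv.T @ E @ K_inv

    # Homogeneous pixel coordinates.
    ones = np.ones((pts_prev.shape[0], 1))
    x1 = np.hstack([pts_prev, ones])
    x2 = np.hstack([pts_curr, ones])

    # Sampson distance (first-order approximation of the squared
    # geometric distance to the epipolar constraint, in pixel^2).
    Fx1 = x1 @ F.T          # epipolar lines of x1 in image 2
    Ftx2 = x2 @ F           # epipolar lines of x2 in image 1
    num = np.sum(x2 * Fx1, axis=1) ** 2
    den = Fx1[:, 0]**2 + Fx1[:, 1]**2 + Ftx2[:, 0]**2 + Ftx2[:, 1]**2
    sampson = num / np.maximum(den, 1e-12)

    return sampson < thresh_px ** 2
```

In a pipeline of this kind, the surviving features would then be the only ones passed on as visual measurements to the sliding-window optimization; the threshold and the source of (R, t) are design choices that would need tuning per system.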

Type
Research Article
Copyright
© The Author(s), 2025. Published by Cambridge University Press


Supplementary material: Zhang et al. supplementary material (File, 7.4 MB)