pp. 167-181
S&M3161 Research Paper of Special Issue https://doi.org/10.18494/SAM4225 Published: January 31, 2023 A Model of Real-time Pose Estimation Fusing Camera and LiDAR in Simultaneous Localization and Mapping by a Geometric Method [PDF] De Chen, Qingdong Yan, Zhi Zeng, Junfeng Kang, and Junxiong Zhou (Received October 31, 2022; Accepted January 16, 2023) Keywords: light detection and ranging (LiDAR), RGB-D (RGB-depth map), robot, simultaneous localization and mapping (SLAM), pose estimation, minimum bounding rectangle (MBR)
Simultaneous localization and mapping (SLAM) is the key technology for achieving autonomous navigation and stable walking for robots. For addressing a dynamic and special environment indoors and outdoors, there are still some limitations in using a single sensor to estimate and locate a robot’s position and orientation. To further improve the accuracy of SLAM positioning in real time, in this study, we combine the advantages of the RGB-depth map (RGB-D) and light detection and ranging (LiDAR) and propose a model of a two-stage deep fusion framework named convolutional neural network (CNN)–LiDAR vision inertial measurement unit (CNN–LVI) for real-time pose estimation by a geometric method. Unlike existing methods that use either a two-stage framework or multistage pipelines, the proposed framework fuses image and raw 3D point cloud data after multisensor joint calibration, and then uses 3D point clouds as spatial anchors to predict the pose between two sequence frames. By using a CNN algorithm to identify and extract a 3D bounding box, the target object projection of an RGB image is tracked to obtain the target minimum bounding rectangle (MBR). Finally, the rotation angle and translation distance are calculated by a geometric method using the centroid of the target MBR, so as to combine an inertial measurement unit to perform joint optimization, achieve the pose estimation of a robot, and further improve the model’s location accuracy. Experiments show that the proposed model achieves significant performance improvement compared with many other methods in the car class and achieves the best trade-off between state-of-the-art performance and accuracy on the benchmark with the KITTI dataset.
Corresponding author: Qingdong YanThis work is licensed under a Creative Commons Attribution 4.0 International License. Cite this article De Chen, Qingdong Yan, Zhi Zeng, Junfeng Kang, and Junxiong Zhou, A Model of Real-time Pose Estimation Fusing Camera and LiDAR in Simultaneous Localization and Mapping by a Geometric Method, Sens. Mater., Vol. 35, No. 1, 2023, p. 167-181. |