LOCUS 2.0: Robust and Computationally Efficient Lidar Odometry for Real-Time Underground 3D Mapping

Andrzej Reinke; Matteo Palieri; Benjamin Morrell; Yun Chang; Kamak Ebadi; Luca Carlone; Ali-akbar Agha-mohammadi

doi:10.1109/lra.2022.3181357

Reinke, Andrzej;Palieri, Matteo;Morrell, Benjamin;Chang, Yun;Ebadi, Kamak;Carlone, Luca;Agha-mohammadi, Ali-akbar

2022-05-24 00:00:00

IEEE ROBOTICS AND AUTOMATION LETTERS. PREPRINT VERSION. ACCEPTED JUNE, 2022 1 LOCUS 2.0: Robust and Computationally Efﬁcient Lidar Odometry for Real-Time 3D Mapping 1;2 1;3 1 4 1 Andrzej Reinke , Matteo Palieri , Benjamin Morrell , Yun Chang , Kamak Ebadi , 4 1 Luca Carlone , Ali-akbar Agha-mohammadi Abstract—Lidar odometry has attracted considerable attention as a robust localization method for autonomous robots operat- ing in complex GNSS-denied environments. However, achieving reliable and efﬁcient performance on heterogeneous platforms in large-scale environments remains an open challenge due to the limitations of onboard computation and memory resources needed for autonomous operation. In this work, we present LOCUS 2.0, a robust and computationally-efﬁcient lidar odom- etry system for real-time underground 3D mapping. LOCUS 2.0 includes a novel normals-based Generalized Iterative Closest Point (GICP) formulation that reduces the computation time of point cloud alignment, an adaptive voxel grid ﬁlter that maintains the desired computation load regardless of the environment’s geometry, and a sliding-window map approach that bounds the memory consumption. The proposed approach is shown to be suitable to be deployed on heterogeneous robotic platforms involved in large-scale explorations under severe computation and memory constraints. We demonstrate LOCUS 2.0, a key element of the CoSTAR team’s entry in the DARPA Subterranean Challenge, across various underground scenarios. We release LOCUS 2.0 as an open-source library and also release a lidar-based odometry dataset in challenging and large- scale underground environments. The dataset features legged and wheeled platforms in multiple environments including fog, dust, Fig. 1: Four examples from our large underground lidar-based SLAM darkness, and geometrically degenerate surroundings with a total dataset consisting of over 16 km distance traveled and 11 h of operation of 11 h of operations and 16 km of distance traveled. across diverse environments: (a) a large-scale Limestone Mine (Kentucky Underground), (b) A 3-level urban environment with both large, open spaces Index Terms—SLAM, Data Sets for SLAM, Robotics in Under- and tight passages (LA Subway), (c) lava tubes with large vertical changes, Resourced Settings, Sensor Fusion and (d) lava tubes with narrow passages. LOCUS 2.0 performed successfully in all these environments on computationally constrained robots. I. I NTRODUCTION preferred over visual sensors to achieve reliable ego-motion estimation in cluttered environments with signiﬁcant illumina- IDAR odometry has emerged as a key tool for robust tion variations (e.g., search, rescue, industrial inspection and localization of autonomous robots operating in complex underground exploration). GNSS-denied environments. Lidar sensors do not rely on Lidar odometry algorithms aim to recover the robot’s motion external light sources and provide accurate long-range 3D between consecutive lidar acquisitions using scan registration. measurements by emitting pulsed light waves to estimate Through repeated observations of ﬁxed environmental fea- the range to surrounding obstacles, through time-of-ﬂight- tures, the robot can simultaneously estimate its movement, based techniques. For these reasons, lidar has been often construct a map of the unknown environment, and use this Manuscript received: February, 24, 2022; Revised May, 13, 2022; Accepted map to keep track of its position within it. May, 19, 2022. While many lidar odometry algorithms can achieve remark- This paper was recommended for publication by Editor Javier Civera upon evaluation of the Associate Editor and Reviewers’ comments. able accuracy, their computational cost can still be prohibitive This work was supported by the Jet Propulsion Laboratory - California for computationally-constrained platforms, reducing their ﬁeld Institute of Technology, under a contract with the National Aeronautics and of applicability in systems of heterogeneous robots, where Space Administration (80NM0018D0004). This work was partially funded by the Defense Advanced Research Projects Agency (DARPA). ©2022 All rights some of the robots may have very limited computational reserved. resources. Moreover, many existing approaches maintain the Reinke, Palieri, Morrell, Ebadi and Agha-mohammadi are with NASA global map in memory for localization purposes, making them Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA [email protected] unsuitable for large-scale explorations where the map size in Reinke is with University of Bonn, Germany [email protected] memory would signiﬁcantly increase. Palieri is with the Department of Electrical And Information Engineering, Our previous work [1] presents LOCUS 1.0, a multi-sensor Polytechnic University of Bari, IT [email protected] lidar-centric solution for high-precision odometry and 3D Chang and Carlone are with the Department of Aeronautics and As- tronautics, Massachusetts Institute of Technology, Cambridge, MA, USA. mapping in real-time featuring a multi-stage scan match- [email protected] ing module, equipped with health-aware sensor integration Digital Object Identiﬁer (DOI): see top of this page. arXiv:2205.11784v2 [cs.RO] 13 Jun 2022 2 IEEE ROBOTICS AND AUTOMATION LETTERS. PREPRINT VERSION. ACCEPTED JUNE, 2022 that fuses additional sensing modalities in a loosely-coupled solution exploiting bundle adjustment over a sliding window of scheme. While achieving remarkable accuracy and robustness lidar scans for enhanced mapping accuracy. While the paper in perceptually degraded settings, the previous version of claims nearly real-time operation, the method does not fuse LOCUS 1.0: i) had a more signiﬁcant computational load, ii) sensing modalities and maintains the entire map in memory. maintained the global map in memory, iii) was less robust to more generalized sensor failures, e.g., failure of one of B. Lidar-Inertial Odometry lidar sensor. LOCUS 2.0 presents algorithmic and system-level General challenges encountered in pure lidar-based odome- improvements to decrease the computational load and memory try estimators include degraded motion estimation in high-rate demand, enabling the system to achieve accurate and real-time motion scenarios [5], and degenerate motion observability in ego-motion estimation in challenging perceptual conditions geometrically-featureless areas (e.g. long corridors, tunnels) over large-scale exploration under severe computation and [6], [7]. For these reasons, lidars are commonly fused with memory constraints. additional sensing modalities to achieve enhanced accuracy in The new features and contributions of this work include perceptually-degraded settings [8], [9]. The work [10] presents (i) GICP from normals: a novel formulation of General- LIO-SAM, an accurate tightly-coupled lidar-inertial odometry ized Iterative Closest-Point (GICP) that leverages point cloud solution via smoothing and mapping that exploits a factor normals to approximate the point covariance calculation for graph for joint optimization of IMU and lidar constraints. enhanced computational efﬁciency, (ii) Adaptive voxel grid Scan-matching at a local scale instead of a global scale sig- ﬁlter that ensures deterministic and near-constant runtime, niﬁcantly improves the real-time performance. The work [11] independently of the surrounding environment and lidars, (iii) presents LILI-OM, a tightly-coupled lidar-inertial odometry improvement and evaluation of two sliding-window map solution with a lidar/IMU hierarchical keyframe-based sliding storage data structures: multi-threaded octree, ikd-tree [2], window optimization back-end. The work [12] presents LINS, and (iv) dataset release including in challenging, real-world a fast tightly-coupled fusion scheme of lidar and IMU with subterranean environments (urban, tunnel, cave), shown in error-state Kalman ﬁlter to recursively correct the estimated Fig. 1, collected by heterogeneous robot platforms. All these state by generating new feature correspondences in each features improve the computational and memory operation iteration. The work [13] presents RTLIO, a tightly-coupled while maintaining accuracy at the same level. The source code lidar-inertial odometry pipeline that delivers accurate and high- of LOCUS 2.0 has been released as an open-source library . frequency estimation for the feedback control of UAVs by The paper is organized as follows. Sec. II reviews related solving a cost function consisting of lidar and IMU residuals. work on lidar odometry. Sec. III describes the proposed system The work [14] presents FAST-LIO, a computationally-efﬁcient architecture with a focus on the updates made to the frame- lidar-inertial odometry pipeline that fuses lidar feature points work to enhance its performance in terms of computational with IMU data in a tightly-coupled scheme with an iterated load and memory usage. Sec. IV provides an ablation study of extended Kalman ﬁlter. A novel formula for computing the the system on datasets collected by heterogeneous robots dur- Kalman gain results in a considerable decrease of computa- ing the three circuits of the DARPA Subterranean Challenges, tional complexity with respect to the standard formulation, an international robotic competition where robots are tasked translating into decreased computation time. The work [15] to explore complex GNSS-denied underground environments presents DLIO, a lightweight loosely-coupled lidar-inertial autonomously. odometry solution for efﬁcient operation over constrained platforms. The work provides efﬁcient derivation of local II. R ELATED WORKS submaps for global reﬁnement constructed by concatenating Motivated by the need to enable real-time operation under point clouds associated with historical key-frames, along with computation constraints and large-scale explorations under a custom iterative closest point solver for fast and lightweight memory constraints in perceptually-degraded settings, we re- point cloud registration with data structure recycling that view the current state-of-the-art to assess whether any solution eliminates redundant calculations. While these methods are can satisfy these requirements simultaneously. computationally efﬁcient, the methods maintain a global map in memory, rendering it unsuitable for large-scale explorations over memory constraints. A. Lidar Odometry The work [3] proposes DILO, a lidar odometry technique C. Lidar-Visual-Inertial Odometry that projects a three-dimensional point cloud onto a two- dimensional spherical image plane and exploits image-based The work [9] presents Super Odometry, a robust, IMU- odometry techniques to recover the robot ego-motion in a centric multi-sensor fusion framework that achieves accu- frame-to-frame fashion without requiring map generation. This rate operation in perceptually-degraded environments. The results in dramatic speed improvements, however, the method approach divides the sensor data processing into several sub- does not fuse additional sensing modalities and is not open- factor-graphs where each sub-factor-graph receives the predic- source. The work [4] presents BALM, a lidar odometry tion from an IMU pre-integration factor, recovering the motion from a coarse to ﬁne manner and enhancing the real-time https://github.com/NeBula-Autonomy/nebula- odometry-dataset performance. The approach also adopts a dynamic octree data https://github.com/NeBula-Autonomy/LOCUS structure to organize the 3D points, making the scan-matching REINKE et al.: LOCUS 2.0 3 process very efﬁcient and reducing the overall computational Point Cloud Preprocessor demand. However, the method maintains the global map in Lidar N MDC Adaptive memory. Point Body Voxel Normal Lidar 2 MDC Cloud The work [8] presents LVI-SAM, a real-time tightly-coupled Filter Grid Computation Merger Filter lidar-visual-inertial odometry solution via smoothing and map- Lidar 1 MDC ping built atop a factor graph, comprising a visual-inertial sub- system (VIS) and a lidar-inertial subsystem (LIS). However, Scan Matching Unit the method maintains the global map in memory, which it not Point Cloud Space Lidar Mapper unsuitable for large-scale explorations with memory limited Map Monitor processing units. The work [16] proposes R2LIVE, an accurate Scan-to- Scan-to- Odom Sensor and computationally-efﬁcient sensor fusion framework for Scan Submap Integration Odom Inital Module IMU lidar, camera, and IMU that exhibits extreme robustness to Guess various failures of individual sensing modalities through ﬁlter- Fig. 2: LOCUS 2.0 architecture. based odometry. While not explicitly mentioned in the paper, the open-source implementation of this method features the the robot. LOCUS 2.0, in comparison to its predecessor, does integration of an ikd-tree data structure for map storage which not recalculate covariances but instead leverages a novel GICP could be exploited to keep in memory only a robot-centered formulation to use normals, which only need to be computed submap. once and stored in the map (Sec. III-A). In robots with multi-modal sensing, when available, LOCUS 2.0 uses an initial estimate from a non-lidar source (from III. SYSTEM DESCRIPTION Sensor Integration Module) to ease the convergence of the LOCUS 2.0 provides an accurate Generalized Iterative GICP in the scan-to-scan matching stage, by initializing the Closest Point (GICP) algorithm [17] based multi-stage scan optimization with a near-optimal seed that improves accuracy matching unit and a health-aware sensor integration module and reduces computation, enhancing real-time performance, as for robust fusion of additional sensing modalities in a loosely explained in [1]. coupled scheme. The architecture, shown in Fig. 2, contains LOCUS 2.0 also includes a more efﬁcient technique for map three main components: i) point cloud preprocessor, ii) scan storage. The system uses a sliding-window approach because matching unit, iii) sensor integration module. The point cloud large-scale areas are not feasible to be maintained in memory. preprocessor is responsible for the management of multiple- For example, in one of the cave datasets presented here, a input lidar streams to produce a uniﬁed 3D data product that global map at 1 cm resolution requires 50 GB of memory, can be efﬁciently processed by the scan matching unit. The far exceeding the typically available memory on small mobile preprocessor module consists of Motion Distortion Correction robots. This approach demands efﬁcient computational solu- (MDC) of the point clouds. This module corrects the distortion tions for insertion, deletion, and search. in the point cloud from sensor rotation during a scan due to robot movement using IMU measurements. A. GICP from normals Next, the Point Cloud Merger enlarges the robot ﬁeld-of- LOCUS 2.0 uses GICP for scan-to-scan and scan-to-submap view by combining point clouds from different lidar sensors matching. GICP generalizes the point-to-point and point-to- in the robot body frame using their known extrinsic transfor- plane ICP registration by using a probabilistic model for mation. To enable resilient merging of multiple lidar feeds, the registration problem [17]. To do this, GICP requires the we introduce an external timeout-based health monitor that availability of covariances for each point in the point clouds dynamically updates which lidars should be combined in the to be aligned. Covariances are usually calculated based on Point Cloud Merger (i.e. a lidar is ignored if its messages the distributions of neighboring points around a given point. are too delayed). The health monitoring makes the submodule Segal et al. [17] presents plane-to-plane application with the robust to lags and failures of individual lidars so that an output assumption that real-world surfaces are at least locally planar. product is always provided to the downstream pipeline. Then, In this formulation, points on surfaces are locally represented the Body Filter removes the 3D points that belong to the by a covariance matrix, where the point is known to belong robot. Next, the Adaptive Voxel Grid Filter maintains a ﬁxed to a plane with high-conﬁdence, but its exact location in the number of voxelized points to manage CPU load and to ensure plane has higher uncertainty. deterministic behavior. It allows the robot to have consistent Here, we show how plane-to-plane covariance calculation computational load regardless of the size of the environment is equivalent to calculating covariances from pre-computed or the number of lidars (or if potential lidar failures). In normals. The fact that only the normal is needed is especially comparison to LOCUS 1.0, the Adaptive Voxel Grid Filter important for scan-to-submap alignment since the map would changes the strategy of point cloud reduction from a blind otherwise require recomputing point covariances, which is an voxelization strategy with ﬁxed leaf size and random ﬁlter expensive operation involving the creation of a kd-tree and to an adaptive system (Sec. III-B). The Normal Computation nearest neighbors search. By instead using normals, the co- module calculates normals from the voxelized point cloud. variance computation is only performed once (since it is not The scan matching unit performs a GICP scan-to-scan and inﬂuenced by the addition of extra points), and the result can scan-to-submap registration to estimate the 6-DOF motion of be stored and reused. 4 IEEE ROBOTICS AND AUTOMATION LETTERS. PREPRINT VERSION. ACCEPTED JUNE, 2022 The most common implementations of GICP [18], [19] rely B A on computing the covariances C and C for each point i i i in two scans. That calculation of covariances in GICP takes place as a pre-processing step whenever two scans have to be registered. In the following, we describe how to obtain the covariances, without recomputing them every time a scan is collected. Any well-deﬁned covariance matrix C can be eigendecomposed to eigenvalues and eigenvectors [20], C = T T T u u + u u + u u where , , are 1 1 1 2 2 2 3 3 3 1 2 3 the eigenvalues of matrix C, and u , u , u are eigenvectors 1 2 3 of matrix C. Two eigenvalues with the same value can be Fig. 3: Illustration of our our sliding map approach. All points are maintained interpreted and visualized as eigenvectors that span equally until the robot reaches the boundary of the original window (step 3). Then, a 2D planar surface with the same distribution in each direction. new window is set, and points outside that window are deleted. This allows the covariance computation problem to be thought of geometrically. the variability of the input points that stems from different In the plane-to-plane metric, for a point a from scan A we i sensors conﬁgurations and cross-sectional geometry of the know the position along the normal with very high conﬁdence, environment. This design goal comes from the fact that almost but we are less sure about its location in the plane. To represent all computations in the registration stages are dependent on this, we set u = n, and assign = as the variance in the 1 1 a number of points N . Therefore the idea is to keep the normal direction, where is a small constant. We can then voxelized number of 3D points ﬁxed to have approximately choose the other eigenvectors to be arbitrary vectors in the ﬁxed computation time per scan. The approach is as follows: plane orthogonal to n and assign them comparatively large let us take any size of the initial voxel size d and set init eigenvalues = = 1, indicating that we are unsure about 2 3 d = d , where d is the size of the voxel leaf in the leaf init leaf location in the plane. current time stamp. We propose the following control scheme: scan Let us take any vector that lies on the plane and is d = d . The formula describes how much leaf leaf t+1 t desired perpendicular to the normal. The vector needs to satisfy should the current voxel size change d in comparison leaf t+1 the plane equation (that it is perpendicular to the normal to what the current size is d based on the ratio on the leaf vector) and cross the origin: n x + n y + n z = 0. Then x y z number of points in the current input scan N to the points scan n x+n y x y z = , therefore the family of vectors on that are desired for computation for given robot N . This n desired (x;y;(n x+n y)=n ) x y z simple technique maintains the number of points on the ﬁxed the plane is u = , where n and n 2 z x k(x;y;(n x+n y)=n k x y z level, while avoiding any large jumps in the numbers of points, corresponds to the component z and x of a normal vector n having too few points (e.g. a faulty scan) or having too many and n = 0 means the horizontal vector. The third vector u z 3 points. The result is an improvement in the efﬁciency and needs to simultaneously be perpendicular to u and u since 1 2 reduction of the computational load of the system. eigenvectors need to span the whole 3D space. Therefore, u = nu . If we know the eigenvectors and eigenvalues of 3 2 T T T matrix C, we have C = u u +1:0 u u +1:0 u u . 1 1 2 2 3 3 C. Sliding-window Map Substituting from above, we get: LOCUS 1.0 [1] stored the global map in memory through T T T an octree data structure. The native octree implementation C = n n + 1:0 u u + 1:0 (n u ) (n u ) 2 2 2 2 does not have an efﬁcient way to prune data out. While a (1) possible workaround is to ﬁlter the points around the robot and Then, if we take arbitrarily x = 1 and y = 0 then z = , rebuild the octree accordingly, this might be computationally and the covariance simpliﬁes to: expensive and lead to long refreshing times. (1; 0;n =n ) (1; 0;n =n ) x z x z To account for these challenges, and enable large-scale C = n n + (2) k(1; 0;n =n )k k(1; 0;n =n )k x z x z explorations under memory constraints, LOCUS 2.0 provides two map sliding-window approaches (Fig. 3): i) multi-threaded (1; 0;n =n ) (1; 0;n =n ) x z x z +n n octree, ii) incremental k-dtree [21] (ikd-tree). k(1; 0;n =n )k k(1; 0;n =n )k x z x z Multi-Threaded Octree approach maintains only a robot- The results mean that the covariance can be purely expressed centered submap of the environment in memory. Two parallel via precomputed normals at each point. threads (thread and thread ) each working on dedicated a b data structures (map /octree and map /octree ) are respon- a a b b B. Adaptive Voxel Grid Filter sible to dynamically ﬁlter the point cloud map around the current robot position through a box-ﬁlter, and rebuild the To manage the computation load of lidar odometry, regard- octree accordingly with the updated map, while accounting less of the environment and lidar conﬁguration (in terms of for robot motions between parallel worker processes. number of lidars and types), we propose an adaptive voxel grid Ikd-tree [21] is a binary search tree that dynamically stores ﬁlter. In this approach, the goal is to maintain the voxelized 3D points by merging new scans. Ikd-tree does not maintain number of points at a ﬁxed level (desired by the user) rather 3D points only in the leaf nodes: they have points in the than specifying the voxel leaf size and exposing the system to REINKE et al.: LOCUS 2.0 5 TABLE I: Dataset summary. Distance Duration ID Place Robot Characteristic lidars (m) (min) power plant feature-poor corridors, Elma, WA A Husky 631.53 59:56 3 large open spaces (Urban) 2-level, power plant stairs, Elma, WA B Spot 664.27 32:26 1 feature-poor corridors, (Urban) large & narrow spaces power plant (a) NeBula Spot robot (b) NeBula Husky robot feature-poor corridors, C Elma, WA Husky 757.40 24:21 3 large & narrow spaces (Urban) Fig. 4: Type of robots for heterogeneous robotic system in Nebula framework Bruceton Mine self-similar for DARPA Subterranean Challenge D Pittsburgh, PA Husky 1795.88 65:36 3 self-repetitive geometries (Tunnel) 3D map (provided by DARPA in the Subterranean Challenge Lava Beds National lava tubes and pools, E Monument, CA Spot 590.85 25:20 non-uniform environment, 1 or produced by the team) is used. The ground-truth trajectory (Cave) degraded lightning is produced by running LOCUS 1.0 against the survey-grade Bruceton Mine self-similar F Pittsburgh, PA Husky 1569.73 49:13 3 map (i.e. scan-to-map is scan-to-survey-map). In this mode, self-repetitive geometries (Tunnel) LOCUS 1.0 is tuned for maximum accuracy at the cost of com- power plant feature-poor corridors, G Elma, WA Husky 877.21 93:10 3 large & narrow spaces putational efﬁciency, as it does not need to be run in real-time. (Urban) The ground truth trajectory of the robot is determined based on 3-level, Subway Station multiple stairs, H Los Angeles, CA Spot 1777.45 46:57 3 LOCUS 1.0 and its multi-stage registration technique: scan- feature-poor corridors, (Urban) large & narrow spaces to-scan and scan-to-map (with high computational parameters Kentucky Underground large area, and slower pace of data processing) and some manual post- I Limestone Mine, KY Spot 768.82 19:28 non-uniform environment, 1 (Cave) degraded lightning processing work. These datasets have been made open-source Kentucky Underground large area, J Limestone Mine, KY Husky 2339.81 57:55 non-uniform environment, 3 to promote further research on lidar odometry and SLAM (Cave) degraded lightning in underground environments: github.com/NeBula-Autonomy/ For our experiments we use only two lidars. nebula-odometry-dataset. internal nodes as well. This structure allows dynamic insertion and deletion capabilities and relies on lazy labels storage B. Metrics across the whole data structure. Initial building of an ikd-tree is For CPU and memory proﬁling, a cross-platform library for similar to a kd-tree, where space is split at the median point retrieving information on running processes and system utiliza- along the longest dimension recursively. Points that are moved tion is used [23]. The library is used for system monitoring out of the boundaries of the ikd-tree data structure are not and proﬁling. The CPU represents the percentage value of deleted immediately, but they are labeled as deleted = True the current system-wide CPU utilization, where 100% means and maintain information until a rebalancing procedure is 1 core is used. The memory represents statistics by summing triggered. different memory values depending on the platform. Odometry delay measures the difference between odometry message IV. E XPERIM ENTAL RESULTS creation and the current timestamp of the system. The system A. Dataset is implemented in Robot Operating System (ROS) framework. This work considers maximum delay and mean delay since Over the last 3 years, Team CoSTAR [22] has intensively those two metrics more directly impact the performance of tested our lidar odometry system in real world environment the modules using the odometry result, e.g., controllers and such as caves, tunnels and abandoned factories. Each dataset path planners. Lidar callback time measures the duration time (Tab. I) is selected to contain components that are challenging for a scan at the time stamp t to go through a pipeline of for lidar odometry. The dataset provides lidar scans, IMU k processing from the queue of the lidar scans. Scan-to-scan and wheeled inertial odometry (WIO) measurements, as well time measures the duration time for a scan at time t to align cameras stream. All datasets have been recorded on differ- k with a scan at time t in GICP registration stage. Scan-to- ent robotics platforms, e.g., Husky and Spot (Fig. 4) with k1 submap time measures the duration time for a pre-aligned scan vibrations and large accelerations as is characteristic of both a from scan-to-scan at time t to align with a reference local skid-steer wheeled robot traversing rough terrain and a legged k map in GICP registration. robot that slips and acts dynamically in rough terrain. The Husky robot is equipped with 3 on-board VLP16 lidar sensors extrinsic calibrated (one ﬂat, one pitched forward 30 deg, one C. Computation time pitched backward 30 deg). The Spot robot is equipped with 1) GICP from normals: The experiments presented in this one on-board lidar sensor extrinsic calibrated. Spot out-of- section are designed to show the beneﬁt of GICP from normals the-box implements (kinematic inertial odometry) KIO and over GICP and support the claim that this reformulation yields (visual inertial odometry) V IO, therefore the data records better computation performance without sacriﬁcing accuracy. those readouts as well. Lidar scans are recorded at 10 Hz. For each dataset, we compute statistics over 5 runs. The WIO and IMU are recorded at 50 Hz. To determine the GICP parameters for this experiment are chosen based on ground truth of the robot in the environment, a survey-grade [24]. The parameters for scan-to-scan and scan-to-submap are 6 IEEE ROBOTICS AND AUTOMATION LETTERS. PREPRINT VERSION. ACCEPTED JUNE, 2022 the same: optimization step 1e10, maximum corresponding TABLE II: Relative memory and CPU change. distances for associations 0:3, maximum number of iterations ikd-tree mto 0:001 mto 0:01 mto 0:1 octree 0:001 in optimization 20, rotational ﬁtness score threshold 0:005. Memory 68:09% 38:88% 62:15% 87:76% X Husky computation runs in 4 threads, while Spot uses only 1 CPU 9:36% 50:42% 44:36% 19:61% X thread due to CPU limitations. The octree stores the map with a leaf size 0:001 m. 0:001 m is the baseline used in LOCUS 1.0 to maintain full The Fig. 5.a-e present the comparison results between GICP map information. To assess different parameters mto runs with from normals and GICP across datasets, while Fig. 5.f shows leaf size 0:1 m, 0:01 m, and 0:001 m. the average percentage change across all dataset for each For sliding-window approaches, the map size is 50 m metric with respect to the GICP method. GICP from normals since it is the maximum range of lidars. For scan-to-scan reduces all the computational metrics in LOCUS 2.0: mean and scan-to-submap stage GICP from normals is used with and max CPU usage, mean and max odometry delay, scan- the parameters chosen based on previous experiments. Fig. 9 to-scan, scan-to-submap, lidar callback duration and their presents the maximum memory use for F and I dataset maximum times. The computation burden is, on average, and how memory occupation evolves over time. The largest reduced by 18:57% for all those metrics and datasets. This memory occupancy is for octree and mto version for 0:001 m reduction beneﬁts the odometry update rate since frequency leaf size. The ikd-tree achieves similar performance in terms increases by 11:10% that is beneﬁcial for another part of of memory and CPU usage as the mto with leaf size 0:01m. the system, i.e. for path planning and control algorithms. Tab. II shows how sliding-window map approaches reduce the The lidar callback is generally higher for datasets I and J memory usage while increasing the CPU usage in comparison largely due to the consistently large cross-section of Kentucky to the reference method from LOCUS 1.0. Underground making GICP take longer. One drawback of Fig. 6.b shows the deletion, insertion and searching proce- this method leads to slight increase in the mean and max dure timings for different mapping strategies. Ikd-tree has the APE errors. The reason is that normals are calculated from most time-consuming procedures for searching, deletion, and sparse point clouds, and those normals are stored in the map insertion across all datasets. For insertion, ikd-tree uses on without any further recomputation. In GICP, the covariances average 222% more computation time than the octree data are recalculated from dense map point clouds. The mean structure as new points need to be stored in a particular and max APE error increases across all datasets on average manner. For search, ikd-tree gives on average 140% more 10:82%, but the increase is not for all datasets. Without computation than the octree data structure. including the tunnel dataset (F) the average APE error is only 2) Map size: These experiments show that the size of 5:23%. The rotational APE errors do not change much since the sliding map is an important parameter to consider while max APE decreases 0:94%, while mean APE increases 0:1%. trading off the computational, memory load and accuracy of the result. The sliding-window map allows the system to be 2) Adaptive Voxel Grid Filter: The second experiment bounded by the maximal memory that the robot allocates, presented in this section shows LOCUS 2.0 adaptive behavior. following the paradigms of Real-Time Operating System re- The experiments are run across all datasets with GICP from quirements. Fig. 10 shows the max APE, CPU and memory normals and the same parameters as in Sec. IV-C1 with an metrics for ikd-tree and mto in terms of map size. The smaller ikd-tree data structure for map maintenance with a box size map size gives the robot a lower upper bound for the memory, 50 m, and N ranging from 1000 to 10000. Fig. 6.a desired but on the other hand, instances with larger maps have lower shows how the adaptive voxel grid ﬁlter keeps the number of APE as there is more overlap between scan and map. Other points relatively consistent across a 1 hour dataset, no matter than memory, these larger maps also see larger the mean and what the input N is. desired max CPU load. There is still some variability in the computation time across a dataset, though. Nonetheless, as shown in Fig. 8, the ap- proach produces a consistent average computation time across E. Comparison to the state-of-the-art different environments and sensor conﬁgurations, without any Tab. III shows the comparison study for LOCUS 2.0 large spikes in computation time. This performance gives against the state-of-the-art methods FAST-LIO [14] and LINS more predictable computational loads, regardless of robot or [12] for different environment domains: urban, tunnel, cave environment, as further reinforced in Fig. 7, where the average (A,C,F,H,I,J). The table shows that LOCUS 2.0 presents a callback time and CPU load are similar for all datasets at the state-of-the-art performance in terms of max and mean APE same adaptive voxelization setting. error metrics and achieves the smallest errors in 5 out of 6 presented datasets. In addition, LOCUS 2.0 is the only method D. Memory that does not fail in the tunnel type of the environment (dataset F) where lidar slip occurs. In terms of computation, LOCUS 1) Map maintenance: The third experiment presented in 2.0 achieves equivalent performance to FAST-LIO. Memory this section presents the beneﬁt of sliding-window maps in usage for LOCUS 2.0 is slightly larger, yet this is likely real-time systems compared to classic, static octree-based related to the resolution of the map chosen by default in all structures. In these experiments, LOCUS 2.0 uses: ikd-tree the systems. and multi-threaded octree (mto). The octree with leaf size REINKE et al.: LOCUS 2.0 7 Fig. 5: Results of GICP from normals and GICP comparison in LOCUS 2.0. For the meaning of the labels A-J see Tab. I (a) Husky urban dataset (A). (b) Husky cave dataset (J). Fig. 8: A comparison of the consistency of computation time for adaptive Fig. 6: (a) Number of points after the adaptive voxel grid ﬁlter for different voxelization with 3000 points and constant leaf size 0:25 (our previously set-points (on dataset I). (b) Timeplots for deleting, adding, searching for used static voxel size). a) Urban dataset using 2 lidars. b) Cave dataset using different map storage mechanisms. 3 lidars. TABLE III: Comparison of LOCUS 2.0 to the state-of-the art methods FAST-LIO [14] and LINS [12] for different environment domains. Dataset Algorithms APE CPU [%] max memory max [m] mean [%] max mean [GB] LOCUS 2.0 0.19 0.09 102:38 185:50 1:06 A FAST-LIO 0:79 0:30 89:11 126:40 0.36 LINS 0:43 0:18 40.84 81.50 0:42 LOCUS 2.0 0.16 0.24 114:79 198:00 1:30 C FAST-LIO 2:21 4:22 76:46 307:20 0:99 LINS 0:43 0:60 38.43 75.30 0.47 LOCUS 2.0 0.67 0.45 119:00 229:20 1:98 F FAST-LIO 48555:33 9268:71 156:73 401:30 11:31 Fig. 7: The ﬁgure presents comparison between adaptive voxelization LINS 52:73 23:35 28.10 52.30 0.47 1000 10000 points and constant leaf size 0:25 (our previously used static LOCUS 2.0 0.57 0.23 61:05 169:90 2:42 voxel size). For 0:25 voxel the average number points for each dataset (A, H FAST-LIO 5:92 5:69 75:15 160:80 0:62 C, D, F, G, J) is 8423, 5658, 2967, 2368, 8511, 15901. LINS 12:11 8:05 39.19 97.90 0.61 LOCUS 2.0 1:39 1:95 72.11 141:60 1:01 V. CONCLUSIONS I FAST-LIO 0:99 1:44 117:87 167:80 0.80 LINS 0.86 0.85 75:90 101.40 0:85 This work presents LOCUS 2.0, a robust and computational LOCUS 2.0 2:42 3:88 107:72 185:00 2:13 efﬁcient lidar odometry system for real-time, large-scale explo- J FAST-LIO 1.72 2.60 126:72 332:50 2:54 LINS 3:56 5:79 73.76 176.50 1.85 rations under severe computation and memory constraints suit- able to be deployed over heterogeneous robotic platforms. This the computational load independent on the environment and work reformulates GICP covariance calculations from precom- sensor conﬁguration. Adaptive behavior keeps the number of puted normals that improves the computational performance of points from the lidar consistent while keeping the voxelized GICP. LOCUS 2.0 uses an adaptive voxel grid ﬁlter and makes 8 IEEE ROBOTICS AND AUTOMATION LETTERS. PREPRINT VERSION. ACCEPTED JUNE, 2022 (a) Husky tunnel (F). (b) Spot cave (I). Fig. 10: Results based on size of the map for ikd-tree and mto 0:001 across all datasets (A-J). (c) Memory in time for (F). (d) Memory in time for (I). [9] S. Zhao, H. Zhang, P. Wang, L. Nogueira, and S. Scherer, “Super odometry: Imu-centric lidar-visual-inertial estimator for challenging Fig. 9: The plots show the metrics for different datasets relative to the environments,” arXiv preprint arXiv:2104.14938, 2021. number of ﬁxed voxelized points. [10] T. Shan, B. Englot, D. Meyers, W. Wang, C. Ratti, and D. Rus, “Lio-sam: structure of the environment, which stabilizes and improves Tightly-coupled lidar inertial odometry via smoothing and mapping,” in 2020 IEEE/RSJ International Conference on Intelligent Robots and the computational load. We evaluate two sliding map strategies Systems (IROS). IEEE, 2020, pp. 5135–5142. for reducing memory use: multi-threaded octree and ikd-tree, [11] K. Li, M. Li, and U. D. Hanebeck, “Towards high-performance solid- and show both their computational cost, and improvement in state-lidar-inertial odometry and mapping,” IEEE Robotics and Automa- tion Letters, vol. 6, no. 3, pp. 5167–5174, 2021. memory usage. We open-source both LOCUS 2.0, and our [12] C. Qin, H. Ye, C. E. Pranata, J. Han, S. Zhang, and M. Liu, “Lins: A dataset for challenging and large-scale underground environ- lidar-inertial state estimator for robust and efﬁcient navigation,” in 2020 ments that features various real-world conditions such as fog, IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2020, pp. 8899–8906. dust, darkness, and geometrically degenerate environments [13] J.-C. Yang, C.-J. Lin, B.-Y. You, Y.-L. Yan, and T.-H. Cheng, “Rtlio: that restrict mobility. Overall the datasets include 11 h of Real-time lidar-inertial odometry and mapping for uavs,” Sensors, operations and 16 km distance traveled. vol. 21, no. 12, p. 3955, 2021. [14] W. Xu, Y. Cai, D. He, J. Lin, and F. Zhang, “Fast-lio2: Fast direct REFERENCES lidar-inertial odometry,” arXiv preprint arXiv:2107.06829, 2021. [1] M. Palieri, B. Morrell, A. Thakur, K. Ebadi, J. Nash, A. Chatterjee, [15] K. Chen, B. T. Lopez, A.-a. Agha-mohammadi, and A. Mehta, “Direct C. Kanellakis, L. Carlone, C. Guaragnella, and A.-a. Agha-Mohammadi, lidar odometry: Fast localization with dense point clouds,” arXiv preprint “Locus: A multi-sensor lidar-centric solution for high-precision odome- arXiv:2110.00605, 2021. try and 3d mapping in real-time,” IEEE Robotics and Automation Letters, [16] J. Lin, C. Zheng, W. Xu, and F. Zhang, “R2live: A robust, real-time, vol. 6, no. 2, pp. 421–428, 2020. lidar-inertial-visual tightly-coupled state estimator and mapping,” arXiv [2] Y. Cai, W. Xu, and F. Zhang, “ikd-tree: An incremental kd tree for preprint arXiv:2102.12400, 2021. robotic applications,” arXiv preprint arXiv:2102.10808, 2021. [17] A. Segal, D. Haehnel, and S. Thrun, “Generalized-icp,” in In Robotics: [3] S.-J. Han, J. Kang, K.-W. Min, and J. Choi, “Dilo: Direct light detection science and systems, vol. 2, no. 4, 2009, p. p. 435. and ranging odometry based on spherical range images for autonomous [18] R. B. Rusu and S. Cousins, “3d is here: Point cloud library (pcl),” in driving,” ETRI Journal, vol. 43, no. 4, pp. 603–616, 2021. ICRA. IEEE, 2011, pp. 1–4. [4] Z. Liu and F. Zhang, “BALM: Bundle adjustment for lidar mapping,” [19] K. Koide, M. Yokozuka, S. Oishi, and A. Banno, “Voxelized gicp for IEEE RAL, vol. 6, no. 2, pp. 3184–3191, 2021. fast and accurate 3d point cloud registration,” EasyChair Preprint no. [5] J. Zhang and S. Singh, “Loam: Lidar odometry and mapping in real- 2703, EasyChair, 2020. time.” in Robotics: Science and Systems, vol. 2, 2014, p. 9. [20] G. Strang, Introduction to Linear Algebra, 4th ed. Wellesley, MA: [6] A. Tagliabue, J. Tordesillas, X. Cai, A. Santamaria-Navarro, J. P. Wellesley-Cambridge Press, 2009. How, L. Carlone, and A.-a. Agha-mohammadi, “Lion: Lidar-inertial [21] Y. Cai, W. Xu, and F. Zhang, “ikd-tree: An incremental k-d tree for observability-aware navigator for vision-denied environments,” arXiv robotic applications,” 02 2021. preprint arXiv:2102.03443, 2021. [22] A. Agha et al., “Nebula: Quest for robotic autonomy in challenging [7] K. Ebadi, M. Palieri, S. Wood, C. Padgett, and A. akbar Agha- environments; team costar at the darpa subterranean challenge,” ArXiv, mohammadi, “DARE-SLAM: Degeneracy-aware and resilient loop vol. abs/2103.11470, 2021. closing in perceptually-degraded environments,” Journal of Intelligent [23] “psutil,” https://psutil.readthedocs.io/en/latest/. Robotic Systems, vol. 102, no. 1, pp. 1–25, 2021. [24] A. Reinke, “Advanced lidar odometry for exploration of unknown [8] T. Shan, B. Englot, C. Ratti, and D. Rus, “Lvi-sam: Tightly-coupled underground environments,” Master’s thesis, University of Bonn, 2022. lidar-visual-inertial odometry via smoothing and mapping,” arXiv preprint arXiv:2104.10831, 2021.

http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png

Computing Research Repository arXiv (Cornell University)

http://www.deepdyve.com/lp/arxiv-cornell-university/locus-2-0-robust-and-computationally-efficient-lidar-odometry-for-real-t0Na5tgsVL

LOCUS 2.0: Robust and Computationally Efficient Lidar Odometry for Real-Time Underground 3D Mapping

Loading next page...

References

References for this paper are not available at this time. We will be adding them shortly, thank you for your patience.

eISSN: ARCH-3344
DOI: 10.1109/lra.2022.3181357
Publisher site: See Article on Publisher Site

Abstract

Journal

Computing Research Repository – arXiv (Cornell University)

Published: May 24, 2022

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 7-Day Trial for You or Your Team.

Learn More →

LOCUS 2.0: Robust and Computationally Efficient Lidar Odometry for Real-Time Underground 3D Mapping

LOCUS 2.0: Robust and Computationally Efficient Lidar Odometry for Real-Time Underground 3D Mapping

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 7-Day Trial for You or Your Team.

Learn More →

LOCUS 2.0: Robust and Computationally Efficient Lidar Odometry for Real-Time Underground 3D Mapping

LOCUS 2.0: Robust and Computationally Efficient Lidar Odometry for Real-Time Underground 3D Mapping

References

Abstract

Journal

Recommended Articles

There are no references for this article.

Our policy towards the use of cookies