A Distributed Multi-UGV Exploration Framework With Loop-Aware Planning and Descriptor-Aided Localization in Resource-Limited Environments

TL;DR

This paper introduces a fully distributed multi-UGV exploration framework combining descriptor-based loop closure detection and loop-aware hierarchical planning, achieving AR@1 of 89.9% and reducing exploration time by 15%.

cs.RO 🔴 Advanced 2026-06-10 58 views

Zhiwei Li Haiou Liu Xijun Zhao Ji Li Yingze Wang Boyang Wang

AI Reader Arxiv Page Download PDF

multi-robot systems distributed SLAM loop closure hierarchical planning resource-limited environments

Key Findings

Methodology

The proposed framework integrates a lightweight, spectral-guided LiDAR global descriptor with a prealignment mechanism to enable robust cross-UGV place recognition under large yaw and lateral variations. Verified loop closures are used to maintain globally consistent trajectories through decentralized pose graph optimization (e.g., iSAM2). The system employs an uncertainty-aware loop selection module that scores candidate loops based on pose uncertainty and utility, selecting high-value loops as planning anchors. Hierarchical exploration combines global task allocation via MDVRP and local path refinement through TSP, with loop information actively incorporated into planning. The entire system operates asynchronously, reducing communication overhead while ensuring real-time global consistency across multiple UGVs.

Key Results

The loop closure module achieves an AR@1 of 89.9% and AR@1% of 95.5%, outperforming baseline descriptors like LiDAR-Iris, Scan Context++, and OverlapTransformer, especially in challenging large-yaw and lateral shift scenarios.
Distributed optimization reduces absolute trajectory error significantly, with average errors below 0.45 meters across complex environments, outperforming standalone Fast-LIO2 and other SLAM systems.
Communication volume is substantially decreased (~40%), enabling efficient operation under bandwidth constraints. Exploration time is shortened by 15%, and travel distance by 14%, demonstrating high efficiency and robustness.

Significance

This work addresses critical challenges in multi-UGV autonomous exploration in GPS-denied, resource-constrained environments. By combining robust descriptor-based loop detection with uncertainty-aware planning, it enhances localization robustness, reduces redundant coverage, and improves global map consistency. The framework’s decentralized nature and active loop utilization mark a significant step forward, enabling scalable, reliable multi-robot operations in complex scenarios such as disaster response, underground inspection, and planetary exploration. It bridges the gap between theoretical SLAM advances and practical deployment needs, offering a comprehensive solution that balances accuracy, efficiency, and communication economy.

Technical Contribution

The paper’s main contributions include: (1) a spectral-guided range image prealignment technique that improves cross-UGV place recognition robustness; (2) an uncertainty-aware loop selection mechanism that actively filters high-utility loops for global and local planning; (3) a hierarchical exploration strategy integrating global task allocation with loop-informed local path planning; (4) an asynchronous distributed optimization framework ensuring real-time global consistency with minimal communication load. These innovations collectively advance the state-of-the-art in decentralized multi-robot SLAM and exploration, enabling more robust and efficient autonomous operations in resource-limited environments.

Novelty

This work is the first to explicitly incorporate a spectral-guided prealignment mechanism into a distributed multi-UGV SLAM system, coupled with an active, uncertainty-based loop selection strategy that directly influences hierarchical planning. Unlike prior approaches that treat loop closures as passive constraints, this framework actively scores and integrates loop information as planning anchors, significantly improving robustness and efficiency. Its decentralized architecture and lightweight communication protocol make it particularly suitable for large-scale, bandwidth-limited scenarios, filling a notable gap in the current research landscape.

Limitations

The system relies heavily on high-quality LiDAR sensors and spectral features; in environments with severe dynamic clutter or sensor degradation, performance may decline.
The computational load of descriptor extraction, loop verification, and distributed optimization, while optimized, still poses challenges for extremely large environments or real-time constraints in highly dynamic scenes.
Robustness in highly dynamic or cluttered environments with frequent occlusions remains to be further validated, and the current framework may need adaptation for such scenarios.

Future Work

Future research will explore integrating multi-modal sensors, such as visual cameras and radar, to enhance robustness in diverse environments. Learning-based methods for adaptive loop screening and path planning will be investigated to further improve efficiency. Scaling the framework for larger teams of UGVs and extending it to dynamic, highly cluttered environments are key directions. Additionally, optimizing communication protocols and computational efficiency will be prioritized to support deployment in real-world, bandwidth-limited scenarios.

AI Executive Summary

In recent years, autonomous multi-robot exploration has emerged as a promising solution for mapping and navigating complex, GPS-denied environments such as disaster zones, underground tunnels, and extraterrestrial terrains. Traditional approaches often rely on centralized architectures or dense map sharing, which become impractical in resource-limited settings due to bandwidth constraints and the need for real-time operation. This paper addresses these challenges by proposing a fully distributed multi-UGV exploration framework that combines robust descriptor-based loop closure detection with an active, loop-aware hierarchical planning strategy.

The core innovation lies in developing a spectral-guided LiDAR global descriptor that achieves high invariance to large yaw and lateral shifts, enabling reliable cross-UGV place recognition even under severe viewpoint variations. This descriptor is used to identify loop closures, which are then verified via Fast-GICP and incorporated into a decentralized pose graph optimized asynchronously using iSAM2. The system maintains a sparse topological map built from obstacle-aware voxel graphs, reducing communication overhead while preserving environmental structure.

A key feature of the framework is the uncertainty-aware loop selection module. It scores candidate loops based on pose uncertainty and utility metrics, selecting only the most informative loops as planning anchors. These anchors are actively integrated into a hierarchical exploration process that combines global task allocation via a multi-depot vehicle routing problem (MDVRP) with local path refinement through a Traveling Salesman Problem (TSP). This integration ensures efficient coverage, workload balancing, and enhanced localization robustness.

Extensive experiments in simulation and real-world UGV platforms demonstrate the system’s effectiveness. The approach achieves an AR@1 of 89.9%, surpassing baseline descriptors, and reduces exploration time and travel distance by 15% and 14%, respectively. The communication load is also significantly decreased, making it suitable for bandwidth-limited environments. These results validate the framework’s potential for real-world deployment in scenarios demanding high robustness, efficiency, and scalability.

Overall, this work marks a substantial advancement in decentralized multi-robot SLAM and exploration, providing a practical, scalable solution that enhances the autonomy and reliability of UGV teams operating in challenging environments. Its active use of loop information as planning anchors, combined with lightweight communication and distributed optimization, opens new avenues for large-scale autonomous exploration in resource-constrained settings.

Deep Analysis

Background

The evolution of autonomous exploration has transitioned from single-robot SLAM systems like ORB-SLAM and RTAB-Map to multi-robot frameworks aiming for scalability and robustness. Early multi-UGV systems relied on centralized architectures, which faced bottlenecks in communication and computation, especially in GPS-denied environments. Distributed SLAM approaches, such as iSAM2 and decentralized pose graph optimization, addressed some scalability issues but struggled with reliable loop closure detection under viewpoint variations. Traditional descriptors like Scan Context and LiDAR-Iris provided some invariance but lacked robustness in large yaw shifts or dynamic scenes. Recent deep learning-based descriptors, such as OverlapTransformer, improved recognition but at higher computational costs. Path planning strategies like frontier-based exploration and hierarchical task allocation have been developed to improve coverage efficiency, yet they often treat loop closures passively, missing opportunities for active refinement. Overall, the field has made significant progress but still faces challenges in achieving robust, real-time, resource-efficient multi-UGV exploration in complex, unknown environments.

Core Problem

The core challenge in multi-UGV exploration within resource-limited, GPS-denied environments is maintaining accurate, globally consistent localization without relying on centralized infrastructure. This involves overcoming pose drift caused by environmental ambiguities, viewpoint changes, and limited communication bandwidth. Traditional loop closure detection methods often fail under severe yaw and lateral shifts, leading to map misalignments and redundant coverage. Additionally, existing path planning approaches lack active utilization of loop information, resulting in inefficiencies and increased exploration time. The problem becomes more complex when considering the need for decentralized operation, real-time performance, and minimal communication overhead, all while ensuring environmental coverage and localization robustness. Addressing these issues requires innovative descriptors, active loop screening, and integrated hierarchical planning strategies.

Innovation

This work introduces several key innovations:

1) Spectral-guided range image prealignment: By leveraging spectral features and gradient saliency, the system achieves viewpoint-invariant descriptors that are robust to large yaw and lateral shifts, enabling reliable cross-UGV place recognition.

2) Uncertainty-aware loop selection: The framework scores candidate loops based on pose uncertainty and utility metrics derived from covariance propagation, actively selecting high-value loops as planning anchors.

3) Loop-informed hierarchical planning: Integrates loop information into global task allocation via MDVRP and local path refinement through TSP, ensuring efficient coverage and robust localization.

4) Asynchronous distributed optimization: Employs decentralized pose graph updates with minimal communication, maintaining global consistency in resource-constrained environments.

These innovations collectively enable a highly robust, efficient, and scalable multi-UGV exploration system suited for complex unknown terrains.

Methodology

�� Sensor input: Each UGV employs a rotating 3D LiDAR (e.g., RoboSense Helios-16) to acquire dense point clouds, which are converted into multi-elevation range images.
�� Descriptor extraction: Spectral cues are computed by summing low-frequency Fourier magnitudes along elevation, combined with median vertical range gradients, to generate saliency maps. The range image is circularly shifted to align with maximum saliency, then passed through a lightweight Swin Transformer-based network to produce a 256-dimensional global descriptor.
�� Cross-UGV loop detection: Descriptors are stored in local KD-trees; upon new keyframes, nearest neighbors are retrieved, gating matches by similarity. Verified matches undergo Fast-GICP alignment, and confirmed loop closures are asynchronously inserted into a decentralized pose graph optimized via iSAM2.
�� Map representation: A sparse obstacle-aware topological graph is constructed within a local planning horizon, with nodes representing traversable voxel centers and edges representing collision-free paths. Subgraph fusion occurs asynchronously through descriptor-based node matching.
�� Hierarchical planning: Frontier voxels are clustered to identify exploration targets. Candidate viewpoints are sampled on a spherical shell, scored by information gain, and assigned via MDVRP. Path refinement employs TSP on the selected viewpoints, incorporating loop closures as waypoints.
�� Loop selection: Candidate loops are scored based on pose covariance and Euclidean distance, with top-K loops chosen as planning anchors, actively influencing path planning and localization.

Experiments

�� Platforms: Experiments utilize a fleet of UGVs equipped with RoboSense Helios-16 LiDAR, IMU, onboard GPU (Nvidia RTX 4080), and onboard computation, both in real-world environments and Gazebo simulation.
�� Datasets: Evaluation uses KITTI and Mulran datasets for loop detection robustness, covering diverse scenarios such as dynamic occlusion, large yaw variation, and reverse revisits.
�� Metrics: Performance is measured via AR@1 and AR@1%, absolute trajectory error (ATE), communication volume, exploration time, and path length.
�� Baselines: Comparisons include Fast-LIO2, DCL-SLAM (LiDAR-Iris), Kimera-Multi, and other recent descriptors.
�� Hyperparameters: Descriptor dimension set to 256, top-K loop candidates for planning, and path optimization via A* and TSP algorithms.
�� Validation: Both simulation and real-world tests confirm the system’s robustness, efficiency, and scalability across environments with different structural complexities.

Results

�� Loop closure detection: Achieves AR@1 of 89.9% and AR@1% of 95.5%, outperforming LiDAR-Iris (AR@1 84.57%) and deep learning descriptors like OverlapTransformer (AR@1 80.67%) especially under large viewpoint shifts.
�� Localization accuracy: Average trajectory error drops below 0.45 meters across complex environments, outperforming standalone Fast-LIO2 and other SLAM systems.
�� Communication efficiency: Total data exchange volume is reduced by approximately 40%, enabling operation under bandwidth constraints.
�� Exploration efficiency: Overall exploration time is shortened by 15%, and total travel distance by 14%, demonstrating the effectiveness of active loop utilization and hierarchical path planning.
�� Scalability: The framework maintains high performance with increasing number of UGVs, validating its suitability for large-scale deployments.

Applications

�� Immediate applications include disaster response in GPS-denied urban or underground environments, underground mining, tunnel inspection, and planetary surface exploration.
�� The system requires high-quality LiDAR sensors, stable wireless communication, and onboard computational resources.
�� Industry impact involves improved autonomous navigation robustness, reduced operational costs, and enhanced scalability for multi-robot deployments in complex terrains.

Limitations & Outlook

�� Dependence on high-performance sensors: In environments with poor sensor data or dynamic clutter, descriptor robustness may decline.
�� Computational load: Descriptor extraction, loop verification, and distributed optimization pose challenges for real-time operation in extremely large or highly dynamic environments.
�� Environmental complexity: Highly dynamic scenes with frequent occlusions or moving obstacles may impair loop detection and map consistency, requiring further robustness enhancements.

Plain Language Accessible to non-experts

Imagine a team of explorers venturing into a vast, unfamiliar maze without any maps or GPS signals. Each explorer carries a special camera that can scan the surroundings and create a unique 'fingerprint' of each place they visit. Every time they pass through a new spot, they take a picture and generate this fingerprint, which helps them recognize if they've been there before.

Sometimes, different parts of the maze look very similar, making it hard to tell if they’re revisiting the same place. To solve this, the explorers compare their fingerprints to see if they match. When they find a match, they can use that information to correct their position and update their map, making their navigation more accurate.

The explorers also decide which places are most important to revisit or use as checkpoints, based on how uncertain their current position is. They plan their routes smartly, balancing between exploring new areas and revisiting known spots to improve accuracy.

All this happens without a central leader—each explorer communicates with nearby teammates, shares updates, and adjusts their paths accordingly. This way, they work together efficiently, even in the most complicated parts of the maze, avoiding getting lost or wasting time.

In essence, this system is like a team of smart, cooperative adventurers who use clever tricks to map and explore a mysterious environment quickly and reliably, even when communication is limited and the environment is unpredictable.

ELI14 Explained like you're 14

Imagine you and your friends are exploring a huge, complicated maze, but you don’t have a map or GPS to tell you where you are. Instead, each of you has a special camera that scans the walls and creates a unique 'fingerprint' of each spot you visit. Every time you pass through a new area, you take a picture and turn it into this fingerprint.

Now, if later you see a place that looks familiar, you can compare your fingerprint to the ones you’ve already made. If they match, you know you’ve been there before! This helps everyone figure out their exact location and avoid walking in circles.

But sometimes, different parts of the maze look very similar, so the system uses smart ways to decide which places are worth revisiting or using as checkpoints. It chooses the most useful spots to go back to, making sure you cover the maze efficiently and don’t waste time.

All of this happens without a boss telling everyone what to do. Instead, each person talks to their neighbors, shares what they’ve found, and adjusts their route on the fly. This teamwork makes exploring faster, more accurate, and less likely to get lost, even in the most confusing parts of the maze.

So, it’s like a group of clever explorers working together, using their special cameras and smart planning to map out a mysterious place quickly and safely, even when they can’t see the whole maze at once.

Abstract

Robust and efficient cooperative exploration with multiple unmanned ground vehicles (UGVs) in unknown, GPSdenied, and bandwidth-limited environments without prior maps remains challenging, as localization drift degrades map consistency and induces redundant coverage. This paper presents a fully distributed exploration framework that couples descriptoraided inter-UGV loop closure with loop-aware hierarchical planning while enabling autonomous localization and exploration. We develop a lightweight LiDAR global descriptor with range-image prealignment to enable robust cross-UGV place recognition under large yaw and lateral variations, and use verified loop closures to maintain globally consistent trajectories and a sparse topological representation. We further introduce an uncertainty-aware crossUGV loop-closure selection module that scores candidate loop closures under pose uncertainty and retains high-utility loop closures as planning anchors for global task allocation and local route refinement. Simulations and real-UGV experiments show that the loop-closure module achieves AR@1/AR@1% of 89.9%/95.5%, distributed optimization reduces absolute trajectory error, the system substantially reduces two-way communication volume, and the overall framework reduces exploration time and travel distance by 15% and 14%, respectively, compared with an mTSP baseline.

cs.RO

References (20)

DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm

Shipeng Zhong, Yuhua Qi, Zhiqiang Chen et al.

2022 89 citations View Analysis →

Swarm-LIO2: Decentralized Efficient LiDAR-Inertial Odometry for Aerial Swarm Systems

Fangcheng Zhu, Yunfan Ren, Longji Yin et al.

2025 25 citations

S3E: A Multi-Robot Multimodal Dataset for Collaborative SLAM

Dapeng Feng, Yuhua Qi, Shipeng Zhong et al.

2022 45 citations View Analysis →

Representation granularity enables time-efficient autonomous exploration in large, complex worlds

Chao Cao, H. Zhu, Z. Ren et al.

2023 87 citations

Distributed mapping with privacy and communication constraints: Lightweight algorithms and object-based models

Siddharth Choudhary, L. Carlone, Carlos Nieto-Granda et al.

2017 146 citations View Analysis →

Voxelized GICP for Fast and Accurate 3D Point Cloud Registration

Kenji Koide, Masashi Yokozuka, Shuji Oishi et al.

2020 374 citations

Are we ready for autonomous driving? The KITTI vision benchmark suite

Andreas Geiger, Philip Lenz, R. Urtasun

2012 15117 citations

A Novel Informative Autonomous Exploration Strategy With Uniform Sampling for Quadrotors

Xuetao Zhang, Yubin Chu, Yisha Liu et al.

2022 41 citations

LiDAR Iris for Loop-Closure Detection

Ying Wang, Zezhou Sun, Jian Yang et al.

2019 232 citations View Analysis →

Fast and Communication-Efficient Multi-UAV Exploration Via Voronoi Partition on Dynamic Topological Graph

Qianli Dong, Haobo Xi, Shiyong Zhang et al.

2024 23 citations View Analysis →

FAST-LIO2: Fast Direct LiDAR-Inertial Odometry

Wei Xu, Yixi Cai, Dongjiao He et al.

2021 1549 citations View Analysis →

Omni-Swarm: A Decentralized Omnidirectional Visual–Inertial–UWB State Estimation System for Aerial Swarms

Hao Xu, Yichen Zhang, Boyu Zhou et al.

2021 168 citations View Analysis →

Multi-Robot Active Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization

Ruofei Bai, Shenghai Yuan, Hongliang Guo et al.

2024 11 citations

A multi-robot cooperative exploration algorithm considering working efficiency and working load

Meng Zhao, Hui Lu, Shi Cheng et al.

2022 9 citations

RACER: Rapid Collaborative Exploration With a Decentralized Multi-UAV System

Boyu Zhou, Hao Xu, S. Shen

2022 232 citations View Analysis →

Autonomous Exploration Method for Fast Unknown Environment Mapping by Using UAV Equipped With Limited FOV Sensor

Yinghao Zhao, Li Yan, Hong Xie et al.

2023 74 citations View Analysis →

TARE: A Hierarchical Framework for Efficiently Exploring Complex 3D Environments

Chao Cao, H. Zhu, H. Choset et al.

2021 216 citations

Dual-Layer Path Planning With Pose SLAM for Autonomous Exploration in GPS-Denied Environments

Shi Zhang, Rongxin Cui, Weisheng Yan et al.

2024 23 citations

RING++: Roto-Translation Invariant Gram for Global Localization on a Sparse Scan Map

Xuecheng Xu, Shan Lu, Jun Wu et al.

2022 96 citations View Analysis →

MR-GMMExplore: Multi-Robot Exploration System in Unknown Environments based on Gaussian Mixture Model

Yichun Wu, Qiuyi Gu, Jincheng Yu et al.

2022 9 citations

A Distributed Multi-UGV Exploration Framework With Loop-Aware Planning and Descriptor-Aided Localization in Resource-Limited Environments

Key Findings

Methodology

Key Results

Significance

Technical Contribution

Novelty

Limitations

Future Work

AI Executive Summary

Deep Analysis

Background

Core Problem

Innovation

Methodology

Experiments

Results

Applications

Limitations & Outlook

Plain Language Accessible to non-experts

ELI14 Explained like you're 14

Abstract

References (20)

Related Papers

Increasing Resilience of Continuum Robots via Motion Planning Algorithms

ARC: Adaptive Robust Joint State and Covariance Estimation

Do as I Do: Dexterous Manipulation Data from Everyday Human Videos

Observability and Consistency Analysis for Visual-Inertial Navigation with Anchored Feature Parameterizations

Visual Verification Enables Inference-time Steering and Autonomous Policy Improvement

R2RDreamer: 3D-aware Data Augmentation for Spatially-generalized 2D Manipulation Policies