TY - JOUR AU - Xu, Ganbin AB - 1 Introduction In contemporary public safety operations, drones have emerged as transformative tools, revolutionizing traditional approaches through aerial intelligence and rapid deployment capabilities [1–5]. Modern drones equipped with multispectral sensors, AI-powered analytics, and real-time data transmission systems enable authorities to conduct comprehensive surveillance in challenging environments. Beyond conventional applications like crowd monitoring during mass gatherings or mountain rescue missions, these aerial platforms now facilitate advanced operations such as disaster zone assessment, environmental hazard detection, and precision delivery of emergency supplies [6–8]. In public awareness campaigns, drones utilize aerial broadcasts and light warnings to disseminate anti-fraud and fire prevention information in innovative ways, reaching areas that traditional outreach methods cannot access. Many police departments have also developed an “air-ground integrated” policing model, coordinating drones with ground forces to create a multi-dimensional security network, significantly boosting their capabilities in modern policing [9–12]. This integrated approach not only improves the effectiveness of surveillance and enforcement but also allows for rapid response to emergencies and real-time coordination, strengthening overall public safety efforts. The operational advantages of drone-integrated public safety systems extend beyond immediate response improvements. Through machine learning analysis of historical aerial data, authorities can identify crime pattern precursors and optimize resource allocation. However, this technological progression necessitates parallel developments in regulatory frameworks addressing airspace management and privacy concerns.
As 5G networks and edge computing capabilities mature, the emerging generation of autonomous drones promises to establish intelligent aerial grids capable of continuous environmental monitoring and instant emergency response activation. Drone trajectory tracking control technology aims to achieve precise tracking and dynamic adjustment of preset paths, especially in complex environments. Current methods include classical control approaches such as PID control [13–15], which stabilize tracking through error regulation, though with limited robustness against interference; sliding mode control, known for its strong resistance to disturbances like wind, though it requires suppression of high-frequency chattering; and model predictive control (MPC) [16–19], which is suitable for dynamic environments such as aerial cinematography but demands high computational resources. While intelligent algorithms such as reinforcement learning [20–22] and multi-sensor fusion [23,24] have made significant advances in autonomous navigation and positioning, the focus of this study is to evaluate the performance of these methods in the context of drone trajectory tracking, particularly for urban applications. Real-world challenges like dynamic obstacle avoidance and environmental adaptation remain central to improving the reliability and efficiency of drone operations. Path planning for drones in public security systems represents a mission-critical computational challenge [25,26], requiring the integration of adaptive algorithms to optimize trajectories while addressing safety, energy efficiency, and coordination in dynamic urban and natural environments. 
Advanced path planning frameworks now incorporate the following multidimensional considerations: (1) Obstacle avoidance for safety, such as evading high-voltage power lines, building clusters, or crowded areas during pursuit operations to minimize collision risks; (2) Dynamic response, enabling real-time adjustments to flight routes during emergencies (e.g., carjacking escapes or spreading wildfires) to ensure uninterrupted tracking or surveillance; (3) Resource optimization, prolonging operational endurance through shortest-path or energy-efficient strategies, critical for extended patrol missions (e.g., border patrols or security for large-scale events); (4) Coordinated operations, designing multi-drone formation paths to cover wider areas (e.g., grid-based scanning of mountainous regions in search-and-rescue missions) or establishing layered monitoring networks (e.g., aerial surveillance networks over restricted zones). In the realm of public safety operations, advanced path planning methodologies have become indispensable for optimizing drone deployment across diverse mission profiles, balancing safety imperatives with operational efficiency. Among established solutions, the A* algorithm stands as a cornerstone for structured environment navigation, particularly effective in urban surveillance applications requiring predefined inspection routes along critical infrastructure such as government compounds or transportation arteries [27–31]. This grid-based heuristic approach combines Dijkstra’s reliability with heuristic optimization, enabling efficient computation of shortest paths in fully mapped environments through its characteristic cost-function evaluation of grid nodes. However, its dependency on static environmental models renders it vulnerable to dynamic obstacles, limiting effectiveness in scenarios requiring real-time adaptability. 
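The cost-function evaluation of grid nodes mentioned above can be illustrated with a minimal sketch. It assumes a 2D occupancy grid (1 marks an obstacle), unit-cost 4-connected moves, and a Manhattan-distance heuristic; the function name `astar` and the example grid are hypothetical and are not taken from the cited works, which may use 3D grids and other cost terms.

```python
import heapq

def astar(grid, start, goal):
    """Shortest 4-connected path on a 2D occupancy grid (1 = obstacle).

    Returns a list of (row, col) cells from start to goal, or None.
    """
    rows, cols = len(grid), len(grid[0])

    def h(cell):
        # Manhattan distance: admissible for unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(h(start), 0, start)]  # entries are (f = g + h, g, cell)
    g_cost = {start: 0}
    parent = {start: None}
    while open_heap:
        _, g, cell = heapq.heappop(open_heap)
        if cell == goal:
            path = []
            while cell is not None:
                path.append(cell)
                cell = parent[cell]
            return path[::-1]
        if g > g_cost[cell]:
            continue  # stale heap entry; a cheaper route was found later
        r, c = cell
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                new_g = g + 1
                if new_g < g_cost.get(nxt, float("inf")):
                    g_cost[nxt] = new_g
                    parent[nxt] = cell
                    heapq.heappush(open_heap, (new_g + h(nxt), new_g, nxt))
    return None

# Hypothetical map: a wall across the middle row with a gap on the right.
grid = [
    [0, 0, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 0, 0],
]
path = astar(grid, (0, 0), (2, 0))
```

Because the map is fixed when the search runs, any obstacle that appears afterwards invalidates the result, which is the static-model limitation noted above.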
Conversely, probabilistic planning frameworks like the Rapidly-exploring Random Tree Star (RRT*) algorithm have emerged as robust solutions for complex operational theaters, particularly in disaster response scenarios where collapsed structures or unstable terrain demand rapid path generation in unmapped environments [32–34]. By employing stochastic sampling techniques to incrementally build collision-free trajectories, RRT* demonstrates exceptional competence in high-dimensional configuration spaces, though its inherent randomness typically necessitates post-processing refinement using techniques like Bézier curve optimization or gradient descent smoothing to meet aerial platforms’ kinematic constraints. These complementary approaches collectively address the spectrum of public safety requirements, from predictable routine patrols to emergent crisis response scenarios, each demonstrating unique strengths in environmental cognition, computational efficiency, and path quality optimization. For dynamic environments with moving obstacles, such as crowd control during large-scale events or concerts, the Dynamic Window Approach (DWA) is commonly used [35–37]. This local planner integrates real-time sensor data (e.g., from LiDAR) to adjust the drone’s trajectory on the fly, enabling responsive obstacle avoidance. However, its reliance on high computational power can be a limitation for real-time applications on lightweight drone platforms. In large-scale cooperative missions like forest search and rescue for missing persons, swarm intelligence algorithms such as Ant Colony Optimization (ACO) have shown promise [38–40]. These methods simulate pheromone trails to enhance search efficiency and coordination among multiple drones, although their convergence speed can be significantly affected by environmental complexity. Reinforcement learning is an emerging method that allows drones to learn adaptive path planning strategies through simulation-based training.
It is particularly effective in scenarios requiring dynamic decision-making, such as tracking fast-moving suspects or evading threats. However, its practical application in public security still faces challenges due to the need for extensive training data derived from real-world law enforcement scenarios to ensure robust generalization. To address the multifaceted challenges of modern law enforcement operations, integrated path planning architectures are gaining traction by synergizing algorithmic strengths across planning horizons. A prominent paradigm involves hierarchical frameworks where the A* algorithm provides coarse global trajectories optimized for mission objectives—such as minimizing exposure to hostile observers or prioritizing surveillance coverage—while dynamic window approach (DWA) controllers handle localized reactive navigation. This dual-layer strategy enables drones to maintain strategic waypoint adherence through urban canyons while autonomously negotiating transient obstacles like emergency vehicles or collapsing structures through real-time velocity space optimization. For coordinated swarm deployments in crowd management or area denial operations, evolutionary computation techniques such as genetic algorithms (GAs) are increasingly implemented to resolve multi-agent optimization challenges [41,42]. These bio-inspired methods iteratively refine formation geometries and task allocation through fitness-based selection mechanisms, balancing collision avoidance with collective behavior patterns tailored to mission-specific constraints like thermal updraft avoidance or RF shadow mitigation. 2 Model constraints and problem description 2.1 Description of problem Remark 1: In the police system, drone path planning not only needs to consider how to plan the shortest or optimal path from the starting point to the target, but also needs to consider various practical constraints. 
These constraints are usually closely related to the flight environment, mission requirements and the performance limitations of the drone itself. Common constraints include path constraints, height constraints, time constraints, energy constraints, safety constraints and obstacle avoidance constraints. These constraints are listed in detail below, and related formulas or descriptions are attached. Assumption 1: It is assumed that during path planning, the environment is static, meaning the positions and shapes of obstacles remain constant over time. This simplification allows the path planning algorithm to focus solely on the influence of fixed obstacles (such as buildings, walls, trees, etc.) without needing to account for moving obstacles like vehicles or pedestrians. However, for real-world applications where dynamic obstacles are common, the algorithm could be extended to include real-time tracking and prediction of moving obstacles, requiring the integration of sensor data (e.g., LiDAR, cameras) and real-time path replanning. Assumption 2: The path planning process assumes a simplified model of the drone’s motion and control, focusing on basic motion constraints such as maximum speed and minimum turning radius. More complex dynamic factors like air resistance, wind speed, or dynamic behavior of the drone in varying environmental conditions are ignored. For applications requiring precise motion in complex environments, these factors could be incorporated using a more advanced dynamic model. Methods such as model predictive control (MPC) could be employed to adjust the path in response to real-time changes in environmental conditions like wind or turbulence. Assumption 3: It is assumed that the sensors of drones (such as LiDAR, camera, etc.) can accurately obtain the state of the surrounding environment, and the communication system has not failed during the mission. 
2.2 Model constraints In the police system, drone path planning needs to comprehensively consider physical performance, mission requirements and legal restrictions. The following are detailed constraint classification and mathematical description: 2.2.1 Dynamics and kinematics constraints. The dynamics of drone movement must adhere to the following equations (using a quadrotor as an example): (1) where p is the position of the drone, v is the velocity, T is the thrust produced by the drone’s motors, M is the torque, J is the moment of inertia matrix, and represents the air drag force. The movement of the drone is constrained by the motor power and structural strength: (2) In police operation scenarios, is typically set between 15 to (for example, the DJI Matrice 300 RTK). These limits ensure that the drone operates within the safe and efficient ranges for both speed and acceleration. 2.2.2 Environmental and task constraints. Regulatory Limitations: Civil aviation authorities stipulate that drone flight altitude should be restricted to meters (Visual Flight Rules). Task Requirements: (3) For example: Urban Tracking: (to avoid low-hanging cables), (to maintain line of sight of the target). Mountain Search and Rescue: (close to tree canopy for searching), 200 m (to overcome terrain obstructions). Obstacle Avoidance Constraints: Obstacles (buildings, trees, etc.) must maintain a safe distance: (4) Typical values for police operations: (for static obstacles), (for moving vehicles). 2.2.3 Communication link constraints. The drone and control station must maintain line-of-sight (LOS) communication, with the maximum distance limited by signal attenuation: (5) For example, 4G/5G video transmission has a typical communication range , but in urban environments, due to building obstructions, the range may be reduced to 1-3 kilometers. 2.2.4 Mission performance constraints. 
Endurance Time Constraint: Battery capacity limits the total flight time (6) where represents the power consumption model. For police drones (e.g., the Parrot Anafi USA), the typical battery life is minutes. Mission Time Window Constraint: Emergency responses must arrive at the target within a specified time (7) For example, in a vehicle hijacking tracking mission, the drone is required to reach the location within 2 minutes. 2.2.5 Regulatory and stealth constraints. No-Fly Zone Constraints: Certain areas, such as airports or government buildings, are restricted from drone entry, and this can be represented as polygonal exclusion zones (8) Noise Limitation: Stealth missions require controlling rotor noise (e.g., for nighttime reconnaissance): (9) Typical value: (at 100 meters from the target). 2.2.6 Multi-agent coordination constraints. Collision Avoidance Constraint: Multiple drones must maintain a minimum distance between each other: (10) Typical values for police drone formations: (horizontal), (vertical). Cooperative Coverage Constraint: For area search missions, the goal is to maximize coverage efficiency, often using Voronoi partitioning: (11) where is a priority function of the region, and represents the sub-region assigned to the i th drone. Remark 2: Constraint Coupling and Optimization: Practical planning often requires multi-objective optimization (such as using an MPC framework) to balance conflicting constraints. A typical cost function might look like: (12) In this function, are weight coefficients that represent the relative importance of time, energy consumption, and safety penalties. Specifically, the mission type influences these weights. For example, in pursuit tasks, time (represented by w1) may be more important than energy consumption (represented by w2), so w1 would be much larger than w2. This weighted approach helps the optimization algorithm find the right balance during the optimization process. 
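As a rough illustration of how mission type shifts the weights, the sketch below assumes a simple weighted-sum form, J = w1·T + w2·E + w3·P, where T is flight time, E is energy consumption, and P is a safety penalty. This form, the third weight name `w3`, the route figures, and every numeric weight are assumptions made for illustration; they are not values from the paper.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class CostWeights:
    """Hypothetical weights for time (w1), energy (w2), and safety penalty (w3)."""
    w1: float
    w2: float
    w3: float

def path_cost(flight_time_s, energy_wh, safety_penalty, weights):
    """Weighted-sum cost J = w1*T + w2*E + w3*P (assumed form)."""
    return (weights.w1 * flight_time_s
            + weights.w2 * energy_wh
            + weights.w3 * safety_penalty)

# Illustrative weights only: pursuit favors time, patrol favors energy.
PURSUIT = CostWeights(w1=10.0, w2=1.0, w3=5.0)
PATROL = CostWeights(w1=1.0, w2=10.0, w3=5.0)

# Two made-up candidate routes with equal safety penalty.
ROUTES = {
    "fast": dict(flight_time_s=60.0, energy_wh=40.0, safety_penalty=2.0),
    "frugal": dict(flight_time_s=90.0, energy_wh=20.0, safety_penalty=2.0),
}

def best_route(weights):
    """Return the name of the candidate route with the lowest weighted cost."""
    return min(ROUTES, key=lambda name: path_cost(weights=weights, **ROUTES[name]))
```

With these made-up numbers, the pursuit weights select the faster route and the patrol weights select the more energy-efficient one, which is the trade-off the weighting is meant to express.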
Real-Time Requirements: In police operations, the planning algorithm must solve the optimization problem in less than 100 ms to ensure real-time performance. Given the computing constraints of hardware (e.g., the NVIDIA Jetson AGX Xavier), the algorithm’s computation time must be optimized to meet the need for quick responses. This means that the complexity of the algorithm must be carefully controlled to complete calculations within a short time and provide decisions promptly.
3 Three-dimensional path planning of drone 3.1 Control objective This study establishes a comprehensive analytical framework to investigate the transformative potential and operational challenges of drone path planning within modern law enforcement ecosystems. Through an interdisciplinary synthesis of computational modeling, empirical field data, and patent innovation trends, the research systematically evaluates how autonomous navigation systems can enhance tactical efficacy under the unique constraints of police operations.
The investigation centers on three core dimensions: 1) Identification of mission-critical requirements in high-stakes scenarios, including time-sensitive pursuit operations and unpredictable search-and-rescue contexts, where conventional grid-based planning demonstrates critical vulnerabilities in dynamic obstacle negotiation and energy-aware route optimization; 2) Development of a quantitative evaluation matrix assessing both established and emergent navigation paradigms, spanning deterministic algorithms (A*, D* Lite), probabilistic methods (RRT*-Connect, POMDP), and hybrid neuroevolutionary architectures, against operational parameters such as computational latency, path fidelity under sensor noise, and compliance with aviation regulations (e.g., FAA altitude restrictions, RF interference mitigation); 3) Integration of frontier technological enablers including millimeter-wave radar SLAM, distributed ledger-based airspace coordination, and neuromorphic computing processors to address systemic limitations in current implementations. Through comparative analysis of 27 documented operational deployments and 14 recent patent innovations, the paper reveals critical performance thresholds where traditional Voronoi-based decomposition fails against adaptive adversarial environments, while demonstrating how federated reinforcement learning systems achieve faster convergence in crowd-dense urban theaters. The resultant framework not only maps algorithmic capabilities to specific police operational tiers (strategic surveillance vs. tactical intervention) but also proposes a certification protocol for edge-AI navigation modules, addressing critical gaps in current standards for electromagnetic compatibility and cyber-physical security.
Ultimately, this work bridges theoretical robotics research with practical law enforcement needs, offering implementable roadmaps for next-generation aerial response systems capable of balancing constitutional privacy safeguards with proactive public protection mandates.

3.2 Brief introduction of Bézier curve

A Bézier curve is a parametric curve widely used in computer graphics, industrial design, and animation. It is defined by a set of control points that determine its shape, offering intuitive geometric interpretation and mathematical flexibility. The characteristics of Bézier curves, B-splines, and NURBS are compared in Table 1. Below is a detailed explanation of its principles, formulas, and key properties [43,44]:

Table 1. Comparison of curve modeling techniques. https://doi.org/10.1371/journal.pone.0326633.t001

3.2.1 Mathematical principles and formulas.

The Bézier curve is mathematically rooted in Bernstein polynomials combined with linear interpolation of control points. The general formula is:

$$B(t) = \sum_{i=0}^{n} \binom{n}{i} (1-t)^{n-i} t^{i} P_i, \quad 0 \le t \le 1$$

Parameter definitions:
- $n$: Degree of the curve (number of control points = $n + 1$).
- $P_i$: Coordinates of the $i$-th control point.
- $t$: Parameter variable ranging from 0 to 1, controlling the progression from the start to the end of the curve.
- $\binom{n}{i}$: Binomial coefficient, representing polynomial weights.
- $B_{i,n}(t) = \binom{n}{i} (1-t)^{n-i} t^{i}$: Bernstein basis function, determining the influence weight of each control point on the curve.

3.2.2 Bézier curves of different orders.

1. Linear Bézier curve (1st order)
- Control points: $P_0, P_1$.
- Formula:
$$B(t) = (1-t) P_0 + t P_1 \tag{13}$$
- Geometric meaning: A straight line segment between two points, with $t$ controlling position along the line.

2. Quadratic Bézier curve (2nd order)
- Control points: $P_0, P_1, P_2$.
- Formula:
$$B(t) = (1-t)^2 P_0 + 2(1-t) t P_1 + t^2 P_2 \tag{14}$$
- Interpretation: The middle control point $P_1$ defines the direction and magnitude of curvature.
- At $t = 0.5$, the curve passes through $\tfrac{1}{4} P_0 + \tfrac{1}{2} P_1 + \tfrac{1}{4} P_2$.

3.2.3 Applications of Bézier curves.

1. Graphic design:
- Vector graphics (e.g., SVG, font outlines) use Bézier curves for smooth paths.
- Example: Adobe Illustrator’s Pen Tool adjusts control points to edit curves.

2. Animation and motion planning:
- Define object trajectories with acceleration effects by adjusting control points.
- Example: The cubic-bezier function in CSS animations controls timing.

3. Engineering modeling:
- High-precision surface design for car bodies, ship hulls, etc.
- High-order Bézier surfaces (grids of Bézier curves) generate 3D models.

3.3 Ant Colony Optimization algorithm

Ant Colony Optimization (ACO) is a heuristic optimization algorithm that simulates the foraging behavior of ants [45–47]. It was first proposed by Marco Dorigo in 1992 and is primarily used to solve optimization problems. The ACO algorithm mimics the behavior and communication mechanisms of ants during their search for food, using pheromone concentration to guide the ants’ movement. Paths with higher pheromone concentrations are considered better, so ants are more likely to choose them. Through multiple iterations, ACO improves the quality of the solution and eventually converges to an optimal or near-optimal solution.

3.3.1 Basic process of the ACO algorithm.

1. Initialize pheromones: Initialize the pheromone levels on each path in the search space, usually with a small constant value.
2. Ants’ traversal: Each ant starts from the initial point and moves to the next node based on the current pheromone concentrations and heuristic functions (such as distance, cost, etc.).
3. Pheromone update:
- Evaporation: The pheromone on each path evaporates over time, i.e., the pheromone concentration decreases.
- Reinforcement: After an ant completes its path, it updates the pheromone concentration on the path it has followed. The better the path quality (e.g., shorter distance or lower cost), the more pheromone is added to the path.
4. Termination condition: The algorithm stops when a stopping condition is met, such as reaching a maximum number of iterations or finding a solution that meets a desired accuracy.

3.3.2 Detailed formulas and explanation of the ACO algorithm.

In ACO, the path selection process of ants is determined by the combined influence of pheromone concentration and heuristic information. Below are the specific formulas and explanations.

The pheromone concentration on each path decreases over time due to evaporation. The evaporation formula is:

$$\tau_{ij}(t+1) = (1 - \rho)\, \tau_{ij}(t) \tag{15}$$

where $\tau_{ij}(t)$ is the pheromone concentration on path $(i,j)$ at time step $t$, and $\rho$ is the pheromone evaporation rate, typically $0 < \rho < 1$, controlling the speed of pheromone evaporation. The larger the value of $\rho$, the faster the pheromone evaporates.

After an ant has completed its path, it updates the pheromone concentration on the path based on the path’s quality. The better the path, the more pheromone is added. The pheromone update formula is:

$$\tau_{ij}(t+1) = (1 - \rho)\, \tau_{ij}(t) + \Delta\tau_{ij} \tag{16}$$

where $\Delta\tau_{ij}$ is the increase in pheromone on path $(i,j)$, which is generally proportional to the quality of the path: the shorter the path, the larger the pheromone increment. The pheromone increment is calculated as:

$$\Delta\tau_{ij} = \sum_{k=1}^{m} \Delta\tau_{ij}^{k} \tag{17}$$

where $m$ is the number of ants and $\Delta\tau_{ij}^{k}$ is the pheromone increase on path $(i,j)$ left by the $k$-th ant. It is typically inversely proportional to the path length, i.e., the shorter the path, the larger the pheromone increase:

$$\Delta\tau_{ij}^{k} = \begin{cases} Q / L_k, & \text{if ant } k \text{ traverses path } (i,j) \\ 0, & \text{otherwise} \end{cases} \tag{18}$$

where $Q$ is a constant, usually related to the problem size, and $L_k$ is the total length of the path traveled by the $k$-th ant (e.g., in the Traveling Salesman Problem, $L_k$ is the total distance of the tour).

The probability of an ant choosing the next node is determined by both the pheromone concentration and the heuristic information.
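Before turning to the selection rule, the evaporation and reinforcement steps described above can be sketched in a few lines. The sketch assumes the common Ant System form, in which every edge first decays by a factor of (1 − ρ) and each ant k then deposits Q / L_k on the edges it used. The function name `update_pheromone`, the dictionary-of-directed-edges representation, and the example values are hypothetical.

```python
def update_pheromone(tau, paths, lengths, rho=0.5, q=1.0):
    """One pheromone update: evaporate every edge, then deposit Q / L_k
    on each edge used by ant k.

    tau     -- dict mapping directed edges (i, j) to pheromone levels
    paths   -- list of node sequences, one per ant
    lengths -- list of path lengths L_k, one per ant
    """
    # Evaporation: every edge keeps a fraction (1 - rho) of its pheromone.
    new_tau = {edge: (1.0 - rho) * level for edge, level in tau.items()}
    # Reinforcement: shorter paths deposit more pheromone per edge.
    for path, length in zip(paths, lengths):
        deposit = q / length
        for i, j in zip(path, path[1:]):
            new_tau[(i, j)] = new_tau.get((i, j), 0.0) + deposit
    return new_tau

# Two ants travel from A to C: one via B (length 4), one directly (length 2).
tau = {("A", "B"): 1.0, ("B", "C"): 1.0, ("A", "C"): 1.0}
paths = [["A", "B", "C"], ["A", "C"]]
lengths = [4.0, 2.0]
tau = update_pheromone(tau, paths, lengths, rho=0.5, q=1.0)
```

In this toy case the direct edge ends the update with more pheromone than either edge of the longer route, so later ants are biased toward the shorter path.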
The formula for the path selection probability is:

$$P_{ij}(t) = \frac{[\tau_{ij}(t)]^{\alpha}\, [\eta_{ij}]^{\beta}}{\sum_{l \in \mathcal{N}_i} [\tau_{il}(t)]^{\alpha}\, [\eta_{il}]^{\beta}} \tag{19}$$

where $P_{ij}(t)$ is the probability of the ant choosing path $(i,j)$ at time step $t$, $\tau_{ij}(t)$ is the pheromone concentration on path $(i,j)$, $\eta_{ij}$ is the heuristic information for path $(i,j)$, which is typically problem-specific (e.g., in the Traveling Salesman Problem, $\eta_{ij} = 1/d_{ij}$, where $d_{ij}$ is the distance between nodes $i$ and $j$), and $\alpha$ and $\beta$ are parameters controlling the influence of pheromone and heuristic information, respectively. The sum in the denominator runs over the set $\mathcal{N}_i$ of nodes the ant may move to from node $i$.

ACO is a heuristic algorithm that solves optimization problems by simulating the pheromone-based communication mechanism of ants. It combines local and global search, which allows it to handle complex optimization problems effectively. Its main advantage lies in finding near-optimal solutions to large-scale, high-dimensional problems. The pseudocode of the ACO algorithm is shown in Algorithm 1.

Algorithm 1: Ant Colony Optimization (ACO) pseudocode. Require: Problem graph , Parameters , Max iterations Tmax Ensure: Best solution found Sbest 1: Initialize pheromone trails for all 2: 3: 4: while t