Hostname: page-component-68c7f8b79f-pksg9 Total loading time: 0 Render date: 2026-01-16T05:52:52.662Z Has data issue: false hasContentIssue false

A novel path planning algorithm based on synergistic bidirectional optimization

Published online by Cambridge University Press:  02 January 2026

Shixuan Qi
Affiliation:
College of Electronic and Optical Engineering & College of Flexible Electronics (Future Technology), Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
Haofei Lu
Affiliation:
Portland Institute, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
Ziyi Wang
Affiliation:
Portland Institute, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
Xin Gao
Affiliation:
College of Electronic and Optical Engineering & College of Flexible Electronics (Future Technology), Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
Siyao Chen
Affiliation:
College of Electronic and Optical Engineering & College of Flexible Electronics (Future Technology), Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
Yuanjian Liu
Affiliation:
College of Electronic and Optical Engineering & College of Flexible Electronics (Future Technology), Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
Weigang Wang*
Affiliation:
College of Electronic and Optical Engineering & College of Flexible Electronics (Future Technology), Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu, China
*
Corresponding author: Weigang Wang; Email: wangwg2020@163.com

Abstract

Path planning, as a critical component of mobile robotic systems, significantly impacts operational efficiency and energy consumption ratios. State-of-the-art algorithms often suffer from inadequate real-time adjustment capability, insufficient dynamic environment adaptation, and suboptimal computational efficiency. To resolve these limitations, we propose a bidirectionally optimized path planning algorithm named Bidirectional Q-learning LPA* (BQ-LPA*), which incorporates three key innovations. Specifically, to enhance the global search capability of the LPA* framework, we replace fixed heuristic functions with a Q-learning-driven adaptive heuristic mechanism, which improves path quality through dynamic heuristic weighting and update strategies. Additionally, to improve the convergence rate and sample efficiency of Q-learning in complex environments, we propose integrating the LPA* framework to provide prior knowledge guidance, which can effectively minimize redundant exploration attempts by informed pathfinding initialization. Moreover, the Q-learning method inherently faces dimensionality challenges in high-dimensional continuous spaces, which manifest as action space congestion, storage bottlenecks, and computational inefficiency. To mitigate these risks, we devise an LPA*-based space discretization strategy that can reduce action space dimensionality and preserve the path feasibility. Experimental results show that, compared with mainstream path planning algorithms, BQ-LPA* achieves higher accuracy and faster convergence in mobile robot path planning.

Information

Type
Research Article
Copyright
© The Author(s), 2026. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Zheng, J., Chen, T., He, J., Wang, Z. and Gao, B., “Review on security range perception methods and path-planning techniques for substation mobile robots,” Energies 17(16), 4106 (2024).10.3390/en17164106CrossRefGoogle Scholar
Wang, X., Duan, P., Meng, L. and Yang, K., “An improved iterated greedy algorithm for solving rescue robot path planning problem with limited survival time,” Comput. Mater. Contin. 80(1), (2024).10.32604/cmc.2024.050612CrossRefGoogle Scholar
Dai, Y., Li, S., Chen, X., Nie, X., Rui, X. and Zhang, Q., “Three-dimensional truss path planning of cellular robots based on improved sparrow algorithm,” Robotica 42(2), 347366 (2024).10.1017/S0263574723001480CrossRefGoogle Scholar
Ayawli, B. B. K., Dawson, J. K., Badu, E., Ayawli, I. E. B. and Lamusah, D., “Comparative analysis of popular mobile robot roadmap path-planning methods,” Robotica, 124 (2025).10.1017/S0263574725000281CrossRefGoogle Scholar
Zhu, L., Zhou, G., Zhou, Y., Luo, Q., Huang, H. and Wei, X., “Self-adaptive differential evolution-based coati optimization algorithm for multi-robot path planning,” Robotica 43(3), 959996 (2025).10.1017/S0263574725000049CrossRefGoogle Scholar
Liu, S., Liu, S. and Xiao, H., “Improved gray wolf optimization algorithm integrating a* algorithm for path planning of mobile charging robots,” Robotica 42(2), 536559 (2024).10.1017/S0263574723001625CrossRefGoogle Scholar
Zhang, Q., Ning, X., Li, Y., Pan, L., Gao, R. and Zhang, L., “Path planning of patrol robot based on modified grey wolf optimizer,” Robotica 41(7), 19471975 (2023).10.1017/S0263574723000231CrossRefGoogle Scholar
Luo, Y., Lin, S., Wang, Y. and Liang, K. Path planning for mobile robots in complex environments based on enhanced sparrow search algorithm and dynamic window approach,” Robotica, 124 (2025).10.1017/S026357472500061XCrossRefGoogle Scholar
Qadir, Z., Ullah, F., Munawar, H. S. and Al-Turjman, F., “Addressing disasters in smart cities through uavs path planning and 5g communications: A systematic review,” Comput. Commun. 168, 114135 (2021).10.1016/j.comcom.2021.01.003CrossRefGoogle Scholar
Aradi, S., “Survey of deep reinforcement learning for motion planning of autonomous vehicles,” IEEE Trans. Intell. Transp. 23(2), 740759 (2020).10.1109/TITS.2020.3024655CrossRefGoogle Scholar
Wu, F., Tang, W., Zhou, Y., Lin, S.-W., Ding, Z. and Liu, Y., “Robust motion planning for mobile robots under attacks against obstacle localization,” Robotica 42(8), 27812800 (2024).10.1017/S0263574724001115CrossRefGoogle Scholar
Zhang, H.-y., Lin, W.-m. and Chen, A.-x., “Path planning for the mobile robot: A review,” Symmetry 10(10), 450 (2018).10.3390/sym10100450CrossRefGoogle Scholar
Raja, P. and Pugazhenthi, S., “Optimal path planning of mobile robots: A review,” Int. J. Phys. Sci. 7(9), 13141320 (2012).10.5897/IJPS11.1745CrossRefGoogle Scholar
Sánchez-Ibáñez, J. R., Pulgar, C. J. P.-D. and García-Cerezo, A., “Path planning for autonomous mobile robots: A review,” Sensors 21(23), 7898 (2021).10.3390/s21237898CrossRefGoogle ScholarPubMed
Liao, T., Chen, F., Wu, Y., Zeng, H., Ouyang, S. and Guan, J., “Research on path planning with the integration of adaptive a-star algorithm and improved dynamic window approach,” Electronics 13(2), 455 (2024).10.3390/electronics13020455CrossRefGoogle Scholar
Yu, J., Chen, C., Arab, A., Yi, J., Pei, X. and Guo, X., “Rdt-rrt: Real-time double-tree rapidly-exploring random tree path planning for autonomous vehicles,” Expert Syst. Appl. 240, 122510 (2024).10.1016/j.eswa.2023.122510CrossRefGoogle Scholar
Adam, M. S., Nordin, R., Abdullah, N. F., Abu-Samah, A., Amodu, O. A. and Alsharif, M. H., “Optimizing disaster response through efficient path planning of mobile aerial base station with genetic algorithm,” Drones 8(6), 272 (2024).10.3390/drones8060272CrossRefGoogle Scholar
Hu, L. and Cai, C., “Arime-dwa: Robot path planning utilizing advanced rime optimization and dynamic windows approach,” IEEE Access (2024).10.1109/ACCESS.2024.3514320CrossRefGoogle Scholar
Wang, Y., Lu, C., Wu, P. and Zhang, X., “Path planning for unmanned surface vehicle based on improved q-learning algorithm,” Ocean Eng. 292, 116510 (2024).10.1016/j.oceaneng.2023.116510CrossRefGoogle Scholar
Bo, L., Zhang, T., Zhang, H., Yang, J., Zhang, Z., Zhang, C. and Liu, M., “Improved q-learning algorithm based on flower pollination algorithm and tabulation method for unmanned aerial vehicle path planning,” IEEE Access (2024).10.1109/ACCESS.2024.3434621CrossRefGoogle Scholar
Jones, M. R., Djahel, S. and Welsh, K., “An efficient and rapidly adaptable lightweight multi-destination urban path planning approach for uavs using q-learning,” IEEE Trans. Intell. Veh. (2024).10.1109/TIV.2024.3387018CrossRefGoogle Scholar
Wang, C., Yang, X. and Li, H., “Improved q-learning applied to dynamic obstacle avoidance and path planning,” IEEE Access 10, 9287992888 (2022).10.1109/ACCESS.2022.3203072CrossRefGoogle Scholar
Xiang, Z., Sun, H. and Zhang, J., “Application of improved q-learning algorithm in dynamic path planning for aircraft at airports,” IEEE Access 11, 107892107905 (2023).10.1109/ACCESS.2023.3321196CrossRefGoogle Scholar
Watkins, C. J. and Dayan, P., “Q-learning,” Mach. Learn. 8, 279292 (1992).Google Scholar
Li, Y., Wang, H., Fan, J. and Geng, Y., “A novel q-learning algorithm based on improved whale optimization algorithm for path planning,” PLoS One 17(12), e0279438 (2022).10.1371/journal.pone.0279438CrossRefGoogle ScholarPubMed
Koenig, S., Likhachev, M. and Furcy, D., “Lifelong planning a* ,” Artif. Intell. 155(1-2), 93146 (2004).10.1016/j.artint.2003.12.001CrossRefGoogle Scholar
Khriji, L., Touati, F., Benhmed, K. and Al-Yahmedi, A., “Mobile robot navigation based on q-learning technique,” Int. J. Adv. Robot. Syst. 8(1), 4 (2011).10.5772/10528CrossRefGoogle Scholar
Hidalgo-Paniagua, A., Vega-Rodríguez, M. A., Ferruz, J. and Pavón, N., “Solving the multi-objective path planning problem in mobile robotics with a firefly-based approach,” Soft Comput. 21(4), 949964 (2017).10.1007/s00500-015-1825-zCrossRefGoogle Scholar