RLG | Publications

Publications

Journal Papers

An Improved Stable Fine-Tuning framework for offline-to-online reinforcement learning. Zhao, K., Li, Y., & Lin, K. Computers and Electrical Engineering, 2025.
CBTMP: Optimizing Multi-Agent Path Finding in Heterogeneous Cooperative Environments. Gao, J., Li, Y., Mu, Y., Liu, Q., Chen, H., & Lou, Y. IEEE Robotics and Automation Letters, 2025.
Reinforcement learning-based optimal control for Markov jump systems with completely unknown dynamics. Shi, X., Li, Y., Du, C., Chen, C., Zong, G., & Gui, W. Automatica, 2025.
Filter-Based Fully Distributed Output Regulation of Heterogeneous Learning Agents. Shi, X., Li, Y., Du, C., Chen, C., Hua, C., & Gui, W. IEEE Transactions on Circuits and Systems I: Regular Papers, 2025.
Fully Distributed Event-Triggered Control of Nonlinear Multiagent Systems Under Directed Graphs: A Model-Free DRL Approach. Shi, X., Li, Y., Du, C., Shi, Y., Yang, C., & Gui, W. IEEE Transactions on Automatic Control, 2025.
Distributional Policy Gradient With Distributional Value Function. Liu, Q., Li, Y., Shi, X., Lin, K., Liu, Y. & Lou, Y. IEEE Transactions on Neural Networks and Learning Systems, 2025.
Safe Reinforcement Learning in Autonomous Driving With Epistemic Uncertainty Estimation Zhang, Z., Liu, Q., Li, Y., Lin, K. & Li, L. IEEE Transactions on Intelligent Transportation Systems, 2024.
PCE: Multi-Agent Path Finding via Priority-Aware Communication & Experience Learning. Gao, J., Li, Y., Ye Z. & Wu, X. IEEE Transactions on Intelligent Vehicles, 2024.
Learning Agile Quadrotor Flight in Restricted Environments with Safety Guarantees. Chen, S., Li, Y., Lou, Y., Lin, K. & Wu, X. IEEE Transactions on Intelligent Vehicles, 2024.
A Time-Aggregated Model-Free RL Algorithm for Optimal Containment Control of MASs. Shi, X., Li, Y., Du, C., & Gui, W. IEEE Transactions on Circuits and Systems II: Express Briefs, 2024.
Almost Surely Safe Exploration and Exploitation for Deep Reinforcement Learning with State Safety Estimation. Lin, K., Li, Y., Liu, Q., Li, D., Shi, X., & Chen, S. Information Sciences, 2024.
Data Efficient Deep Reinforcement Learning with Action-ranked Temporal Difference Learning. Liu, Q., Li, Y., Liu, Y. & Lin, K. IEEE Transactions on Emerging Topics in Computational Intelligence, 2024.
Distributional reinforcement learning with epistemic and aleatoric uncertainty estimation. Liu, Q., Li, Y., Chen, S., Lin, K., Shi, X., & Lou, Y. Information Sciences, 2023.
FHCPL: An Intelligent Fixed-Horizon Constrained Policy Learning System for Risk-Sensitive Industrial Scenario. Lin K., Li, D., Li, Y., Chen, S., & Wu, X. IEEE Transactions on Industrial Informatics, 2023.
Optimal Lateral Path-Tracking Control of Vehicles With Partial Unknown Dynamics Via DPG-Based Reinforcement Learning Methods. Shi, X., Li, Y., Hu, W., Du, C., Chen, C., & Gui, W. IEEE Transactions on Intelligent Vehicles, 2023.
A review of graph-based multi-agent pathfinding solvers: From classical to beyond classical. Gao, J., Li, Y., Li, X., Yan, K. and Lin, K., & Wu, X. Knowledge-Based Systems, 2023.
Motion Planner with Fixed-Horizon Constrained Reinforcement Learning for Complex Autonomous Driving Scenarios. Lin, K., Li, Y., Chen, S., Li, D., & Wu, X. IEEE Transactions on Intelligent Vehicles, 2023.
TAG: Teacher-Advice Mechanism With Gaussian Process for Reinforcement Learning. Lin, K., Li, D., Li, Y., Chen, S., Liu, Q., Gao, J., Jin, Y., & Gong, L. IEEE Transactions on Neural Networks and Learning Systems, 2023.
A fully distributed adaptive event-triggered control for output regulation of multi-agent systems with directed network. Shi, X., Li, Y., Liu, Q., Lin, K., & Chen, S. Information Sciences, 2023.
Learning Real-Time Dynamic Responsive Gap-Traversing Policy for Quadrotors with Safety-Aware Exploration. Chen, S., Li, Y., Lou, Y., Lin, K., & Wu, X. IEEE Transactions on Intelligent Vehicles, 2022.
A Two-Objective ILP Model of OP-MATSP for the Multi-Robot Task Assignment in an Intelligent Warehouse. Gao, J., Li, Y., Xu, Y., & Lv, S. Applied Sciences, 2022.
Rotating consensus for double-integrator multi-agent systems with communication delay. Shi, X., Li, Y., Yang, Y., Sun, B., & Li, Y.. ISA Transactions, 2021.
Iterative learning control for a soft exoskeleton with hip and knee joint assistance. Chen, C., Zhang, Y., Li, Y., Wang, Z., Liu, Y., Cao, W. & Wu, X. Sensors, 2020.
Online Extrinsic Parameter Calibration for Robotic Camera–Encoder System. Wang, X., Chen, H., Li, Y., & Huang, H. IEEE Transactions on Industrial Informatics, 2019.
Vision and laser fused SLAM in indoor environments with multi-robot system. Chen, H., Huang, H., Qin, Y., Li, Y., & Liu, Y. Assembly Automation, 2019.
Coupling Based Estimation Approaches for the Average Reward Performance Potential in Markov Chains. Li, Y., Wu, X., Lou, Y., Chen, H., & Li, J. Automatica, 2018.
Motion Tracking of the Carotid Artery Wall From Ultrasound Image Sequences: a Nonlinear State-Space Approach. Gao, Z., Li, Y., Sun, Y., Yang, J., Xiong, H., Zhang, H., ... & Li, S. IEEE Transactions on Medical Imaging, 2018.
Online optimization of dynamic power management. Zhai, J.-F., Li, Y., Chen, & H.-Y. Control Theory and Applications, 2018.
Autonomous wi-fi relay placement with mobile robots. Gao, Y., Chen, H., Li, Y., Lyu, C., & Liu, Y. IEEE/ASME Transactions on Mechatronics, 2017.
A unified approach to time-aggregated Markov decision processes. Li, Y., & Wu, X. Automatica, 2016.
A basic formula for performance gradient estimation of semi-Markov decision processes. Li, Y., & Cao, F. European Journal of Operational Research, 2013.
Nonrigid registration of lung CT images based on tissue features. Zhang, R., Zhou, W., Li, Y., Yu, S. & Xie, Y. Computational and Mathematical Methods in Medicine, 2013.
Finding optimal memoryless policies of POMDPs under the expected average reward criterion. Li, Y., Yin, B., & Xi, H. European Journal of Operational Research, 2011.
On-line policy gradient estimation with multi-step sampling. Li, Y., Cao, F., & Cao, X. Discrete Event Dynamic Systems, 2010.
Partially observable Markov decision processes and performance sensitivity analysis. Li, Y., Yin, B., & Xi, H. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 2008.
Performance optimization of semi-Markov decision processes with discounted-cost criteria. Yin, B., Li, Y., Zhou, Y. & Xi, H. European Journal of Control, 2008.
Sensitivity analysis and estimates of the performance for M/G/1 queueing systems. Yin, B., Dai, G., Li, Y. & Xi, H. Performance Evaluation, 2007.
Performance optimization algorithms based on potentials for semi-Markov control processes. Dai, G., Yin, B., Li, Y. & Xi, H. International Journal of Control, 2005.

Conference Papers

A Fast Planning Algorithm for Humanoids with Supervised Learning and Subproblem Decomposition. Fang, Z., Li, Y., He, Z., & Lin, K. 2024 43rd Chinese Control Conference (CCC), 2024.
A Risk-sensitive Automatic Stock Trading Strategy Based on Deep Reinforcement Learning and Transformer. Li, L., Liu, Q., Li, Y., Mu, Y., & Zhang, Z. 2024 IEEE 20th International Conference on Automation Science and Engineering (CASE), 2024.
Offline Deep Reinforcement Learning Two-stage Optimization Framework Applied to Recommendation Systems. Jiang, Y., Liu, Q., & Li, Y.. 2024 43rd Chinese Control Conference (CCC), 2024.
Path Optimization Problem of Multi-Probe Flying Probe Tester. Lai, H., Li, Y., & Gao, J. 2024 43rd Chinese Control Conference (CCC), 2024.
An Environmental-Complexity-Based Navigation Method Based on Hierarchical Deep Reinforcement Learning. Chen, P., Liu, Q., Li, Y. & Ma, S. IEEE International Conference on Robotics and Automation (ICRA), 2024.
Optimal Containment Control of Nonlinear MASs: A Time-Aggregation-Based Policy Iteration Algorithm. Shi, X., Li, Y., & Du, C. IEEE Conference on Decision and Control (CDC), 2023.
Consensus of the General Second-Order MASs with Nonuniform Uncertain Time-Varying Communication Delays. Shi, X., Li, Y., & Du, C. 2023 China Automation Congress (CAC), 2023.
Quadrotor Control using Reinforcement Learning under Wind Disturbance. Lu, S., Li, Y., & Liu, Z. 2023 35th Chinese Control and Decision Conference (CCDC), 2023.
HGLP: Hierarchical Solver for Combined Task Assignment and Path Finding. Gao, J., Ye, Z., Li, Y., & Li, Y. 2023 35th Chinese Control and Decision Conference (CCDC), 2023.
Deep Reinforcement Learning Based Mobile Robot Navigation Using Sensor Fusion. Yan, K., Gao, J., & Li, Y.. 2023 42nd Chinese Control Conference (CCC), 2023.
Multi-Agent Path Finding Based on Graph Neural Network. Li, X., Gao, J., & Li, Y.. 2023 42nd Chinese Control Conference (CCC), 2023.
Multi-Agent Path Finding with Time Windows: Preliminary Results. Gao, J., Liu, Q., Chen, S., Yan, K., Li, X. & Li, Y. International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2023.
Battery Management for Warehouse Robots via Average-Reward Reinforcement Learning. Mu, Y., Li, Y., Lin, K., Deng, K., & Liu, Q. In IEEE International Conference on Robotics and Biomimetics (ROBIO), 2022.
Multi-Robot Real-time Game Strategy Learning based on Deep Reinforcement Learning. Deng, K., Li, Y., Lu, S., Mu, Y., Pang, X., & Liu, Q. In IEEE International Conference on Robotics and Biomimetics (ROBIO), 2022.
Multi-agent Pathfinding with Communication Reinforcement Learning and Deadlock Detection. Ye, Z., Li, Y., Guo, R., Gao, J., & Fu, W. In Intelligent Robotics and Applications: 15th International Conference, (ICIRA), 2022.
Decision Making for Autonomous Driving Via Multimodal Transformer and Deep Reinforcement Learning. Fu, W., Li, Y., Ye, Z., & Liu, Q. In IEEE International Conference on Real-time Computing and Robotics (RCAR), 2022.
A Mapless Navigation Method Based on Reinforcement Learning and Local Obstacle Map. Pang, X., Li, Y., Liu, Q., & Deng, K. 2022 China Automation Congress (CAC), Xiamen, China, 2022.
Exploration via Distributional Reinforcement Learning with Epistemic and Aleatoric Uncertainty Estimation. Liu, Q., Li, Y., Liu, Y., Chen, M., Lv, S., & Xu, Y. IEEE International Conference on Automation Science and Engineering, 2021.
Multi-agent pathfinding with local and global guidance. Xu, Y., Li, Y., Liu, Q., Gao, J., Liu, Y. & Chen, M. 2021 IEEE International Conference on Networking, Sensing and Control (ICNSC), 2021.
A deep safe reinforcement learning approach for mapless navigation. Lv, S., Li, Y., Liu, Q., Gao, J., Pang, X. & Chen, M. 2021 IEEE International Conference on Robotics and Biomimetics (ROBIO), 2021.
Towards Autonomous Driving Decision by Combining Self-attention and Deep Reinforcement Learning. Chen, M., Li, Y., Liu, Q., Lv, S., Xu, Y., & Liu, Y. IEEE International Conference on Real-time Computing and Robotics, 2021.
Efficient Power Grid Topology Control via Two-Stage Action Search. Liu, Y., Li, Y., Liu, Q., Xu, Y., Lv, S., & Chen, M. International Conference on Intelligent Robotics and Applications, 2021.
A 3D Simulation Environment and Navigation Approach for Robot Navigation via Deep Reinforcement Learning in Dense Pedestrian Environment. Liu, Q., Li, Y., & Liu, L. 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), 2020.
A lightweight soft exoskeleton in lower limb assistance. Zhang, Y., Wang, Z., Chen, C., Fang, T., Sun, R. & Li, Y.. 2020 Chinese Automation Congress (CAC), 2020.
An Overview of Robust Reinforcement Learning. Chen, S., & Li, Y. IEEE International Conference on Networking, Sensing and Control, 2020.
A coordinated path planning algorithm for multi-robot in intelligent warehouse. Chen, X., Li, Y., & Liu, L. 2019 IEEE International Conference on Robotics and Biomimetics (ROBIO), 2019.
Robust identification of visual markers under boundary occlusion condition. Chang, R., Li, Y., & Wu, C. IEEE International Conference on Robotics and Biomimetics, 2019.
Deep Reinforcement Learning Apply in Electromyography Data Classification. Song, C., Chen, C., Li, Y., & Wu, X. IEEE International Conference on Cyborg and Bionic Systems, 2019.
A deep reinforcement learning algorithm with expert demonstrations and supervised loss and its application in autonomous driving. Liu, K., Wan, Q., & Li, Y. Chinese Control Conference, 2018.
Sliding mode control of quasi-static micro mirrors with implicit-Euler implementation. Xiong, X., Kamal, S., Deveerasetty, K., Jin, S. & Li, Y.. 2018 IEEE International Conference on Real-time Computing and Robotics (RCAR), 2018.
Visual Grasping for a Lightweight Aerial Manipulator Based on NSGA-II and Kinematic Compensation. Fang, L., Chen, H., Lou, Y., Li, Y., & Liu, Y. IEEE International Conference on Robotics and Automation, 2018.
Singularity-Robust Hybrid Visual Servoing Control for Aerial Manipulator. Quan, F., Chen, H., Li, Y., Chen, J., & Liu, Y. IEEE International Conference on Robotics and Biomimetics, 2018.
A monocular vision localization algorithm based on maximum likelihood estimation. Chen, S., Li, Y., & Chen, H. IEEE International Conference on Real-Time Computing and Robotics, 2018.
An Inverse Reinforcement Learning Algorithm for semi-Markov Decision Processes. Tan, C., Li, Y., & Cheng, Y. IEEE International Conference on Information and Automation, 2018.
Online calibration for monocular vision and odometry fusion. Wang, X., Chen, H., & Li, Y.. Proceedings of 2017 IEEE International Conference on Unmanned Systems, 2018.
A cross-coupled iterative learning control design for biaxial systems based on natural local approximation of contour error. Liu, S., & Li, Y. Chinese Control Conference, 2017.
The control of two-wheeled self-balancing vehicle based on reinforcement learning in a continuous domain. Xia, P., & Li, Y. Youth Academic Annual Conference of Chinese Association of Automation, 2017.
Face recognition based on convolutional neural network & support vector machine. Guo, S., Chen, S., & Li, Y. IEEE International Conference on Information and Automation, IEEE, 2017.
Real-Time tracking a ground moving target in complex indoor and outdoor environments with UAV. Chen, S., Guo, S., & Li, Y. IEEE International Conference on Information and Automation, 2017.
Average Reward Reinforcement Learning for Semi-Markov Decision Processes. Yang, J., Li, Y., Chen, H., & Li, J. International Conference on Neural Information Processing, 2017.
Visual Servo Tracking Control of Quadrotor with a Cable Suspended Load. Jia, E., Chen, H., Li, Y., Lou, Y., & Liu, Y. International Conference on Computer Vision Systems, 2017.
Carotid artery wall motion estimated from ultrasound imaging sequences using a nonlinear state space approach. Gao, Z., Sun, Y., Zhang, H., Ghista, D., Li, Y., Xiong, H., Liu, X., Xie, Y., Wu, W. & Li, S. MICCAI 2016: Medical Image Computing and Computer-Assisted Intervention, Part III, 2016.
A semi-Markov decision process based dynamic power management for mobile devices. Zhang, M., Li, Y., & Chen, H. IEEE International Conference on Real-Time Computing and Robotics, 2016.
Autonomous WiFi-relay control with mobile robots. Gao, Y., Chen, H., Li, Y., & Liu, Y. IEEE International Conference on Real-Time Computing and Robotics, 2016.
Sample-path based performance sensitivity construction of semi-Markov systems. Li, Y., & Zhang, J. Chinese Control Conference, 2016.
An online optimization for dynamic power management. Zhai, J., Li, Y., & Chen, H. IEEE International Conference on Industrial Technology, 2016.
Visual laser-SLAM in large-scale indoor environments. Liang, X., Chen, H., Li, Y., & Liu, Y. 2016 IEEE International Conference on Robotics and Biomimetics, 2016.
A Gradient Learning Optimization for Dynamic Power Management. Li, Y., & Jiang, F. IEEE International Conference on Systems, Man, and Cybernetics, 2015.
An adaptive kalman filter to estimate state-of-charge of lithium-ion batteries. Luo, Z., Li, Y., & Lou, Y. IEEE International Conference on Information and Automation, 2015.
A simulation study of control methods for three-phase energy storage inverter. Du, J., Li, Y., & Lou, Y. IEEE International Conference on Information and Automation, 2015.
A unified approach for semi-Markov decision processes with discounted and average reward criteria. Li, Y., Wang, H., & Chen, H. The World Congress on Intelligent Control and Automation (WCICA), 2015.
Auction-based multi-agent task assignment in smart logistic center. Guo, Y., Li, Y., & Zhang, Y. Chinese Control Conference, 2014.
Convex optimization of battery energy storage station in a micro-grid. Zhang, R., Li, Y., & Lou, Y. IEEE International Conference on Information and Automation, 2013.
Sensitivity-based inverse reinforcement learning. Tao, Z., Chen, Z., & Li, Y. Chinese Control Conference, 2013.

Performance analysis of a small-scale unmanned helicopter under large wind disturbance. Zeng, W., Zhu, X., Li, Y., & Li, L. Chinese Control Conference, 2013.
An average reward performance potential estimation with geometric variance reduction. Li, Y. Chinese Control Conference, 2012.
An average-reward reinforcement learning algorithm based on Schweitzer’s Transformation. Li, J., Ren, J., & Li, Y. Chinese Control Conference, 2012.
Reinforcement learning algorithms for semi-Markov decision processes with average reward. Li, Y. IEEE International Conference on Networking, Sensing and Control, 2012
Less computational unscented Kalman filter for practical state estimation of small scale unmanned helicopters. Zeng, W., Zhu, X., Li, Y., & Li, Z. IEEE International Conference on Robotics and Automation, 2011.
Infinite-horizon gradient estimation for semi-Markov decision processes. Li Y., & Cao F. 2011 8th Asian Control Conference (ASCC), 2011.
Combining Sub-bands SNR on Cochlear Model for Voice Activity Detection. Liu, Q., Liu, Y., & Li, Y.. 2010 International Conference on Asian Language Processing, 2010.
RVI reinforcement learning for Semi-Markov decision processes with average reward. Li, Y., & Cao, F. The World Congress on Intelligent Control and Automation (WCICA), 2010.
An improvement of policy gradient estimation algorithms. Li, Y., Cao, F., & Cao, X.-R. International Workshop on Discrete Event Systems, 2008.