Publications
Journal Papers
- CBTMP: Optimizing Multi-Agent Path Finding in Heterogeneous Cooperative Environments. Gao, J., Li, Y., Mu, Y., Liu, Q., Chen, H., & Lou, Y. IEEE Robotics and Automation Letters, 2025.
- Reinforcement learning-based optimal control for Markov jump systems with completely unknown dynamics. Shi, X., Li, Y., Du, C., Chen, C., Zong, G., & Gui, W. Automatica, 2025.
- Filter-Based Fully Distributed Output Regulation of Heterogeneous Learning Agents. Shi, X., Li, Y., Du, C., Chen, C., Hua, C., & Gui, W. IEEE Transactions on Circuits and Systems I: Regular Papers, 2025.
- Fully Distributed Event-Triggered Control of Nonlinear Multiagent Systems Under Directed Graphs: A Model-Free DRL Approach. Shi, X., Li, Y., Du, C., Shi, Y., Yang, C., & Gui, W. IEEE Transactions on Automatic Control, 2025.
- Distributional Policy Gradient With Distributional Value Function. Liu, Q., Li, Y., Shi, X., Lin, K., Liu, Y. & Lou, Y. IEEE Transactions on Neural Networks and Learning Systems, 2025.
- Safe Reinforcement Learning in Autonomous Driving With Epistemic Uncertainty Estimation Zhang, Z., Liu, Q., Li, Y., Lin, K. & Li, L. IEEE Transactions on Intelligent Transportation Systems, 2024.
- PCE: Multi-Agent Path Finding via Priority-Aware Communication & Experience Learning. Gao, J., Li, Y., Ye Z. & Wu, X. IEEE Transactions on Intelligent Vehicles, 2024.
- Learning Agile Quadrotor Flight in Restricted Environments with Safety Guarantees. Chen, S., Li, Y., Lou, Y., Lin, K. & Wu, X. IEEE Transactions on Intelligent Vehicles, 2024.
- A Time-Aggregated Model-Free RL Algorithm for Optimal Containment Control of MASs. Shi, X., Li, Y., Du, C., & Gui, W. IEEE Transactions on Circuits and Systems II: Express Briefs, 2024.
- Almost Surely Safe Exploration and Exploitation for Deep Reinforcement Learning with State Safety Estimation. Lin, K., Li, Y., Liu, Q., Li, D., Shi, X., & Chen, S. Information Sciences, 2024.
- Data Efficient Deep Reinforcement Learning with Action-ranked Temporal Difference Learning. Liu, Q., Li, Y., Liu, Y. & Lin, K. IEEE Transactions on Emerging Topics in Computational Intelligence, 2024.
- Distributional reinforcement learning with epistemic and aleatoric uncertainty estimation. Liu, Q., Li, Y., Chen, S., Lin, K., Shi, X., & Lou, Y. Information Sciences, 2023.
- FHCPL: An Intelligent Fixed-Horizon Constrained Policy Learning System for Risk-Sensitive Industrial Scenario. Lin K., Li, D., Li, Y., Chen, S., & Wu, X. IEEE Transactions on Industrial Informatics, 2023.
- Optimal Lateral Path-Tracking Control of Vehicles With Partial Unknown Dynamics Via DPG-Based Reinforcement Learning Methods. Shi, X., Li, Y., Hu, W., Du, C., Chen, C., & Gui, W. IEEE Transactions on Intelligent Vehicles, 2023.
- A review of graph-based multi-agent pathfinding solvers: From classical to beyond classical. Gao, J., Li, Y., Li, X., Yan, K. and Lin, K., & Wu, X. Knowledge-Based Systems, 2023.
- Motion Planner with Fixed-Horizon Constrained Reinforcement Learning for Complex Autonomous Driving Scenarios. Lin, K., Li, Y., Chen, S., Li, D., & Wu, X. IEEE Transactions on Intelligent Vehicles, 2023.
- TAG: Teacher-Advice Mechanism With Gaussian Process for Reinforcement Learning. Lin, K., Li, D., Li, Y., Chen, S., Liu, Q., Gao, J., Jin, Y., & Gong, L. IEEE Transactions on Neural Networks and Learning Systems, 2023.
- A fully distributed adaptive event-triggered control for output regulation of multi-agent systems with directed network. Shi, X., Li, Y., Liu, Q., Lin, K., & Chen, S. Information Sciences, 2023.
- Learning Real-Time Dynamic Responsive Gap-Traversing Policy for Quadrotors with Safety-Aware Exploration. Chen, S., Li, Y., Lou, Y., Lin, K., & Wu, X. IEEE Transactions on Intelligent Vehicles, 2022.
- A Two-Objective ILP Model of OP-MATSP for the Multi-Robot Task Assignment in an Intelligent Warehouse. Gao, J., Li, Y., Xu, Y., & Lv, S. Applied Sciences, 2022.
- Rotating consensus for double-integrator multi-agent systems with communication delay. Shi, X., Li, Y., Yang, Y., Sun, B., & Li, Y.. ISA Transactions, 2021.
- Iterative learning control for a soft exoskeleton with hip and knee joint assistance. Chen, C., Zhang, Y., Li, Y., Wang, Z., Liu, Y., Cao, W. & Wu, X. Sensors, 2020.
- Online Extrinsic Parameter Calibration for Robotic Camera–Encoder System. Wang, X., Chen, H., Li, Y., & Huang, H. IEEE Transactions on Industrial Informatics, 2019.
- Vision and laser fused SLAM in indoor environments with multi-robot system. Chen, H., Huang, H., Qin, Y., Li, Y., & Liu, Y. Assembly Automation, 2019.
- Coupling Based Estimation Approaches for the Average Reward Performance Potential in Markov Chains. Li, Y., Wu, X., Lou, Y., Chen, H., & Li, J. Automatica, 2018.
- Motion Tracking of the Carotid Artery Wall From Ultrasound Image Sequences: a Nonlinear State-Space Approach. Gao, Z., Li, Y., Sun, Y., Yang, J., Xiong, H., Zhang, H., ... & Li, S. IEEE Transactions on Medical Imaging, 2018.
- Online optimization of dynamic power management. Zhai, J.-F., Li, Y., Chen, & H.-Y. Control Theory and Applications, 2018.
- Autonomous wi-fi relay placement with mobile robots. Gao, Y., Chen, H., Li, Y., Lyu, C., & Liu, Y. IEEE/ASME Transactions on Mechatronics, 2017.
- A unified approach to time-aggregated Markov decision processes. Li, Y., & Wu, X. Automatica, 2016.
- A basic formula for performance gradient estimation of semi-Markov decision processes. Li, Y., & Cao, F. European Journal of Operational Research, 2013.
- Nonrigid registration of lung CT images based on tissue features. Zhang, R., Zhou, W., Li, Y., Yu, S. & Xie, Y. Computational and Mathematical Methods in Medicine, 2013.
- Finding optimal memoryless policies of POMDPs under the expected average reward criterion. Li, Y., Yin, B., & Xi, H. European Journal of Operational Research, 2011.
- On-line policy gradient estimation with multi-step sampling. Li, Y., Cao, F., & Cao, X. Discrete Event Dynamic Systems, 2010.
- Partially observable Markov decision processes and performance sensitivity analysis. Li, Y., Yin, B., & Xi, H. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 2008.
- Performance optimization of semi-Markov decision processes with discounted-cost criteria. Yin, B., Li, Y., Zhou, Y. & Xi, H. European Journal of Control, 2008.
- Sensitivity analysis and estimates of the performance for M/G/1 queueing systems. Yin, B., Dai, G., Li, Y. & Xi, H. Performance Evaluation, 2007.
- Performance optimization algorithms based on potentials for semi-Markov control processes. Dai, G., Yin, B., Li, Y. & Xi, H. International Journal of Control, 2005.
Conference Papers
- A Fast Planning Algorithm for Humanoids with Supervised Learning and Subproblem Decomposition. Fang, Z., Li, Y., He, Z., & Lin, K. 2024 43rd Chinese Control Conference (CCC), 2024.
- A Risk-sensitive Automatic Stock Trading Strategy Based on Deep Reinforcement Learning and Transformer. Li, L., Liu, Q., Li, Y., Mu, Y., & Zhang, Z. 2024 IEEE 20th International Conference on Automation Science and Engineering (CASE), 2024.
- Offline Deep Reinforcement Learning Two-stage Optimization Framework Applied to Recommendation Systems. Jiang, Y., Liu, Q., & Li, Y.. 2024 43rd Chinese Control Conference (CCC), 2024.
- Path Optimization Problem of Multi-Probe Flying Probe Tester. Lai, H., Li, Y., & Gao, J. 2024 43rd Chinese Control Conference (CCC), 2024.
- An Environmental-Complexity-Based Navigation Method Based on Hierarchical Deep Reinforcement Learning. Chen, P., Liu, Q., Li, Y. & Ma, S. IEEE International Conference on Robotics and Automation (ICRA), 2024.
- Optimal Containment Control of Nonlinear MASs: A Time-Aggregation-Based Policy Iteration Algorithm. Shi, X., Li, Y., & Du, C. IEEE Conference on Decision and Control (CDC), 2023.
- Consensus of the General Second-Order MASs with Nonuniform Uncertain Time-Varying Communication Delays. Shi, X., Li, Y., & Du, C. 2023 China Automation Congress (CAC), 2023.
- Quadrotor Control using Reinforcement Learning under Wind Disturbance. Lu, S., Li, Y., & Liu, Z. 2023 35th Chinese Control and Decision Conference (CCDC), 2023.
- HGLP: Hierarchical Solver for Combined Task Assignment and Path Finding. Gao, J., Ye, Z., Li, Y., & Li, Y. 2023 35th Chinese Control and Decision Conference (CCDC), 2023.
- Deep Reinforcement Learning Based Mobile Robot Navigation Using Sensor Fusion. Yan, K., Gao, J., & Li, Y.. 2023 42nd Chinese Control Conference (CCC), 2023.
- Multi-Agent Path Finding Based on Graph Neural Network. Li, X., Gao, J., & Li, Y.. 2023 42nd Chinese Control Conference (CCC), 2023.
- Multi-Agent Path Finding with Time Windows: Preliminary Results. Gao, J., Liu, Q., Chen, S., Yan, K., Li, X. & Li, Y. International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2023.
- Battery Management for Warehouse Robots via Average-Reward Reinforcement Learning. Mu, Y., Li, Y., Lin, K., Deng, K., & Liu, Q. In IEEE International Conference on Robotics and Biomimetics (ROBIO), 2022.
- Multi-Robot Real-time Game Strategy Learning based on Deep Reinforcement Learning. Deng, K., Li, Y., Lu, S., Mu, Y., Pang, X., & Liu, Q. In IEEE International Conference on Robotics and Biomimetics (ROBIO), 2022.
- Multi-agent Pathfinding with Communication Reinforcement Learning and Deadlock Detection. Ye, Z., Li, Y., Guo, R., Gao, J., & Fu, W. In Intelligent Robotics and Applications: 15th International Conference, (ICIRA), 2022.
- Decision Making for Autonomous Driving Via Multimodal Transformer and Deep Reinforcement Learning. Fu, W., Li, Y., Ye, Z., & Liu, Q. In IEEE International Conference on Real-time Computing and Robotics (RCAR), 2022.
- A Mapless Navigation Method Based on Reinforcement Learning and Local Obstacle Map. Pang, X., Li, Y., Liu, Q., & Deng, K. 2022 China Automation Congress (CAC), Xiamen, China, 2022.
- Exploration via Distributional Reinforcement Learning with Epistemic and Aleatoric Uncertainty Estimation. Liu, Q., Li, Y., Liu, Y., Chen, M., Lv, S., & Xu, Y. IEEE International Conference on Automation Science and Engineering, 2021.
- Multi-agent pathfinding with local and global guidance. Xu, Y., Li, Y., Liu, Q., Gao, J., Liu, Y. & Chen, M. 2021 IEEE International Conference on Networking, Sensing and Control (ICNSC), 2021.
- A deep safe reinforcement learning approach for mapless navigation. Lv, S., Li, Y., Liu, Q., Gao, J., Pang, X. & Chen, M. 2021 IEEE International Conference on Robotics and Biomimetics (ROBIO), 2021.
- Towards Autonomous Driving Decision by Combining Self-attention and Deep Reinforcement Learning. Chen, M., Li, Y., Liu, Q., Lv, S., Xu, Y., & Liu, Y. IEEE International Conference on Real-time Computing and Robotics, 2021.
- Efficient Power Grid Topology Control via Two-Stage Action Search. Liu, Y., Li, Y., Liu, Q., Xu, Y., Lv, S., & Chen, M. International Conference on Intelligent Robotics and Applications, 2021.
- A 3D Simulation Environment and Navigation Approach for Robot Navigation via Deep Reinforcement Learning in Dense Pedestrian Environment. Liu, Q., Li, Y., & Liu, L. 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), 2020.
- A lightweight soft exoskeleton in lower limb assistance. Zhang, Y., Wang, Z., Chen, C., Fang, T., Sun, R. & Li, Y.. 2020 Chinese Automation Congress (CAC), 2020.
- An Overview of Robust Reinforcement Learning. Chen, S., & Li, Y. IEEE International Conference on Networking, Sensing and Control, 2020.
- A coordinated path planning algorithm for multi-robot in intelligent warehouse. Chen, X., Li, Y., & Liu, L. 2019 IEEE International Conference on Robotics and Biomimetics (ROBIO), 2019.
- Robust identification of visual markers under boundary occlusion condition. Chang, R., Li, Y., & Wu, C. IEEE International Conference on Robotics and Biomimetics, 2019.
- Deep Reinforcement Learning Apply in Electromyography Data Classification. Song, C., Chen, C., Li, Y., & Wu, X. IEEE International Conference on Cyborg and Bionic Systems, 2019.
- A deep reinforcement learning algorithm with expert demonstrations and supervised loss and its application in autonomous driving. Liu, K., Wan, Q., & Li, Y. Chinese Control Conference, 2018.
- Sliding mode control of quasi-static micro mirrors with implicit-Euler implementation. Xiong, X., Kamal, S., Deveerasetty, K., Jin, S. & Li, Y.. 2018 IEEE International Conference on Real-time Computing and Robotics (RCAR), 2018.
- Visual Grasping for a Lightweight Aerial Manipulator Based on NSGA-II and Kinematic Compensation. Fang, L., Chen, H., Lou, Y., Li, Y., & Liu, Y. IEEE International Conference on Robotics and Automation, 2018.
- Singularity-Robust Hybrid Visual Servoing Control for Aerial Manipulator. Quan, F., Chen, H., Li, Y., Chen, J., & Liu, Y. IEEE International Conference on Robotics and Biomimetics, 2018.
- A monocular vision localization algorithm based on maximum likelihood estimation. Chen, S., Li, Y., & Chen, H. IEEE International Conference on Real-Time Computing and Robotics, 2018.
- An Inverse Reinforcement Learning Algorithm for semi-Markov Decision Processes. Tan, C., Li, Y., & Cheng, Y. IEEE International Conference on Information and Automation, 2018.
- Online calibration for monocular vision and odometry fusion. Wang, X., Chen, H., & Li, Y.. Proceedings of 2017 IEEE International Conference on Unmanned Systems, 2018.
- A cross-coupled iterative learning control design for biaxial systems based on natural local approximation of contour error. Liu, S., & Li, Y. Chinese Control Conference, 2017.
- The control of two-wheeled self-balancing vehicle based on reinforcement learning in a continuous domain. Xia, P., & Li, Y. Youth Academic Annual Conference of Chinese Association of Automation, 2017.
- Face recognition based on convolutional neural network & support vector machine. Guo, S., Chen, S., & Li, Y. IEEE International Conference on Information and Automation, IEEE, 2017.
- Real-Time tracking a ground moving target in complex indoor and outdoor environments with UAV. Chen, S., Guo, S., & Li, Y. IEEE International Conference on Information and Automation, 2017.
- Average Reward Reinforcement Learning for Semi-Markov Decision Processes. Yang, J., Li, Y., Chen, H., & Li, J. International Conference on Neural Information Processing, 2017.
- Visual Servo Tracking Control of Quadrotor with a Cable Suspended Load. Jia, E., Chen, H., Li, Y., Lou, Y., & Liu, Y. International Conference on Computer Vision Systems, 2017.
- Carotid artery wall motion estimated from ultrasound imaging sequences using a nonlinear state space approach. Gao, Z., Sun, Y., Zhang, H., Ghista, D., Li, Y., Xiong, H., Liu, X., Xie, Y., Wu, W. & Li, S. MICCAI 2016: Medical Image Computing and Computer-Assisted Intervention, Part III, 2016.
- A semi-Markov decision process based dynamic power management for mobile devices. Zhang, M., Li, Y., & Chen, H. IEEE International Conference on Real-Time Computing and Robotics, 2016.
- Autonomous WiFi-relay control with mobile robots. Gao, Y., Chen, H., Li, Y., & Liu, Y. IEEE International Conference on Real-Time Computing and Robotics, 2016.
- Sample-path based performance sensitivity construction of semi-Markov systems. Li, Y., & Zhang, J. Chinese Control Conference, 2016.
- An online optimization for dynamic power management. Zhai, J., Li, Y., & Chen, H. IEEE International Conference on Industrial Technology, 2016.
- Visual laser-SLAM in large-scale indoor environments. Liang, X., Chen, H., Li, Y., & Liu, Y. 2016 IEEE International Conference on Robotics and Biomimetics, 2016.
- A Gradient Learning Optimization for Dynamic Power Management. Li, Y., & Jiang, F. IEEE International Conference on Systems, Man, and Cybernetics, 2015.
- An adaptive kalman filter to estimate state-of-charge of lithium-ion batteries. Luo, Z., Li, Y., & Lou, Y. IEEE International Conference on Information and Automation, 2015.
- A simulation study of control methods for three-phase energy storage inverter. Du, J., Li, Y., & Lou, Y. IEEE International Conference on Information and Automation, 2015.
- A unified approach for semi-Markov decision processes with discounted and average reward criteria. Li, Y., Wang, H., & Chen, H. The World Congress on Intelligent Control and Automation (WCICA), 2015.
- Auction-based multi-agent task assignment in smart logistic center. Guo, Y., Li, Y., & Zhang, Y. Chinese Control Conference, 2014.
- Convex optimization of battery energy storage station in a micro-grid. Zhang, R., Li, Y., & Lou, Y. IEEE International Conference on Information and Automation, 2013.
- Sensitivity-based inverse reinforcement learning. Tao, Z., Chen, Z., & Li, Y. Chinese Control Conference, 2013.
- Performance analysis of a small-scale unmanned helicopter under large wind disturbance. Zeng, W., Zhu, X., Li, Y., & Li, L. Chinese Control Conference, 2013.
- An average reward performance potential estimation with geometric variance reduction. Li, Y. Chinese Control Conference, 2012.
- An average-reward reinforcement learning algorithm based on Schweitzer’s Transformation. Li, J., Ren, J., & Li, Y. Chinese Control Conference, 2012.
- Reinforcement learning algorithms for semi-Markov decision processes with average reward. Li, Y. IEEE International Conference on Networking, Sensing and Control, 2012
- Less computational unscented Kalman filter for practical state estimation of small scale unmanned helicopters. Zeng, W., Zhu, X., Li, Y., & Li, Z. IEEE International Conference on Robotics and Automation, 2011.
- Infinite-horizon gradient estimation for semi-Markov decision processes. Li Y., & Cao F. 2011 8th Asian Control Conference (ASCC), 2011.
- Combining Sub-bands SNR on Cochlear Model for Voice Activity Detection. Liu, Q., Liu, Y., & Li, Y.. 2010 International Conference on Asian Language Processing, 2010.
- RVI reinforcement learning for Semi-Markov decision processes with average reward. Li, Y., & Cao, F. The World Congress on Intelligent Control and Automation (WCICA), 2010.
- An improvement of policy gradient estimation algorithms. Li, Y., Cao, F., & Cao, X.-R. International Workshop on Discrete Event Systems, 2008.