👨🎓 About Me
Hi! I am currently a fourth-year PhD student at the Institute for AI Industry Research (AIR) and the School of Vehicle and Mobility, Tsinghua University, advised by Prof. Xianyuan Zhan and Prof. Ya-Qin Zhang. I received my bachelor’s degree in June 2021 from the School of Mechanical Engineering, Xi’an Jiaotong University.
My research interests lie broadly in advanced data-driven learning theory and algorithms for decision making and optimization, such as offline reinforcement learning (RL), and in their applications to autonomous driving and robotics.
🔥 News
- [2025.01] Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning has been accepted by ICRA 2025!
- [2025.01] Skill Expansion and Composition in Parameter Space and Diffusion-Based Planning for Autonomous Driving with Flexible Guidance have been accepted by ICLR 2025!
- [2024.12] Are Expressive Models Truly Necessary for Offline RL? has been accepted as an Oral presentation at AAAI 2025!
- [2024.09] Instruction-Guided Visual Masking has been accepted to NeurIPS 2024!
- [2024.07] DecisionNCE has been selected as an Outstanding Paper at the MFM-EAI Workshop @ ICML 2024!
- [2024.07] Instruction-Guided Visual Masking has been selected as an Outstanding Paper at the MFM-EAI Workshop @ ICML 2024!
- [2024.05] DecisionNCE has been accepted to ICML 2024!
- [2024.01] Query-Policy Misalignment in Preference-Based Reinforcement Learning has been selected as a Spotlight at ICLR 2024!
- [2024.01] Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model and Query-Policy Misalignment in Preference-Based Reinforcement Learning have been accepted to ICLR 2024!
- [2023.11] Honored to be selected as a Top Reviewer of NeurIPS 2023 (top 10%).
- [2023.02] Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization, When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning and Mind the Gap: Offline Policy Optimization for Imperfect Rewards have been accepted in ICLR 2023, including one Oral paper!
- [2022.09] A Policy-Guided Imitation Approach for Offline Reinforcement Learning has been accepted as Oral in NeurIPS 2022!
📝 Publications
* Equal contribution.
Preprints and Codebases
- Jinliang Zheng$^*$, Jianxiong Li$^*$, Dongxiu Liu$^*$, Yinan Zheng, Zhihao Wang, Zhonghong Ou, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan, Universal Actions for Enhanced Embodied Foundation Models, 2025. [Code][Project Page]
- Haoyi Niu$^*$, Qimao Chen$^*$, Tenglong Liu, Jianxiong Li, Guyue Zhou, Yi Zhang, Jianming Hu, Xianyuan Zhan, xTED: Cross-Domain Policy Adaptation via Diffusion-Based Trajectory Editing, NeurIPS 2024 OWA workshop, 2024.
- Jianxiong Li$^*$, Shichao Lin$^*$, Tianyu Shi, Chujie Tian, Yu Mei, Jian Song, Xianyuan Zhan, Ruimin Li, A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning, Under Review, 2023.
- Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Ya-Qin Zhang, PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning, Under Review, 2023. [Code]
- Haoran Xu, Xianyuan Zhan, Jianxiong Li, Honglei Yin, Offline Reinforcement Learning with Soft Behavior Regularization, NeurIPS 2022 offline RL workshop, Under Review, 2021. [Code]
Conference Proceedings
- [ICRA 2025] Jianxiong Li$^*$, Zhihao Wang$^*$, Jinliang Zheng$^*$, Xiaoai Zhou, Guanming Wang, Guanglu Song, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Junzhi Yu, Xianyuan Zhan, Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning.
- [ICLR 2025] Tenglong Liu$^*$, Jianxiong Li$^*$, Yinan Zheng, Haoyi Niu, Yixing Lan, Xin Xu, Xianyuan Zhan, Skill Expansion and Composition in Parameter Space.
- [ICLR 2025] Yinan Zheng$^*$, Ruiming Liang$^*$, Kexin Zheng$^*$, Jinliang Zheng, Liyuan Mao, Jianxiong Li, Weihao Gu, Rui Ai, Shengbo Eben Li, Xianyuan Zhan, Jingjing Liu, Diffusion-Based Planning for Autonomous Driving with Flexible Guidance. [Code]
- [AAAI 2025 (Oral)] Guan Wang$^*$, Haoyi Niu$^*$, Jianxiong Li, Li Jiang, Jianming Hu, Xianyuan Zhan, Are Expressive Models Truly Necessary for Offline RL? [Code]
- [NeurIPS 2024 (Outstanding Paper at MFM-EAI Workshop @ ICML 2024)] Jinliang Zheng$^*$, Jianxiong Li$^*$, Sijie Cheng, Yinan Zheng, Jiaming Li, Jihao Liu, Yu Liu, Jingjing Liu, Xianyuan Zhan, Instruction-Guided Visual Masking.
- [ICML 2024 (Outstanding Paper at MFM-EAI Workshop @ ICML 2024)] Jianxiong Li$^*$, Jinliang Zheng$^*$, Yinan Zheng$^*$, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan, DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning. [Code][Project Page]
- [ICLR 2024 (Spotlight)] Xiao Hu$^*$, Jianxiong Li$^*$, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang, Query-Policy Misalignment in Preference-Based Reinforcement Learning. [Code]
- [ICLR 2024] Yinan Zheng$^*$, Jianxiong Li$^*$, Dongjie Yu, Yujie Yang, Shengbo Eben Li, Xianyuan Zhan, Jingjing Liu, Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model. [Code][Project Page]
- [ICLR 2023] Jianxiong Li, Xianyuan Zhan, Haoran Xu, Xiangyu Zhu, Jingjing Liu, Ya-Qin Zhang, When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning. [Code]
- [ICLR 2023] Jianxiong Li$^*$, Xiao Hu$^*$, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang, Mind the Gap: Offline Policy Optimization for Imperfect Rewards. [Code]
- [ICLR 2023 (Oral)] Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Victor Wai Kin Chan, Xianyuan Zhan, Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization. [Code]
- [NeurIPS 2022 (Oral)] Haoran Xu$^*$, Li Jiang$^*$, Jianxiong Li, Xianyuan Zhan, A Policy-Guided Imitation Approach for Offline Reinforcement Learning. [Code]
- [CAC 2022] Shiyue Zhao, Jianxiong Li, Xiao Hu, Junzhi Zhang, Chengkun He, Vehicle Extreme Control based on Offline Reinforcement Learning.
📖 Education
- 2021.08 - Present, PhD candidate, Institute for AI Industry Research (AIR) / School of Vehicle and Mobility, Tsinghua University, Beijing, China.
- 2017.08 - 2021.06, Undergraduate, School of Mechanical Engineering, Xi’an Jiaotong University, Shaanxi, China.
💻 Internships
- 2022.06 - 2023.05, Baidu, Beijing, China. Studied applications of offline RL algorithms to traffic signal control (TSC).
- 2023.07 - 2023.09, IDRIVERPLUS, Beijing, China. Studied applications of lifelong learning algorithms to 2D object detection for autonomous driving.
🧑🎨 Services
Reviewer for NeurIPS 23-24, ICLR 24-25, ICML 24-25, AAAI 24-25, IJCAI 24, TMLR, DMLR workshop@ICLR 2024