Hi! I am Jianxiong Li, studying embodied AI and Reinforcement Learning (RL). I am a final-year PhD candidate at AIR, Tsinghua University, advised by Prof. Xianyuan Zhan and Prof. Ya-Qin Zhang. I got my bachelor's degree in 2021 from the School of Mechanical Engineering, Xi'an Jiaotong University, where I did lots of projects on mechanical design and robotics.
My dream is to develop robots that are universally deployable across diverse real-world environments. Towards this goal, my current work primarily focused on:
- (Efficient Pretrain) How to build robotic foundation models efficiently when robotics data are limited?
- (Fast Post-train) How to fastly enhance robot peformance given limited budget?
- (RL+X) How to use RL to reach super-human performance on diverse domains, like robots, VLMs or LLMs?
Some links: Github / Twitter / Google Scholar / li-jx21@mails.tsinghua.edu.cn
News
- πOur X-VLA has won 1st place in the AGIBOT World Challenge (Manipulation track) @ IROS 2025.
- πWe release X-VLA, a cross-embodiment model that sweeps many benchmarks and achieves strong real-world performance.
- One paper (FlowPlanner) on autonomous driving is accepeted to NeurIPS 2025.
- One paper (LBP) on efficient latent planning is accepted to ICML 2025.
- One paper (UniAct) on cross-embodiment universal actions is accepted to CVPR 2025.
- πDiffusion-Planner is selected as oral presentation at ICLR 2025.
- Two papers on fast post-train (PSEC) and autonomous driving (Diffusion-Planner) are accepted to ICLR 2025.
- One paper (Robo-MUTUAL) on embodied representations is accepted to ICRA 2025.
- πOne paper (RSP) on offline RL is accepted to AAAI 2025 as oral.
- πIVM and DecisionNCE are selected as Outstanding Paper at MFM-EAI workshop @ ICML 2024.
Publications (* marks equal contribution)
- X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model (1st place π @ AGIBOT World Challenge (Manipulation track), IROS 2025) 2025 Paper | Code | Page
- Universal Actions for Enhanced Embodied Foundation Models CVPR 2025 2025 Paper | Code | Page
- Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning ICRA 2025 2025 Paper | Code | Page
- Skill Expansion and Composition in Parameter Space ICLR 2025 2025 Paper | Code | Page | Dataset | Model
- Instruction Guided Visual Masking NeurIPS 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page | Dataset | Model
- DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning ICML 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page
- Query-Policy Misalignment in Preference-Based Reinforcement Learning ICLR 2024 (Spotlight, Top 5%) 2024 Paper | Code
- Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model ICLR 2024 2024 Paper | Code | Page
- When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning ICLR 2023 2023 Paper | Code
- Mind the Gap: Offline Policy Optimization for Imperfect Rewards ICLR 2023 2023 Paper | Code
- X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model (1st place π @ AGIBOT World Challenge (Manipulation track), IROS 2025) 2025 Paper | Code | Page
- Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling NeurIPS 2025 2025
- PhysiAgent: An Embodied Agent Framework in Physical World 2025 Paper
- Universal Actions for Enhanced Embodied Foundation Models CVPR 2025 2025 Paper | Code | Page
- Efficient Robotic Policy Learning via Latent Space Backward Planning ICML 2025 2025 Paper | Code | Page
- Pushing the Limit of Sample-Efficient Offline Reinforcement Learning ICLR 2025 @ WWM Workshop 2025
- Reachability-Aware Reinforcement Learning for Collision Avoidance in Human-Machine Shared Control Under Review 2025
- Diffusion-Based Planning for Autonomous Driving with Flexible Guidance ICLR 2025 (Oral, Top 2%) 2025 Paper | Code | Page
- Skill Expansion and Composition in Parameter Space ICLR 2025 2025 Paper | Code | Page | Dataset | Model
- Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning ICRA 2025 2025 Paper | Code | Page
- Are Expressive Models Truly Necessary for Offline RL? AAAI 2025 (Oral, Top 5%) 2024 Paper | Code
- xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing NeurIPS 2024 OWA Workshop 2024 Paper | Code | Page
- Instruction Guided Visual Masking NeurIPS 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page | Dataset | Model
- DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning ICML 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page
- Query-Policy Misalignment in Preference-Based Reinforcement Learning ICLR 2024 (Spotlight, Top 5%) 2024 Paper | Code
- Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model ICLR 2024 2024 Paper | Code | Page
- PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning Preprint 2023 Paper | Code
- A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning Data Science for Transportation 2023 Paper
- Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization ICLR 2023 (Oral, Notable Top 5%) 2023 Paper | Code
- Mind the Gap: Offline Policy Optimizaiton for Imperfect Rewards ICLR 2023 2023 Paper | Code
- When data geometry meets deep function: Generalizing offline reinforcement learning ICLR 2023 2023 Paper | Code
- A Policy-Guided Imitation Approach for Offline Reinforcement Learning NeurIPS 2022 (Oral, Top 2%) 2022 Paper | Code | Slides | Media
- Vehicle Extreme Control based on Offline Reinforcement Leaning CAC 2022 2022
- Offline Reinforcement Learning with Soft Behavioral Regularization NeurIPS 2021 Offline RL Workshop 2021 Paper | Code
- X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model (1st place π @ AGIBOT World Challenge (Manipulation track), IROS 2025) 2025 Paper | Code | Page
- Universal Actions for Enhanced Embodied Foundation Models CVPR 2025 2025 Paper | Code | Page
- Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning ICRA 2025 2025 Paper | Code | Page
- Instruction Guided Visual Masking NeurIPS 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page | Dataset | Model
- DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning ICML 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page
- xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing NeurIPS 2024 OWA Workshop 2024 Paper | Code | Page
- Skill Expansion and Composition in Parameter Space ICLR 2025 2025 Paper | Code | Page | Dataset | Model
- Diffusion-Based Planning for Autonomous Driving with Flexible Guidance ICLR 2025 (Oral, Top 2%) 2025 Paper | Code | Page
- PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning Preprint 2023 Paper | Code
- Diffusion-Based Planning for Autonomous Driving with Flexible Guidance ICLR 2025 (Oral, Top 2%) 2025 Paper | Code | Page
- Are Expressive Models Truly Necessary for Offline RL? AAAI 2025 (Oral, Top 5%) 2024 Paper | Code
- Instruction Guided Visual Masking NeurIPS 2024 (Outstanding Paper @ ICML 2024 MFM-EAI Workshop) 2024 Paper | Code | Page | Dataset | Model
- Query-Policy Misalignment in Preference-Based Reinforcement Learning ICLR 2024 (Spotlight, Top 5%) 2024 Paper | Code
- Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model ICLR 2024 2024 Paper | Code | Page
- A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning Preprint 2023 Paper
- Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization ICLR 2023 (Oral, Notable Top 5%) 2023 Paper | Code
- Mind the Gap: Offline Policy Optimizaiton for Imperfect Rewards ICLR 2023 2023 Paper | Code
- When data geometry meets deep function: Generalizing offline reinforcement learning ICLR 2023 2023 Paper | Code
- A Policy-Guided Imitation Approach for Offline Reinforcement Learning NeurIPS 2022 (Oral, Top 2%) 2022 Paper | Code | Slides | Media
- Vehicle Extreme Control based on Offline Reinforcement Leaning CAC 2022 2022
- Offline Reinforcement Learning with Soft Behavioral Regularization NeurIPS 2021 Offline RL Workshop 2021 Paper | Code
