Introduction

Embodied Decision Intelligence Lab (EDI Lab) belongs to Tsinghua University Shenzhen International Graduate School. It is supersised by Prof. Chao Yu (于超). Our lab is committed to research related to reinforcement learning infrastructure, strategic agent and embodied AI.

Chao Yu is an Associate Professor at Tsinghua University, SIGS. She is also the chairman of the Tsinghua Shenzhen International Graduate School - AgiBot Joint Research Center for Embodied Cognition and Decision Systems (JCES). As first author or corresponding author, Prof. Chao Yu has published more than 50 papers in top-tier international conferences and journals, including ICML, NeurIPS, ICLR, CVPR, ECCV, CoRL, IROS, ICRA, TMLR, and RAL, with over 6,000 citations on Google Scholar.My representative works include the multi-agent reinforcement learning algorithm MAPPO, which has received more than 3,000 Google Scholar citations, and RLinf, a large-scale reinforcement learning training framework for embodied intelligence, which has accumulated over 3,000 GitHub stars.

Our lab is currently recruiting Master’s students, Ph.D. students, joint-program Ph.D. students with Zhongguancun Academy, postdoctoral researchers, and undergraduate research assistants. We warmly welcome students who are interested in recommended admission or applying through the graduate entrance examination to SIGS programs such as Artificial Intelligence, Data Science and Information Technology, Big Data Engineering, and Electronic Engineering, as well as applicants for the above Ph.D. and postdoctoral positions, to join us.

Highlights

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Zelai Xu, Zhexuan Xu, Xiangmin Yi, Huining Yuan, Xinlei Chen, Yongji Wu, Chao Yu, Yu Wang

Proceedings of CVPR · 2026CCF-A

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Chao Yu, Yuanqing Wang, Zhen Guo, Hao Lin, Si Xu, Hongzhi Zang, Quanlu Zhang, Yongji Wu, Chunyang Zhu, Junhao Hu, Zixiao Huang, Mingjie Wei, Yuqing Xie, Ke Yang, Bo Dai, Zhexuan Xu, Jiakun Du, Xiangyuan Wang, Xu Fu, Letong Shi, Zhihao Liu, Kang Chen, Weilin Liu, Gang Liu, Boxun Li, Jianlei Yang, Zhi Yang, Guohao Dai, Yu Wang

arXiv preprint · 2025

RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training

Hongzhi Zang, Mingjie Wei, Si Xu, Yongji Wu, Zhen Guo, Yuanqing Wang, Hao Lin, Liangzhi Shi, Yuqing Xie, Zhexuan Xu, Zhihao Liu, Kang Chen, Wenhao Tang, Quanlu Zhang, Weinan Zhang, Chao Yu, Yu Wang

arXiv preprint · 2025

Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game

Zelai Xu, Chao Yu, Fei Fang, Yu Wang, Yi Wu

Proceedings of ICML · 2024CCF-A

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu

Proceedings of ICML · 2024CCF-A

The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games

Chao Yu, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre Bayen, Yi Wu

Proceedings of NeurIPS · 2022CCF-A

News

RLinf助力全球首个具身智能工业产线规模落地

April 14, 2026

2026 年 4 月 14 日，智元机器人在龙旗科技南昌平板制造工厂开启长达 8 小时的真实产线作业直播，2283次任务零失误，成功率100%，面向全球公开验证全球首个具身智能 3C 精密制造产线规模化落地成果。智元精灵 G2 以产线 “正式员工” 身份，在高速流水线上完成精密上下料、人机协同全流程作业。标志具身智能正式迈入工业级常态化部署阶段，成为全球具身智能产业商用落地的里程碑事件。

清华大学-自变量机器人“下一代具身基座模型”产学研专项启动会顺利召开

April 13, 2026

4月13日，清华大学-自变量机器人科技（深圳）有限公司“下一代具身基座模型”产学研深度融合专项启动会暨指导委员会第一次会议在清华大学电子工程馆7层顺利召开。清华大学电子工程系主任沈渊、专项指导委员会主任汪玉、专项指导委员会秘书长于超，以及自变量机器人科技（深圳）有限公司创始人&首席执行官，专项指导委员会主任王潜、联合创始人&首席技术官王昊等共同出席本次会议。

第二十一届研电赛“无问芯穹”命题正式发布，新增具身智能强化学习赛题

April 11, 2026

保研加分+丰厚奖金+平台券等你赢！

智元的Genie Sim已接入RLinf!

April 08, 2026

智元仿真平台Genie Sim 3.0迎来新升级，自然语言生成3D世界

RLinf 入选具身智能 EAI-100 年度榜单十大突破项目

March 26, 2026

实力登榜！无问芯穹 RLinf 入选具身智能 EAI-100 年度榜单十大突破项目

View all news →