Embodied Decision Intelligence Lab (EDI Lab) 清华大学具身决策智能实验室

Introduction

Embodied Decision Intelligence Lab (EDI Lab) belongs to Tsinghua University Shenzhen International Graduate School. It is supersised by Prof. Chao Yu (于超). Our lab is committed to research related to reinforcement learning infrastructure, strategic agent and embodied AI.

Chao Yu is an Associate Professor at Tsinghua University, SIGS. She is also the chairman of the Tsinghua Shenzhen International Graduate School - AgiBot Joint Research Center for Embodied Cognition and Decision Systems (JCES). As first author or corresponding author, Prof. Chao Yu has published more than 50 papers in top-tier international conferences and journals, including ICML, NeurIPS, ICLR, CVPR, ECCV, CoRL, IROS, ICRA, TMLR, and RAL, with over 6,000 citations on Google Scholar.My representative works include the multi-agent reinforcement learning algorithm MAPPO, which has received more than 3,000 Google Scholar citations, and RLinf, a large-scale reinforcement learning training framework for embodied intelligence, which has accumulated over 3,000 GitHub stars.

Our lab is currently recruiting Master’s students, Ph.D. students, joint-program Ph.D. students with Zhongguancun Academy, postdoctoral researchers, and undergraduate research assistants. We warmly welcome students who are interested in recommended admission or applying through the graduate entrance examination to SIGS programs such as Artificial Intelligence, Data Science and Information Technology, Big Data Engineering, and Electronic Engineering, as well as applicants for the above Ph.D. and postdoctoral positions, to join us.

Highlights

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments
Zelai Xu, Zhexuan Xu, Xiangmin Yi, Huining Yuan, Xinlei Chen, Yongji Wu, Chao Yu, Yu Wang
Proceedings of CVPR  ·  2026CCF-A
RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
Chao Yu, Yuanqing Wang, Zhen Guo, Hao Lin, Si Xu, Hongzhi Zang, Quanlu Zhang, Yongji Wu, Chunyang Zhu, Junhao Hu, Zixiao Huang, Mingjie Wei, Yuqing Xie, Ke Yang, Bo Dai, Zhexuan Xu, Jiakun Du, Xiangyuan Wang, Xu Fu, Letong Shi, Zhihao Liu, Kang Chen, Weilin Liu, Gang Liu, Boxun Li, Jianlei Yang, Zhi Yang, Guohao Dai, Yu Wang
arXiv preprint  ·  2025
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training
Hongzhi Zang, Mingjie Wei, Si Xu, Yongji Wu, Zhen Guo, Yuanqing Wang, Hao Lin, Liangzhi Shi, Yuqing Xie, Zhexuan Xu, Zhihao Liu, Kang Chen, Wenhao Tang, Quanlu Zhang, Weinan Zhang, Chao Yu, Yu Wang
arXiv preprint  ·  2025
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
Zelai Xu, Chao Yu, Fei Fang, Yu Wang, Yi Wu
Proceedings of ICML  ·  2024CCF-A
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu
Proceedings of ICML  ·  2024CCF-A
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
Chao Yu, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre Bayen, Yi Wu
Proceedings of NeurIPS  ·  2022CCF-A

News

RLinf助力全球首个具身智能工业产线规模落地

2026 年 4 月 14 日,智元机器人在龙旗科技南昌平板制造工厂开启长达 8 小时的真实产线作业直播,2283次任务零失误,成功率100%,面向全球公开验证全球首个具身智能 3C 精密制造产线规模化落地成果。智元精灵 G2 以产线 “正式员工” 身份,在高速流水线上完成精密上下料、人机协同全流程作业。标志具身智能正式迈入工业级常态化部署阶段,成为全球具身智能产业商用落地的里程碑事件。

清华大学-自变量机器人“下一代具身基座模型”产学研专项启动会顺利召开

4月13日,清华大学-自变量机器人科技(深圳)有限公司“下一代具身基座模型”产学研深度融合专项启动会暨指导委员会第一次会议在清华大学电子工程馆7层顺利召开。清华大学电子工程系主任沈渊、专项指导委员会主任汪玉、专项指导委员会秘书长于超,以及自变量机器人科技(深圳)有限公司创始人&首席执行官,专项指导委员会主任王潜、联合创始人&首席技术官王昊等共同出席本次会议。

第二十一届研电赛“无问芯穹”命题正式发布,新增具身智能强化学习赛题

保研加分+丰厚奖金+平台券等你赢!

智元的Genie Sim已接入RLinf!

智元仿真平台Genie Sim 3.0迎来新升级,自然语言生成3D世界

RLinf 入选具身智能 EAI-100 年度榜单十大突破项目

实力登榜!无问芯穹 RLinf 入选具身智能 EAI-100 年度榜单十大突破项目

Talks

RLinf相关工作将在北京智源大会上报告

清华大学校庆暨智元实验室揭牌

军事科学院线上分享

“松雅智汇”学术讲座第二期

迈向具身决策智能: 强化学习算法与基础设施协同演进

中国具身智能大会

于超老师受邀在中国具身智能大会做了两场报告

Sponsors