Yuhui Chen - 陈宇辉

Affiliations. Institute of Automation, Chinese Academy of Sciences.

prof_pic1.jpg

at Wall Street, New York in 2024.

Hi there! This is Yuhui Chen. I am currently a third-year Ph.D student in Control Theory and Engineering at the Institute of Automation, Chinese Academy of Sciences (CASIA), supervised by Prof. Dongbin Zhao and Prof. Haoran Li.

My research sits at Embodied AI, Reinforcement Learning, and Foundation Models. I am particularly interested in building models that are not only general-purpose but also capable of continuous adaptation and precise physical interaction in real-world. My long-term goal is to equip autonomous agents with the ability to understand the open world, learn from prior experiences, and interact intelligently with humans. Currently, I am working through:

  1. VLA & Foundation Models: Developing unified model architectures for robotic manipulation that integrate spatial perception and 3D reasoning.
  2. Reinforcement Fine-Tuning & Continual Learning: Bridging the gap between large-scale pre-training and downstream task-specific performance with reinforced post-training and enabling robots learn continuously without knowledge forgetting.
  3. Generative World Models as Simulators: Leveraging Diffusion and Video Generation models to provide dense rewards and structural guidance for sample-efficient robot learning.

Prior to joining CASIA, I was a MCU embedded engineer at Dajiang Innovations (DJI). I hold a B.Eng. degree in Information Engineering from Beijing Institute of Technology (BIT) and a B.Eng. in Electrical Communication Engineering from the Australian National University (ANU).

News

Feb 02, 2026 Our paper about learning 3D representations for robotic manipulation (CLAR) was accepted to ICRA 2026.
Dec 19, 2025 Our paper about augmenting VLA models with auxiliary depth prediction (QDepth-VLA) was accepted to AAMAS 2026.
Nov 21, 2025 Our paper about leveraging a pre-trained text-to-video diffusion model to generate dense rewards for robot tasks (TeViR) was accepted to IEEE Transactions on Systems, Man, and Cybernetics: Systems.
Nov 06, 2025 Our survey of VLA models for embodied manipulation was accepted to Acta Automatica Sinica (自动化学报).
Apr 11, 2025 Our paper about VLA reinforced fine-tuning via consistency policy in real-world environemnts (ConRFT) was accepted to RSS 2025.

Selected Publications

  1. Survey of Vision-Language-Action Models for Embodied Manipulation
    面向具身操作的视觉-语言-动作模型综述
    Haoran LiYuhui Chen, Wenbo Cui, Weiheng Liu, Kai Liu, Mingcai ZhouZhengtao Zhang, and Dongbin Zhao
    IEEE/CAA Journal of Automatica Sinica 自动化学报, Jan 2026
  2. ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
    Yuhui Chen, Shuai Tian , Yingting Zhou, Shugao Liu, Haoran Li, and Dongbin Zhao
    In Robotics: Science and Systems XXI, RSS , Jun 2025
  3. Under Review
    LifeLongRFT.png
    Towards Long-Lived Robots: Continual Learning VLA Models via Reinforcement Fine-Tuning
    Yuan Liu, Haoran Li, Shuai Tian, Yuxing Qin, Yuhui ChenYupeng Zheng, Yongzhen Huang, and Dongbin Zhao
    Feb 2026
  4. Under Review
    wovr.png
    WoVR: World Models as Reliable Simulators for Post-Training VLA Policies with RL
    Zhennan Jiang , Shangqing Zhou , Yutong Jiang, Zefang Huang, Mingjie Wei, Yuhui Chen , Tianxing Zhou, Zhen Guo, Hao Lin , Quanlu Zhang , Yu Wang, Haoran LiChao Yu, and Dongbin Zhao
    Feb 2026
  5. TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning
    IEEE Transactions on Systems, Man, and Cybernetics: Systems, Feb 2026
  6. Boosting Continuous Control with Consistency Policy
    Yuhui ChenHaoran Li, and Dongbin Zhao
    In The 23rd International Conference on Autonomous Agents and Multiagent Systems, AAMAS , May 2024
Full List of Publications