Publications

2026

  1. Survey of Vision-Language-Action Models for Embodied Manipulation
    面向具身操作的视觉-语言-动作模型综述
    Haoran LiYuhui ChenWenbo Cui, Weiheng Liu, Kai Liu, Mingcai ZhouZhengtao Zhang, and Dongbin Zhao
    IEEE/CAA Journal of Automatica Sinica 自动化学报, Jan 2026
  2. QDepth-VLA: Quantized Depth Prediction as Auxiliary Supervision for Vision-Language-Action Models
    Yixuan Li, Yuhui ChenMingcai Zhou, and Haoran Li
    May 2026

2025

  1. ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
    Yuhui Chen, Shuai Tian , Yingting Zhou, Shugao Liu, Haoran Li, and Dongbin Zhao
    In Robotics: Science and Systems XXI, RSS , Jun 2025
  2. Under Review
    CL3R.png
    CL3R: 3D Reconstruction and Contrast Learning for Enhanced Robotic Manipulation Representations
    Jul 2025
  3. TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning
    IEEE Transactions on Systems, Man, and Cybernetics: Systems, Dec 2025

2024

  1. Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization
    Haoran LiZhennan JiangYuhui Chen, and Dongbin Zhao
    In The 38th Annual Conference on Neural Information Processing Systems, NIPS , Sep 2024
  2. Boosting Continuous Control with Consistency Policy
    Yuhui ChenHaoran Li, and Dongbin Zhao
    In The 23rd International Conference on Autonomous Agents and Multiagent Systems, AAMAS , May 2024