Publications

2025

  1. Under Review
    QdepthVLA.png
    QDepth-VLA: Quantized Depth Prediction as Auxiliary Supervision for Vision-Language-Action Models
    Yixuan Li, Yuhui Chen, Mingcai Zhou, and Haoran Li
    Oct 2025
  2. Survey of Vision-Language-Action Models for Embodied Manipulation
    面向具身操作的视觉-语言-动作模型综述
    Haoran LiYuhui ChenWenbo Cui, Weiheng Liu, Kai Liu, Mingcai Zhou, Zhengtao Zhang, and Dongbin Zhao
    In IEEE/CAA Journal of Automatica Sinica , Nov 2025
  3. ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
    Yuhui Chen, Shuai Tian, Yingting Zhou, Shugao Liu, Haoran Li, and Dongbin Zhao
    In Robotics: Science and Systems XXI, RSS , Jun 2025
  4. Under Review
    CL3R.png
    CL3R: 3D Reconstruction and Contrast Learning for Enhanced Robotic Manipulation Representations
    Jul 2025
  5. TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning
    Nov 2025

2024

  1. Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization
    Haoran LiZhennan JiangYuhui Chen, and Dongbin Zhao
    In The 38th Annual Conference on Neural Information Processing Systems, NIPS , Sep 2024
  2. Boosting Continuous Control with Consistency Policy
    Yuhui ChenHaoran Li, and Dongbin Zhao
    In The 23rd International Conference on Autonomous Agents and Multiagent Systems, AAMAS , May 2024