ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy figure
AlphaXiv 中文论文页面(可滚动查看)