CF-VLA: Efficient Coarse-to-Fine Action Generation for Vision-Language-Action Policies