Actions as Language: Fine-Tuning VLMs into VLAs Without Catastrophic Forgetting figure
AlphaXiv Chinese-language paper page (scroll to view)