MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent