MoS-VLA: A Vision-Language-Action Model with One-Shot Skill Adaptation figure
AlphaXiv 中文论文页面(可滚动查看)