VLA2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation