Exploring the Limits of Vision-Language-Action Manipulation in Cross-task Generalization figure
AlphaXiv 中文论文页面(可滚动查看)