Vision-Language-Action Model with Open-World Generalization figure
AlphaXiv 中文概览(可滚动查看)