RynnVLA-002: A Unified Vision-Language-Action and World Model figure
AlphaXiv 中文论文页面(可滚动查看)