PokeVLA: Empowering Pocket-Sized Vision-Language-Action Model with Comprehensive World Knowledge Guidance figure
AlphaXiv 中文论文页面(可滚动查看)