Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation figure
AlphaXiv 中文概览(可滚动查看)