Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-Language Navigation figure
AlphaXiv 中文论文页面(可滚动查看)