Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model figure
AlphaXiv 中文论文页面(可滚动查看)