StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision