SA-VLA: Spatially-Aware Flow-Matching for Vision-Language-Action Reinforcement Learning figure
AlphaXiv 中文论文页面(可滚动查看)