StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation