Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning
AlphaXiv Chinese-language paper page (scroll to view)