ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning figure
AlphaXiv 中文论文页面(可滚动查看)