MVP-LAM: Learning Action-Centric Latent Action via Cross-Viewpoint Reconstruction figure
AlphaXiv 中文论文页面(可滚动查看)