LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment figure
AlphaXiv 中文论文页面(可滚动查看)