Figure: Steering Your Diffusion Policy with Latent Space Reinforcement Learning