Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals figure
AlphaXiv 中文概览(可滚动查看)