Extracting Visual Plans from Unlabeled Videos via Symbolic Guidance figure
AlphaXiv 中文概览(可滚动查看)