EmboAlign: Aligning Video Generation with Compositional Constraints for Zero-Shot Manipulation
AlphaXiv Chinese-language paper page (scroll to view)