H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos