MVISTA-4D: View-Consistent 4D World Model with Test-Time Action Inference for Robotic Manipulation