NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards figure
AlphaXiv 中文论文页面(可滚动查看)