Awesome Robotics Manipulation · full_paper

Robotic VLA Benefits from Joint Learning with Motion Image Diffusion

作者：Yu Fang, Kanchana Ranasinghe, Le Xue, Honglu Zhou, Juntao Tan, Ran Xu, Shelby Heinecke, Caiming Xiong, Silvio Savarese, Daniel Szafir, Mingyu Ding, Michael S. Ryoo, Juan Carlos Niebles · 单位：University of North Carolina at Chapel Hill · 会议/期刊：arXiv · 日期：2025-12-19 · 来源：Low-Level Learning-Based Action Modelling / Input Modelling / 2D Vision Language Action Models with Auxiliary Tasks - Visual Goal Extraction

辅助任务视觉语言动作扩散策略感知机器人学习操作

Robotic VLA Benefits from Joint Learning with Motion Image Diffusion figure — AlphaXiv 中文论文页面（可滚动查看）