主页 ← Embodied AI TopConf Index

Embodied AI TopConf · ICLR2025

VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation

ICLR2025 / Vision-Language-Action Models

视觉语言动作感知

VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation figure — AlphaXiv 中文论文页面（可滚动查看）

论文对话

模型：读取中