Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language Models figure
AlphaXiv 中文概览(可滚动查看)