VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs figure
AlphaXiv 中文论文页面(可滚动查看)