Efficient Vision-Language-Action Models for Embodied Manipulation: A Systematic Survey
Figure: AlphaXiv Chinese-language paper page (scrollable view)