VERM: Leveraging Foundation Models to Create a Virtual Eye for Efficient 3D Robotic Manipulation