A General One-Shot Multimodal Active Perception Framework for Robotic Manipulation: Learning to Predict Optimal Viewpoint figure
AlphaXiv 中文论文页面(可滚动查看)