CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision