DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control