ReMem-VLA: Empowering Vision-Language-Action Model with Memory via Dual-Level Recurrent Queries figure
AlphaXiv 中文论文页面(可滚动查看)