Embodied Navigation with Auxiliary Task of Action Description Prediction figure
AlphaXiv 中文论文页面(可滚动查看)