DEAS: DEtached value learning with Action Sequence for Scalable Offline RL