Figure: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows
AlphaXiv Chinese-language paper page (scroll to view)