FASTer: Toward Powerful and Efficient Autoregressive Vision–Language–Action Models with Learnable Action Tokenizer and Block-wise Decoding figure
在线论文 PDF(可滚动查看)

精读笔记

精读笔记尚未生成。