FASTer: Toward Efficient Autoregressive Vision Language Action Modeling via Neural Action Tokenization
AlphaXiv Chinese-language paper page (scrollable view)