Rethinking Visual-Language-Action Model Scaling: Alignment, Mixture, and Regularization figure
AlphaXiv 中文论文页面(可滚动查看)