Figure: BFA++: Hierarchical Best-Feature-Aware Token Prune for Multi-View Vision Language Action Model
AlphaXiv Chinese-language paper page (scroll to view)