2D or 3D: Who Governs Salience in VLA Models? -- Tri-Stage Token Pruning Framework with Modality Salience Awareness figure
AlphaXiv 中文论文页面(可滚动查看)