KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition