EndoVLA: Dual-Phase Vision-Language-Action for Precise Autonomous Tracking in Endoscopy figure
AlphaXiv 中文论文页面(可滚动查看)