Embodied AI TopConf · ICCV2025

Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding

ICCV2025 / Vision-Language-Action Model

Vision-Language-Action · Video Perception

