SheepNav
精选今天0 投票

Personalizing Embodied Multimodal Large Language Model Agents over Long-term User Interactions

arXiv:2605.26256v1 Announce Type: new Abstract: Multimodal large language model (MLLM)-based embodied agents have shown strong potential for solving complex tasks in physical environments. However, personalized assistance requires more than following generic instruction or recognizing object categories. In real-world scenarios, the intended target is often specified only implicitly through prior interactions, requiring agents to leverage personalized context accumulated over time. In this work,

延伸阅读

  1. 从3D形状到可建造砖块结构:BrickAnything 用结构感知分词技术革新生成方式
  2. LLM 能内省吗?一项现实检验
  3. 智能体记忆是数据库吗?重新思考长期AI记忆的数据基础
查看原文