EverMemOS Hits SOTA Performance on LoCoMo

EverMind researchers
Published at September 30, 2025
About 3 minutes to read
#SOTA #LoCoMo #long-term memory

EverMemOS is an intelligent memory operating system designed to give AI the ability not just to remember, but to understand, reason, and evolve. On the LoCoMo benchmark, our approach built upon EverMemOS achieved a 92.3% reasoning accuracy (evaluated by LLM-Judge), outperforming comparable methods in our internal evaluation.

EverMemOS Hits SOTA Performance on LoCoMo

Most existing AI systems suffer from contextual forgetfulness — they can handle one conversation perfectly but fail to connect it with what came before. As AI becomes a long-term collaborator instead of a short-term assistant, such amnesia limits its capacity for real understanding and continuity. Our motivation in creating EverMemOS was simple yet ambitious: To transform AI memory from a passive data store into an active cognitive layer — one that captures meaning, maintains coherence, and builds a long-term understanding of users and tasks. We believe true intelligence requires the ability to remember with purpose and act with insight.

At its core, EverMemOS integrates layered memory extraction, structured knowledge organization, and adaptive retrieval mechanisms, enabling AI to build narratives from scattered interactions and make reasoning decisions grounded in accumulated understanding. These design principles come together in what we call our Unique Advantages — the foundation that makes EverMemOS not just a system of memory, but a system of foresight. Our unique advantages lie in three key dimensions that redefine how AI perceives, connects, and grows through memory.

  1. 🔗 Coherent Narrative – Beyond fragments, connecting stories. EverMemOS automatically links conversation pieces to form clear thematic contexts, enabling AI to truly understand. It naturally distinguishes between threads such as “Project A progress discussions” and “Team B strategy planning”, maintaining coherent logic within each theme. From scattered phrases to complete narratives, AI no longer just “understands one sentence” — it “understands the whole story.”
  2. 🧠 Evidence-Based Perception – Beyond retrieval, towards intelligent perception. EverMemOS proactively captures deep relationships between memories and tasks, allowing AI to “think thoroughly” at critical moments. For instance, when a user asks for food recommendations, the system recalls that “you had dental surgery two days ago” and adapts its suggestions accordingly. This is Contextual Awareness — intelligence built on understanding rather than isolated responses.
  3. 💾 Living Profiles – Beyond records, towards dynamic growth. EverMemOS continuously updates user profiles in real time, learning from every interaction to know users more authentically. Over time, preferences, tone, and focus areas evolve naturally. The system doesn’t just “remember what you said,” it’s actively “learning who you are.”

Based on the EverMemOS framework and the GPT-4.1-mini model, we evaluated long-context memory reasoning performance on the LoCoMo dataset. According to assessments by LLM-Judge, our approach achieved a reasoning accuracy of 92.3%, outperforming comparable methods. The detailed experimental results are shown in the table below.

table