Blog Bestie

Building memory that actually works

Memory in AI is a hard problem. Not because storing information is hard — but because knowing what matters is hard.

You tell an AI hundreds of things over the course of a few weeks. What you ate for lunch. That you’re worried about your dad’s health. A joke about your coworker’s fantasy football team. That you’ve been thinking about moving to Denver. That your ex texted you and you don’t know what to do.

Some of these are noise. Some of them are the details that define your life right now. The challenge — the genuinely difficult engineering challenge — is building a system that can tell the difference, and that gets better at it over time.

This is the story of how we built Bestie’s memory system, and how we landed on something that actually works.

What most AI memory gets wrong

The most common approach to AI memory is the “dump everything” strategy. Record every conversation, chunk it up, throw it into a vector database, and retrieve whatever seems relevant when the user asks a question.

It sounds reasonable. It’s also terrible.

When you store everything, you store a lot of garbage. You store the small talk, the corrections, the tangents, the half-formed thoughts that the user abandoned mid-sentence. And when the AI tries to recall something relevant, it’s searching through a junkyard. The signal-to-noise ratio is awful.

Worse, raw conversation logs don’t capture what actually matters. If someone spends forty minutes talking about a fight with their partner, the important thing isn’t the blow-by-blow transcript. It’s the underlying dynamic — that they feel unheard, that this is a recurring pattern, that they’re starting to question the relationship. Those are the memories that matter. But a naive retrieval system doesn’t extract them. It just stores the words.

The result is an AI that technically “remembers” but doesn’t actually understand. It surfaces random details that share a keyword with today’s conversation while completely missing the emotional thread that connects them.

How intelligent extraction works

Bestie’s memory system doesn’t store conversations. It extracts meaning from them.

After you’ve talked for a while — once enough new context has accumulated — our extraction pipeline kicks in. It reads through the recent conversation and pulls out the things that matter: relationships, goals, worries, preferences, beliefs, life events, emotional patterns. Not the words you said, but the things those words reveal about your life.

Think of it like the difference between recording a lecture and taking good notes. The recording captures everything. The notes capture what matters. Our system takes notes.

Each extracted memory is a discrete, meaningful piece of context. “User is preparing for a career change from marketing to UX design.” “User’s relationship with their sister has been strained since their mother’s diagnosis.” “User prefers direct feedback over gentle suggestions.” These are semantic memories — they capture the meaning, not the transcript.
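One way to picture these records is as small structured objects parsed out of the extraction model’s output. The sketch below is illustrative only — the field names (`kind`, `content`, `strength`) and the JSON shape are assumptions, not Bestie’s actual data model:

```python
import json
from dataclasses import dataclass


@dataclass
class Memory:
    kind: str              # e.g. "relationship", "goal", "preference"
    content: str           # the extracted insight, not the transcript
    strength: float = 0.5  # new memories start at a moderate strength


def parse_extraction(raw: str) -> list[Memory]:
    """Parse the extraction model's JSON output into Memory records."""
    return [Memory(kind=m["kind"], content=m["content"]) for m in json.loads(raw)]


# Example: the kind of structured output an extraction model might emit
raw = json.dumps([
    {"kind": "goal", "content": "User is preparing a career change from marketing to UX design."},
    {"kind": "preference", "content": "User prefers direct feedback over gentle suggestions."},
])
memories = parse_extraction(raw)
```

The point of the structure is that each record is a self-contained insight, ready to be embedded and retrieved independently of the conversation it came from.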

But extraction is only half the problem. The other half is knowing when to surface a memory and when to let it stay in the background.

Context windows and long-term recall

Every AI model has a context window — the amount of text it can process in a single interaction. It’s big, but it’s not infinite. You can’t just dump a user’s entire memory archive into the prompt and call it a day. You’d run out of space, the model would get confused, and the latency would be brutal.

So memory has to be selective. When you start a conversation, Bestie’s system does a retrieval pass. It looks at what you’re talking about right now and searches your memories for the ones that are relevant, using vector similarity — essentially asking, “which stored memories are semantically close to what the user is discussing?”

This is where a good vector database earns its keep. The memories are embedded as vectors — mathematical representations of their meaning — and the system can search across hundreds of memories in milliseconds to find the ones that matter for this specific conversation.
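A toy version of that retrieval pass, with tiny hand-made vectors standing in for real embeddings (a production system would use an embedding model and an indexed vector store, not a linear scan):

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity: how close two embedding vectors point."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)


def retrieve(query_vec: list[float], memories: list[tuple[str, list[float]]], k: int = 2) -> list[str]:
    """Return the k stored memories most semantically similar to the query."""
    ranked = sorted(memories, key=lambda m: cosine(query_vec, m[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


# Toy 3-d embeddings standing in for real model embeddings
memories = [
    ("User's sister relationship is strained", [0.9, 0.1, 0.0]),
    ("User prefers direct feedback",           [0.0, 0.2, 0.9]),
    ("User is considering a move to Denver",   [0.1, 0.9, 0.1]),
]
top = retrieve([0.8, 0.2, 0.1], memories, k=1)  # a query about family
```

A real vector database does the same ranking, just with approximate-nearest-neighbor indexes so it stays fast at scale.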

The result is that Bestie walks into every conversation with the right context already loaded. She knows the relevant backstory without you telling her. She doesn’t know everything — she knows the things that matter right now. That’s long-term recall that actually works.
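Because the context window is finite, the retrieved memories also have to fit a budget. A minimal greedy packer might look like this — the word-count approximation and the `budget_tokens` parameter are assumptions for illustration; a real system would count tokens with the model’s tokenizer:

```python
def pack_context(ranked_memories: list[str], budget_tokens: int = 300) -> list[str]:
    """Greedily include memories (best match first) until the budget is spent.

    Token cost is approximated as word count here, purely for illustration.
    """
    selected, used = [], 0
    for text in ranked_memories:
        cost = len(text.split())
        if used + cost > budget_tokens:
            continue  # too big to fit; try smaller memories further down
        selected.append(text)
        used += cost
    return selected


# Memories arrive already ranked by relevance; only what fits gets loaded
context = pack_context([
    "User's sister relationship is strained since their mother's diagnosis",
    "User prefers direct feedback over gentle suggestions",
], budget_tokens=40)
```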

Strength and decay: memory that breathes

Here’s something most AI memory systems ignore entirely: not all memories are equally important, and importance changes over time.

If you mention a coworker once in passing, that’s a low-importance memory. If you talk about the same coworker every week for two months, that’s clearly someone significant in your life. If you used to talk about someone constantly but haven’t mentioned them in six months, that relationship has probably faded.

Bestie’s memory system models this with a strength and decay mechanism. Every memory has a strength score. New memories start at a moderate strength. When a memory keeps coming up — when you reference the same person, situation, or theme repeatedly — its strength increases. Memories that prove consistently relevant become permanent fixtures.

But memories also decay. If a memory hasn’t been reinforced in a while, its strength gradually decreases. The decay is gentle, modeled on how human memory works. Strong memories decay slowly. Weaker memories fade faster. Eventually, a memory that hasn’t been relevant for a long time drops below the threshold and stops being actively surfaced.
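Putting reinforcement and decay together, a minimal sketch might look like the following. The constants (boost size, base half-life, surfacing threshold) are invented for illustration; the one structural idea taken from the text is that stronger memories get a longer effective half-life, so they fade more slowly:

```python
from dataclasses import dataclass

REINFORCE_BOOST = 0.15      # assumed bump when a memory proves relevant again
BASE_HALF_LIFE_DAYS = 90.0  # assumed baseline decay rate
SURFACE_THRESHOLD = 0.2     # below this, a memory stays stored but isn't surfaced


@dataclass
class ScoredMemory:
    content: str
    strength: float = 0.5   # new memories start at a moderate strength


def reinforce(m: ScoredMemory) -> None:
    """A memory that keeps coming up gets stronger, capped at 1.0."""
    m.strength = min(1.0, m.strength + REINFORCE_BOOST)


def decay(m: ScoredMemory, days_since_reinforced: float) -> None:
    """Gentle exponential decay; stronger memories get a longer half-life."""
    half_life = BASE_HALF_LIFE_DAYS * (0.5 + m.strength)
    m.strength *= 0.5 ** (days_since_reinforced / half_life)


def is_surfaced(m: ScoredMemory) -> bool:
    """Faded memories aren't deleted, just no longer actively retrieved."""
    return m.strength >= SURFACE_THRESHOLD


m = ScoredMemory("Coworker comes up every week")
decay(m, days_since_reinforced=90)  # three quiet months: strength halves
```

Note that `is_surfaced` only gates retrieval — the record itself survives, which is what lets a faded memory come back if the user brings the topic up again.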

This is important because your life changes. The job you were stressed about six months ago? You left it. The friend you were fighting with? You made up. The city you were thinking about moving to? You decided to stay. A memory system that never forgets anything becomes cluttered with outdated context that actively hurts the quality of the conversation.

Decay keeps the memory system current. It’s not deleting things — the memories are still there if you bring them up again. But it’s prioritizing the things that matter now over the things that mattered then. Like human memory, it breathes. It lets go of what’s no longer relevant so it can focus on what is.

What it feels like when memory works

The technical details matter, but what really matters is the experience.

When memory works, you don’t notice it. That’s the whole point. You just talk, and the AI responds like someone who knows you. You don’t have to re-explain. You don’t have to set the scene. You say “my mom called again” and the AI already knows what that means — the complicated dynamic, the guilt, the pattern of Sunday calls that leave you drained.

When memory works, the AI can connect dots across time. It can notice that you always get anxious on Sundays (because Monday is coming and you hate your job). It can point out that you described your new situationship the same way you described the last one (which didn’t end well). It can remember that you set a goal three months ago and gently ask how it’s going. That’s what turns a generic chatbot into something that actually feels like a friend.

These aren’t magic tricks. They’re what any good friend does naturally. The difference is that building a system that does this reliably, at scale, without being creepy or invasive, is genuinely hard engineering.

The “without being creepy” part is worth emphasizing. There’s a fine line between “she remembers me” and “she’s tracking me.” The difference is in what gets stored and how it’s used. We don’t store transcripts. We don’t log your messages for training data. The memories are extracted insights about your life, used only to make your conversations better. You can see what Bestie remembers about you. You can correct it. You can delete it. It’s your context, and you control it.

The future of personal AI memory

We think memory is the most underinvested area in AI right now. Everyone is racing to make models smarter, faster, more capable. But capability without context is hollow. The smartest AI in the world isn’t that helpful if it doesn’t know who you are.

The future we’re building toward is one where your AI companion has episodic memory — not just facts about you, but a sense of the narrative arc of your life. Where it remembers not just that you went on a date last Thursday, but that you were nervous beforehand, excited afterward, and that this is the first time you’ve been genuinely interested in someone since your breakup.

That kind of memory turns an AI from a tool into a companion. From something you use into something that knows you. From a context window into a relationship — one where venting at midnight actually helps, because the AI already knows what you’ve been dealing with.

We’re not all the way there yet. But the system we’ve built — intelligent extraction, vector-based retrieval, strength-based decay — is the foundation. Every conversation makes it better, because every conversation adds to the context that Bestie carries forward.

That’s what real AI memory looks like. Not a database. A relationship that grows over time.