How Perplexity Memory Works: What It Remembers (and What It Doesn't)
You've asked Perplexity about vegan recipes four times this week, but when you open a new chat tomorrow, it's like those conversations never happened. Zero context carried forward. That was the old Perplexity. The current system is built on a completely different architecture: a two-part design where repetition signals intent, sensitive data gets filtered out automatically, and your context persists across model switches. The recall rate jumped to 95% in February 2026 by storing fewer memories and surfacing them far more reliably. Let me show you what actually gets remembered and what doesn't.
TLDR:
- Perplexity Memory stores preferences and search history across conversations, eliminating repetitive context-setting with a 95% recall rate
- Memory works cross-model, persisting your context whether you use GPT-4o, Claude, or Gemini in the same session
- The system filters sensitive data automatically and offers full deletion controls with a 30-day recovery window
- Most teams building agents underestimate memory architecture: RAG retrieves documents but can't track user preferences or handle contradictions
- Supermemory offers a memory API with 85.4% LongMemEval-S accuracy and sub-300ms retrieval for teams building context-aware agents
What Perplexity Memory Actually Is (and Why It Matters)
Every time you opened Perplexity, it forgot you existed. New conversation, blank slate, zero context. You'd mention your stack again. Explain your preferences again. Set up everything you thought you'd already said.
Perplexity Memory changes that. It retains details you share across conversations, like your interests, preferences, and recurring context, so the AI can factor them into future responses without you spelling everything out again.
This isn't a minor quality-of-life fix. When a search assistant starts knowing who you are, the interaction changes entirely. It stops feeling like a query engine and starts feeling like a knowledgeable collaborator that actually pays attention.
The Two-Part Memory System: How Perplexity Stores Your Context
Perplexity's memory system has two distinct layers that serve different purposes.
The first is Memories: explicit preferences, interests, and personal details you share. Things like your preferred coding language, dietary restrictions, or the fact that you're a founder. These get stored as structured facts and surface whenever relevant.
The second is Search History: your past queries and the answers Perplexity returned, referenced to enrich future responses.
| Layer | What Gets Stored | When It's Used |
|---|---|---|
| Memories | Preferences, interests, personal facts | Whenever relevant context is needed |
| Search History | Past queries and responses | When a new question overlaps with prior searches |
Neither layer operates in isolation. Ask about Python and it might reference your stated preference for Python 3.11. Ask a follow-up on a prior topic and it recalls that thread.
What Perplexity Remembers (with Examples)
Perplexity builds memory from patterns. Ask about gluten-free pasta three times and it files that preference away. Repetition signals intent, and intent becomes a stored fact.
Here's what sticks across real use cases:
- Dietary preferences: ask repeatedly about vegan recipes and future food recommendations skip the meat entirely
- Hobbies: frequent questions about trail running get remembered, so gear recommendations land in context
- Work context: your role, industry, and tools are all fair game if you bring them up often enough
- Active projects: reference your startup or codebase enough times and Perplexity tracks it
Frequency is the primary signal. One-off mentions may not stick. Recurring themes almost certainly will.
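The frequency signal described above can be sketched as a simple counter that promotes a topic to a stored memory only after it recurs. This is a minimal illustration, assuming a promotion threshold of three mentions; Perplexity doesn't publish its actual threshold or extraction logic.

```python
from collections import Counter

# Hypothetical threshold: three mentions before a topic becomes a memory.
PROMOTION_THRESHOLD = 3

class FrequencyMemory:
    """Toy sketch of repetition-based memory promotion."""

    def __init__(self, threshold: int = PROMOTION_THRESHOLD):
        self.threshold = threshold
        self.topic_counts = Counter()
        self.memories = set()

    def observe(self, topic: str) -> None:
        """Record one mention; promote the topic once it recurs enough."""
        self.topic_counts[topic] += 1
        if self.topic_counts[topic] >= self.threshold:
            self.memories.add(topic)

mem = FrequencyMemory()
for _ in range(3):
    mem.observe("vegan recipes")      # recurring theme: gets promoted
mem.observe("credit score")           # one-off mention: stays below threshold
```

After these observations, `"vegan recipes"` sits in `mem.memories` while `"credit score"` does not, mirroring how one-off mentions fail to stick.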
What Perplexity Doesn't Remember (and Why)
Not everything you tell Perplexity gets filed away. That's by design.
Sensitive categories like health conditions and financial details are actively filtered out. A one-time question about a medication won't become a saved memory. Neither will a question about your credit score.
There's also incognito mode, which disables memory entirely for a session. No queries logged, no preferences captured, nothing carried forward.
The core constraint worth knowing: memory favors repetition. Ask something once and it likely won't stick. Beyond the explicit safety filter, the frequency model itself screens out thin context and one-off private topics.
Cross-Model Memory Portability: Your Context Follows You Everywhere
Perplexity places memory above the model layer, not inside it. Most AI tools tie your context to a single model, so switching means starting over. Perplexity sidesteps this entirely.
Your stored preferences and search history persist regardless of which model handles the response. Jump from GPT-4o to Claude to Gemini mid-session and the memory layer stays intact. The model changes; your context doesn't.
This is a real architectural distinction. Memory decoupled from inference means you pick the best model for each task without losing continuity. Need Claude for writing? GPT for structured reasoning? Switch freely.
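The decoupling described above amounts to building the memory-enriched prompt before the model is ever chosen. Here's a minimal sketch of that pattern; the model names and call shape are illustrative placeholders, not Perplexity's internals.

```python
# Memory lives above the model layer: the same stored context is
# prepended no matter which backend answers.
MEMORIES = ["User prefers concise answers", "User is a founder"]

def build_prompt(query: str) -> str:
    """Assemble the prompt from stored memories plus the new query."""
    context = "\n".join(f"- {m}" for m in MEMORIES)
    return f"Known about this user:\n{context}\n\nQuestion: {query}"

def answer(query: str, model: str) -> str:
    # The prompt is identical regardless of which model handles it.
    prompt = build_prompt(query)
    return f"[{model}] would receive: {prompt!r}"

gpt_out = answer("Summarize RAG", model="gpt-4o")
claude_out = answer("Summarize RAG", model="claude")
```

Because the context assembly happens before model selection, switching backends mid-session changes the inference engine but never the memory the user has built up.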
The Memory Recall Upgrade: 95% Accuracy and What It Means
Perplexity shipped a memory upgrade in February 2026 for Pro and Max subscribers. Recall rate jumped from 77% to 95%, while the system stores roughly half as many memories as before.
That tradeoff is the whole point. Fewer memories, recalled more reliably. The older system captured everything indiscriminately, so retrieval got noisy. Surface enough irrelevant context and the useful signal drowns.
When the system only stores what genuinely reflects your patterns, precision goes up. A 95% recall rate means context surfaces when it should. At 77%, you'd hit enough misses to stop trusting it entirely.
Privacy Controls: What You Can Delete, Toggle, and Manage
Perplexity gives you straightforward control over what it retains. All saved memories live under Settings > Personalize > Manage Memories. Hit the trash icon next to any individual memory to remove it, or wipe everything with the clear all button.
A few things worth knowing before you start deleting:
- Deleted memories have a 30-day retention window before permanent removal, so if you clear something and change your mind, you have time to recover it.
- Toggling memory off stops new memories from forming but leaves existing ones intact.
- Incognito mode skips memory entirely for that session: nothing logged, nothing saved.
- Enterprise accounts get admin-level controls, letting teams set memory policies across users.
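The 30-day recovery window above is a classic soft-delete pattern: deleted memories move to a trash area with a timestamp and are only purged once the window expires. A minimal sketch, with illustrative names rather than Perplexity's actual internals:

```python
from datetime import datetime, timedelta

RECOVERY_WINDOW = timedelta(days=30)

class MemoryStore:
    """Soft-delete store with a fixed recovery window."""

    def __init__(self):
        self.active = {}   # memory_id -> text
        self.trash = {}    # memory_id -> (text, deleted_at)

    def delete(self, memory_id: str, now: datetime) -> None:
        """Move a memory to trash; it stays recoverable for 30 days."""
        self.trash[memory_id] = (self.active.pop(memory_id), now)

    def recover(self, memory_id: str, now: datetime) -> None:
        """Restore a trashed memory if the window hasn't expired."""
        text, deleted_at = self.trash[memory_id]
        if now - deleted_at > RECOVERY_WINDOW:
            raise KeyError("recovery window expired")
        del self.trash[memory_id]
        self.active[memory_id] = text

    def purge_expired(self, now: datetime) -> None:
        """Permanently drop anything deleted more than 30 days ago."""
        self.trash = {k: v for k, v in self.trash.items()
                      if now - v[1] <= RECOVERY_WINDOW}

store = MemoryStore()
store.active["m1"] = "prefers Python 3.11"
t0 = datetime(2026, 2, 1)
store.delete("m1", now=t0)
store.recover("m1", now=t0 + timedelta(days=10))  # inside the window: restored
```

Toggling memory off maps to simply not calling the store's write path, which is why existing entries survive the toggle untouched.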
How Retrieval Actually Works: When Memory Surfaces in Your Answers
Retrieval in Perplexity isn't passive. When you submit a query, the system scans your memory store for relevant context before generating an answer. Three signals determine what surfaces:
- Semantic relevance: how closely a stored memory relates to your current question
- Recency: newer context carries more weight
- Query context: what you're asking for in this moment
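The three signals above can be combined into a single retrieval score. This is a demonstration sketch: the weights and the recency half-life are assumptions chosen for illustration, since Perplexity doesn't publish its scoring formula.

```python
import math

def score_memory(semantic_sim: float, age_days: float,
                 query_overlap: float,
                 half_life_days: float = 30.0) -> float:
    """Blend semantic relevance, recency, and query context into one score.

    Recency decays exponentially: a memory loses half its recency
    weight every `half_life_days`. Weights are illustrative.
    """
    recency = math.exp(-math.log(2) * age_days / half_life_days)
    return 0.6 * semantic_sim + 0.25 * recency + 0.15 * query_overlap

# Two equally relevant memories: the fresher one should win.
fresh = score_memory(semantic_sim=0.9, age_days=1, query_overlap=0.8)
stale = score_memory(semantic_sim=0.9, age_days=180, query_overlap=0.8)
```

With everything else equal, `fresh` outscores `stale`, which is how recency breaks ties between equally relevant memories.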
What makes this unusually transparent is the sourcing. Referenced memories appear directly in responses, labeled just like web citations. You can see exactly which stored facts shaped the answer, unlike black-box approaches where memory influence is invisible. If the system pulls the wrong memory, you'll catch it immediately and delete it.
Why Memory Architecture Matters for AI Applications
Perplexity's memory system works well for search. Building one from scratch is a different problem entirely.
Most teams reach for RAG first. RAG retrieves documents. It doesn't remember. There's no user profile, no preference tracking, no temporal reasoning. When context contradicts itself or evolves, RAG has no mechanism to handle that.
Real memory architecture requires four distinct layers working together: working memory for active session context, episodic memory for past interactions, semantic memory for persistent facts, and procedural memory for learned behaviors. Miss any one and the system degrades.
That's the gap most teams underestimate.
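The four-layer split above can be sketched as a single data structure; the field names and example values here are illustrative, not any particular vendor's schema.

```python
from dataclasses import dataclass, field

@dataclass
class AgentMemory:
    """Toy sketch of the four memory layers an agent needs."""
    working: list = field(default_factory=list)     # active session context
    episodic: list = field(default_factory=list)    # records of past interactions
    semantic: dict = field(default_factory=dict)    # persistent facts and preferences
    procedural: dict = field(default_factory=dict)  # learned behaviors and styles

agent_mem = AgentMemory()
agent_mem.working.append("user asked about Python 3.11 typing")
agent_mem.episodic.append({"session": 1, "summary": "debugged a Flask app"})
agent_mem.semantic["preferred_language"] = "Python"
agent_mem.procedural["answer_style"] = "show code first, prose second"
```

The point of the split is that each layer has a different lifetime and update rule: working memory resets per session, episodic memory only appends, semantic facts get overwritten when preferences change, and procedural patterns update slowly. A RAG document index gives you none of these.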
Beyond Search: Building AI Agents That Actually Remember You
Perplexity solved memory for search. If you're building your own agent, you're starting from scratch.
That's what Supermemory is built for: a memory API for AI agents that gives your application the same context continuity Perplexity ships as a product feature, exposed as infrastructure you own. Sub-300ms retrieval, a memory graph that tracks relationships between facts instead of just similarity scores, and user profiles built automatically from behavior.
The benchmark numbers are real: 85.4% overall accuracy on LongMemEval-S, 92.3% on single-session user memory, 76.7% on multi-session recall. On LoCoMo, we rank first.
Final Thoughts on AI Memory Systems
How Perplexity memory works reveals what separating memory from models actually unlocks: context that persists regardless of which AI handles the response. Building that yourself means solving retrieval, conflict resolution, and temporal reasoning before you ship anything users see. We built Supermemory to give you that entire stack as API calls. Try it in your agent and ship memory in days instead of quarters.
FAQ
Perplexity Memory vs Supermemory: what's the difference?
Perplexity Memory is a product feature built into their search assistant, while Supermemory is a memory API you can integrate into your own AI applications. Perplexity solves memory for search; Supermemory gives you the infrastructure to build agents that remember context across sessions, with sub-300ms retrieval and a memory graph that tracks relationships between facts.
Can I delete specific memories in Perplexity without wiping everything?
Yes. Go to Settings > Personalize > Manage Memories and hit the trash icon next to any individual memory you want removed. Deleted memories have a 30-day retention window before permanent removal, so you can recover them if you change your mind.
How does Perplexity decide what to remember from my conversations?
Repetition is the primary signal. Ask about gluten-free recipes three times and it files that preference away; mention something once and it likely won't stick. Perplexity also actively filters out sensitive categories like health conditions and financial details, regardless of how often you mention them.
What's the fastest way to build memory into my AI agent without rebuilding from scratch?
Use a memory API like Supermemory instead of trying to build on RAG alone. RAG retrieves documents but doesn't remember users, track evolving context, or handle contradictions. Real memory architecture requires working memory, episodic memory, semantic memory, and procedural memory working together. Supermemory ships all four layers with 85.4% accuracy on LongMemEval-S benchmarks.
Does Perplexity's memory work across different AI models?
Yes. Perplexity decouples memory from the model layer, so your stored preferences and search history persist whether you're using GPT-4o, Claude, or Gemini. Switch models mid-session and your context stays intact: the model changes but your memory doesn't.