Per-user memory graph
Auto-built profiles and fact hierarchies, so agents learn in real time.
2× cheaper than next-best, with better quality.
Free to get started. Pay only for what you use, with your own spend limits.
For builders tinkering with prototypes and side projects.
For small teams and plugin power-users.
More headroom for developers who need it.
For teams running production workloads.
Used by the best teams
Usage-based primitives for storing, extracting, retrieving, and operating on agent context. Pay only for what you use.
Multimodal extract, chunk, and retrieve. No embeddings, no vectors.
Semantic search and graph traversal across everything you've stored.
Built for agent loops.
Re-rank, aggregate, and rewrite queries as building blocks.
Composable building blocks for richer queries.
Qualifying early-stage startups and academic research projects get production-grade memory infrastructure, free. Ship the agent, we'll cover the context.
up to $2,000 in free credits
Dedicated support · 3 months to build · all features unlocked
Every plan comes with a monthly dollar balance. Each API call — storing memories, searching, indexing — draws from that balance at the rates listed above. When the balance runs out, you can top up or auto top-up to keep going. No surprise bills at the end of the month.
SM tokens are the unique tokens Supermemory actually ingests and embeds; repeats and unchanged content are not billed again. That is effectively a 100% discount on what a normal prompt cache would re-charge you for. Plain text costs $0.005 per 1K SM tokens; rich content (PDFs, audio, video) costs $0.010 per 1K SM tokens because it needs heavier extraction.
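As a rough worked example of the rates above, the arithmetic can be sketched in Python. Only the two per-1K rates come from this page; the function name and the choice of `Decimal` are illustrative assumptions, not part of any Supermemory SDK.

```python
from decimal import Decimal

# Rates quoted on this page, in dollars per 1,000 SM tokens.
PLAIN_RATE_PER_1K = Decimal("0.005")
RICH_RATE_PER_1K = Decimal("0.010")


def estimate_ingest_cost(sm_tokens: int, rich: bool = False) -> Decimal:
    """Estimate the dollar cost of ingesting `sm_tokens` net-new tokens.

    Hypothetical helper for illustration only; not a Supermemory API.
    """
    if sm_tokens < 0:
        raise ValueError("sm_tokens must be non-negative")
    rate = RICH_RATE_PER_1K if rich else PLAIN_RATE_PER_1K
    return rate * sm_tokens / 1000


# One million net-new plain-text tokens, then 250K tokens of rich content.
print(estimate_ingest_cost(1_000_000))
print(estimate_ingest_cost(250_000, rich=True))
```

At these rates, one million net-new plain-text SM tokens come to five dollars, and the same volume of rich content to ten.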
Nothing: you're not billed again. Supermemory deduplicates at the token level, so re-uploading a doc, syncing a connector, or pushing the same conversation history won't draw from your balance again. Only net-new content counts as SM tokens. This is why production agents that loop over the same context end up an order of magnitude cheaper here than with a typical vector DB.
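To make "only net-new content counts" concrete, here is a minimal sketch of the idea. It does not show how Supermemory deduplicates internally: paragraph-level SHA-256 hashes and whitespace-split words are stand-ins chosen for illustration, and the class name is hypothetical.

```python
import hashlib


class NetNewCounter:
    """Counts only content that has not been seen before.

    Illustrative only: whitespace-split words stand in for SM tokens and
    paragraph hashes stand in for the real deduplication mechanism.
    """

    def __init__(self) -> None:
        self._seen: set[str] = set()

    def billable_tokens(self, document: str) -> int:
        billable = 0
        for paragraph in document.split("\n\n"):
            text = paragraph.strip()
            if not text:
                continue
            digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
            if digest in self._seen:
                continue  # already stored, so nothing is counted again
            self._seen.add(digest)
            billable += len(text.split())
        return billable


counter = NetNewCounter()
doc = "Alice prefers dark mode.\n\nAlice lives in Lisbon."
first = counter.billable_tokens(doc)    # everything is new
second = counter.billable_tokens(doc)   # identical re-upload
third = counter.billable_tokens(doc + "\n\nAlice owns a cat.")  # one new paragraph
print(first, second, third)
```

In this toy model the re-upload counts nothing, and the third call counts only the added paragraph.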
Subscription credits reset monthly. Top-up credits you purchase in advance never expire — they sit in your balance until you use them.
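The two kinds of credit can be modeled as separate pools. The sketch below is a hypothetical model, not Supermemory's billing code: it assumes "reset" means the subscription pool is restored to the plan's allowance each month, and it assumes subscription credits are spent before top-up credits, which this page does not specify. The dollar amounts are made up.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass
class Balance:
    """Hypothetical model of the two credit pools described above."""

    monthly_allowance: Decimal            # plan credits, restored each month
    subscription: Decimal = Decimal("0")  # what is left of this month's credits
    top_up: Decimal = Decimal("0")        # purchased credits, never expire

    def start_new_month(self) -> None:
        # Unused subscription credits do not carry over; top-ups are untouched.
        self.subscription = self.monthly_allowance

    def charge(self, amount: Decimal) -> bool:
        """Deduct `amount`; return False (and change nothing) if short."""
        if amount > self.subscription + self.top_up:
            return False
        # ASSUMPTION: subscription credits are spent before top-up credits.
        from_subscription = min(amount, self.subscription)
        self.subscription -= from_subscription
        self.top_up -= amount - from_subscription
        return True


balance = Balance(monthly_allowance=Decimal("20"), top_up=Decimal("10"))
balance.start_new_month()
balance.charge(Decimal("25"))  # uses all 20 subscription credits, then 5 of top-up
balance.start_new_month()      # subscription restored; remaining top-up kept
print(balance.subscription, balance.top_up)
```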
On Free, you'll be paused: pay-as-you-go isn't available on Free, so you'll need to upgrade to a paid plan to keep going. On Pro, Max and Scale, auto top-up kicks in to keep your app running. You can set hard spend caps on Scale to prevent runaway usage.
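The per-plan behavior above can be summarized as a small decision function. This is a sketch for readers, not an API: the names are hypothetical, and the assumption that reaching a Scale hard cap pauses usage (rather than triggering a top-up) is an inference from the wording, not something this page states.

```python
from enum import Enum
from typing import Optional


class Plan(Enum):
    FREE = "Free"
    PRO = "Pro"
    MAX = "Max"
    SCALE = "Scale"


def on_balance_exhausted(
    plan: Plan,
    month_spend: float = 0.0,
    hard_cap: Optional[float] = None,
) -> str:
    """Return "paused" or "auto_top_up" for a plan whose balance hit zero."""
    if plan is Plan.FREE:
        return "paused"  # no pay-as-you-go on Free; upgrade to continue
    if plan is Plan.SCALE and hard_cap is not None and month_spend >= hard_cap:
        # ASSUMPTION: a reached hard cap pauses usage instead of topping up.
        return "paused"
    return "auto_top_up"


for plan in Plan:
    print(plan.value, on_balance_exhausted(plan))
```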
Self-hosted deployments are available on Scale and Enterprise. Enterprise additionally supports fully air-gapped deployments (LLM inference may be the only outbound dependency, unless GPUs are available).
Yes — qualifying early-stage startups and academic research teams get up to $2,000 in credits, dedicated support, and 3 months to build. Apply via the Startup Program link above.
Still something on your mind?
Get $5 in monthly credits. No credit card required.