Automatic Content Intelligence
When you add content, Supermemory:- Detects the content type — PDF, code, markdown, images, video, etc.
- Extracts content optimally — Uses type-specific extraction (OCR for images, transcription for audio)
- Chunks intelligently — Applies the right chunking strategy for the content type
- Generates embeddings — Creates vector representations for semantic search
- Builds relationships — Connects new knowledge to existing memories
Smart Chunking by Content Type
Different content types need different chunking strategies. Supermemory applies the optimal approach automatically:Documents (PDF, DOCX)
PDFs and documents are chunked by semantic sections — headers, paragraphs, and logical boundaries. This preserves context better than arbitrary character splits.Code
Code is chunked using code-chunk, our open-source library that understands AST (Abstract Syntax Tree) boundaries:- Functions and methods stay intact
- Classes are chunked by method
- Import statements grouped separately
- Comments attached to their code blocks
Web Pages
URLs are fetched, cleaned of navigation/ads, and chunked by article structure — headings, paragraphs, lists.Markdown
Chunked by heading hierarchy, preserving the document structure. See Content Types for the full list of supported formats.Hybrid Memory + RAG
Supermemory combines the best of both approaches in every search:Traditional RAG
- Finds similar document chunks
- Great for knowledge retrieval
- Stateless — same results for everyone
Memory System
- Extracts and tracks user facts
- Understands temporal context
- Personalizes results per user
searchMode: "hybrid" (the default), you get both:
Search Optimization
Two flags give you fine-grained control over result quality:Reranking
Re-scores results using a cross-encoder model for better relevance:Query Rewriting
Expands your query to capture more relevant results:Why It’s “Super”
| Traditional RAG | SUPER RAG |
|---|---|
| Manual chunking config | Automatic per content type |
| One-size-fits-all splits | AST-aware code chunking |
| Just document retrieval | Hybrid memory + documents |
| Static embeddings | Relationship-aware graph |
| Generic search | Rerank + query rewriting |
Next Steps
Content Types
All supported formats and how they’re processed
How It Works
The full processing pipeline
Memory vs RAG
When to use each approach
Search
Search parameters and optimization