All articles
Naman Bansal
AUTHOR

Naman Bansal

Writer and editor at Supermemory. Covering AI memory, context engineering, and the future of intelligent systems for developers and builders.

10 posts

Supermemory blog banner titled "Matryoshka Representation Learning: The Ultimate Guide" with three nested Russian dolls
Learning ·

Matryoshka Representation Learning: The Ultimate Guide & How We Use It

Embeddings are the cornerstone of any retrieval system. And the larger the embeddings, the more information they can store. But large embeddings require a lot of memory, which leads to high computational costs and latency. To reduce this high cost, we can use models that produce embeddings with sm

Naman Bansal 8 min read
Blog cover banner reading "How Montra Uses Supermemory To Rethink Video Creation" with camera lens and brain icons
Case Study ·

Never Record Again: How Montra Uses Supermemory to Rethink Video Creation

Campbell Baron, the founder of Montra, has been making videos since he was twelve. By thirteen, he was already doing brand work. Today, he’s betting on a very different future for creators: a world where recording is the exception, and most videos are generated from scratch. Montra’s vision is bold

Naman Bansal 5 min read
Banner reading "Build Perplexity With Supermemory in 15 Minutes" with a retro computer and Supermemory x Perplexity logos
Learning ·

Build Your Own Perplexity in 15 Minutes With Supermemory

Supermemory has a fascinating open-source tool called OpenSearchAI. It's essentially a search assistant similar to Perplexity, but it remembers everything you've searched for and enriches future responses with that memory. I thought to myself, “This seems cool. But how complicated is it to build so

Naman Bansal 10 min read
Banner reading "Chat With All Your Docs Ft. Supermemory" with a blue file folder and Supermemory x Drive logos
Learning ·

Building an AI Compliance Chatbot With Supermemory and Google Drive

Contract compliance reviews are a serious drain on time and focus. It’s a repetitive process that takes away from actual legal thinking, and the workflow is absolutely broken. Files live in different places. You’re never sure if you’re reading the latest version. And no one has time to manually tra

Naman Bansal 15 min read
Blog banner reading "Knowledge Graph For RAG" with a 3D node network and a Step-By-Step Tutorial tag
Learning ·

Knowledge Graph For RAG: Step-by-Step Tutorial

If you’ve ever built a retrieval-augmented generation (RAG) system using embeddings and vector databases, you already know the drill: you turn your data into vectors, stuff them into a store like FAISS, and let your model retrieve similar chunks during inference. And it works, until it doesn’t. W

Naman Bansal 12 min read
Blog banner reading "How To Extend Context Window In LLMs" with blue 3D blocks labeled 100,000,000 tokens
Learning ·

2 Approaches For Extending Context Windows in LLMs

Transformer-based large language models have become the poster boys of modern AI, yet they still share one stark limitation: a finite context window. Once that window overflows, performance drops like a rock or the model forgets key details. This guide walks through two complementary strategies tha

Naman Bansal 9 min read
Supermemory banner reading "LLM Cost Optimization For SaaS - Real Experts Weigh In" with a wojak meme at a retro PC
Learning ·

LLM Costs Skyrocketing? Real Experts Weigh In

In this blog, we're gonna walk through a fictional story, while learning how to optimize LLMs for cost, and the associated tradeoffs. Tuesday, 10 June, 2:14 PM PST The billing alert hit. I was halfway through a product demo, nodding along to myself on Zoom, saying something vaguely confident about

Naman Bansal 9 min read
Supermemory blog banner reading "Best Open-Source Embedding Models Benchmarked And Ranked" with a layered blue database cylinder
Learning ·

Best Open-Source Embedding Models Benchmarked and Ranked

If your AI agent is returning the wrong context, it’s probably not your LLM, but your embedding model. Embeddings are the hidden engine behind retrieval-augmented generation (RAG) and memory systems. The better they are, the more relevant your results, and the smarter your app feels. But here’s the

Naman Bansal 9 min read
Supermemory banner reading "3 Ways To Build LLMs With Long-Term Memory" beside a blue robotic arm and chip
Learning ·

3 Ways To Build LLMs With Long-Term Memory

You’ve already met our guide on implementing short-term conversational memory using LangChain, which is great for managing context inside a single chat window. But life, therapy, and enterprise apps sprawl across days, weeks, and years. If our agents are doomed to goldfish-brain amnesia, users end

Naman Bansal 13 min read
Banner reading "How To Add Conversational Memory To LLMs Using LangChain" with chat, brain icons and a sticky note
Learning ·

How To Add Conversational Memory To LLMs Using LangChain

Chatbots that don’t remember conversations are very frustrating to work with. Users treat AI like a human and expect it to remember. LangChain recently migrated to LangGraph, a new stateful framework for building multi-step, memory-aware LLM apps. So while the docs might still say “LangChain memory

Naman Bansal 21 min read