POST /v4/search
) provides minimal-latency search optimized for real-time interactions. This endpoint prioritizes speed over extensive control, making it perfect for chatbots, Q&A systems, and any application where users expect immediate responses.
Basic Search
Container Tag Filtering
Filter by user, project, or organization:Threshold Control
Control result quality with similarity threshold:Reranking
Improve result quality with secondary ranking:Query Rewriting
Improve search accuracy with automatic query expansion:Include Related Content
Include documents, related memories, and summaries:Metadata Filtering
Simple metadata filtering for Memories search:Chatbot Example
Optimal configuration for conversational AI:Complete Memories Search Example
Combining features for comprehensive results:Comon Use Cases
- Chatbots: Basic search with container tag and low threshold
- Q&A Systems: Add reranking for better relevance
- Knowledge Retrieval: Include documents and summaries
- Real-time Search: Skip rewriting and reranking for maximum speed