An agent memory system with biomimetic architecture that enables AI to learn from experience via reflection, featuring multi-strategy retrieval and top performance on LongMemEval benchmark, released under MIT license.
Core Philosophy#
Hindsight is an agent memory system that enables agents to truly "learn" from experience via reflection, rather than merely recalling conversation history. Traditional agent memory systems focus on storing and retrieving conversation history (RAG), leaving agents able to "remember" but not "grow." Hindsight addresses this through biomimetic architecture.
Biomimetic Memory Architecture#
Memory is organized into three tiers, simulating human cognitive structure:
- World: Stores objective facts (e.g., "Alice works at Google"), analogous to declarative memory
- Experiences: Records the agent's own interaction processes and event streams, analogous to episodic memory
- Mental Models: Abstract understanding extracted from Experiences and World via the reflect operation - the key to achieving "learning"
Core Operations#
- Retain: Ingests unstructured content and categorizes it into a specified Bank
- Recall: Activates relevant memories using multi-strategy retrieval based on query
- Reflect: Triggers reasoning mechanism to analyze existing memories and generate new insights or update Mental Models
Multi-Strategy Retrieval#
Built-in composite retrieval logic combining:
- Semantic similarity search
- Exact keyword matching
- Entity relationship graph queries
- Temporal dimension retrieval
Quick Start#
from hindsight_client import Hindsight
client = Hindsight(base_url="http://localhost:8888")
# Store information
client.retain(bank_id="my-bank", content="Alice works at Google")
# Retrieve memories
results = client.recall(bank_id="my-bank", query="What does Alice do?")
# Generate insights
client.reflect(bank_id="my-bank", query="Tell me about Alice")
Ideal Use Cases#
- Long-running AI employees (accumulating industry knowledge or company processes)
- Adaptive interaction (adjusting behavior based on user feedback)
- Complex task automation (maintaining state and experience over long cycles)
- Per-user personalization (remembering specific user preferences)
Performance#
Achieves SOTA results on LongMemEval benchmark.
Development Resources#
- GitHub Stars: 1.5k+
- License: MIT License
- SDK Support: Python, Node.js/TypeScript
⚠️ Note: The input materials do not provide specific GitHub repository URL or official documentation links. Deployment method (Docker/binary/library) and underlying storage dependencies (vector database) are pending verification.