LLM

Memory Systems in AI Assistants

Memory Systems in AI Assistants

Working, structured, and retrieval memory for assistants.

Memory turns assistants from reactive to persistent, but it is also where many systems quietly rot. Surveys argue the short-term versus long-term split is no longer enough for modern agent memory; OpenAI and LangGraph SDKs point to a simpler stack — working memory, durable state, and retrieval.

LLM Wiki - Compiled Knowledge That RAG Cannot Replace

LLM Wiki - Compiled Knowledge That RAG Cannot Replace

Compiled knowledge for AI systems

The premise is simple: compiled knowledge is more reusable than retrieved fragments. RAG became the default answer to a straightforward question - how do I give an LLM access to external knowledge?

Hermes Agent Skill Authoring — SKILL.md Structure and Best Practices

Hermes Agent Skill Authoring — SKILL.md Structure and Best Practices

Author Hermes skills that load fast and behave reliably

Hermes Agent treats skills as the default way to teach repeatable workflows. Official documentation describes them as on-demand knowledge documents aligned with the open agentskills.io shape, loaded through progressive disclosure so the model sees a small index first and only pulls full instructions when a task actually needs them.

Agent Memory Providers Compared — Honcho, Mem0, Hindsight, and Five More

Agent Memory Providers Compared — Honcho, Mem0, Hindsight, and Five More

Eight pluggable backends for persistent agent memory.

Modern assistants still forget everything when you close the tab unless something persists beyond the context window. Agent memory providers are services or libraries that hold facts and summaries across sessions — often wired in as plugins so the framework stays thin while memory scales.

AI Systems Memory — Persistent Knowledge and Agent Memory

AI Systems Memory — Persistent Knowledge and Agent Memory

Persistent knowledge beyond a single chat thread.

This section collects guides on persistent knowledge and memory for AI systems — how assistants keep facts, preferences, and distilled context across sessions without stuffing every token into one prompt. Here, memory means intentional retention (user facts, summaries, plugin-backed stores), not GPU RAM or model weights.