by savantskie
A persistent local memory for AI, LLMs, or Copilot in VS Code.
# Add to your Claude Code skills

```bash
git clone https://github.com/savantskie/persistent-ai-memory
```

Community Call to Action: Have you made improvements or additions to this system? Submit a pull request! Every contributor will be properly credited in the final product.
GitHub: https://github.com/savantskie/persistent-ai-memory.git
Major Architectural Rewrite: OpenWebUI-Native Integration
Upgrade from v1.1.0: See CHANGELOG.md for migration guide.
Choose your starting point:
| I want to... | Read this | Time |
|---|---|---|
| Get started quickly | REDDIT_QUICKSTART.md | 5 min |
| Install the system | INSTALL.md | 10 min |
| Understand configuration | CONFIGURATION.md | 15 min |
| Check system health | TESTING.md | 10 min |
| Use the API | API.md | 20 min |
| Deploy to production | DEPLOYMENT.md | 15 min |
| Fix a problem | TROUBLESHOOTING.md | varies |
| See examples | examples/README.md | 15 min |
```bash
# Linux/macOS
pip install git+https://github.com/savantskie/persistent-ai-memory.git

# Windows (same command, just use Command Prompt or PowerShell)
pip install git+https://github.com/savantskie/persistent-ai-memory.git
```
```bash
python tests/test_health_check.py
```

Expected output:

```
[✓] Imported ai_memory_core
[✓] Found embedding_config.json
[✓] System health check passed
[✓] All health checks passed! System is ready to use.
```
Persistent AI Memory provides sophisticated memory management for AI assistants:
```
~/.ai_memory/
├── conversations.db     # Chat messages and conversation history
├── ai_memories.db       # Curated long-term memories
├── schedule.db          # Appointments and reminders
├── mcp_tool_calls.db    # Tool usage logs and reflections
└── vscode_project.db    # Development session context
```
```
~/.ai_memory/
├── embedding_config.json   # Embedding provider setup
└── memory_config.json      # Memory system defaults
```
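A quick way to see what is currently configured is to read these files directly. A minimal sketch using only the standard library and the paths from the tree above:

```python
import json
from pathlib import Path

# Location of the memory system's config files, per the layout above
config_dir = Path.home() / ".ai_memory"

for name in ("embedding_config.json", "memory_config.json"):
    path = config_dir / name
    if path.exists():
        # Pretty-print the current settings
        print(name, "->", json.dumps(json.loads(path.read_text()), indent=2))
    else:
        print(name, "-> not present yet")
```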
- `store_memory()` - Save important information persistently
- `search_memories()` - Find memories using semantic search
- `list_recent_memories()` - Get recent memories without searching
- `store_conversation()` - Store user/assistant messages
- `search_conversations()` - Search through conversation history
- `get_conversation_history()` - Retrieve chronological conversations
- `log_tool_call()` - Record MCP tool invocations
- `get_tool_call_history()` - Analyze tool usage patterns
- `reflect_on_tool_usage()` - Get AI insights on tool patterns
- `get_system_health()` - Check databases, embeddings, providers
- Built-in health check - `python tests/test_health_check.py`

Choose your embedding service:
| Provider | Speed | Quality | Cost |
|----------|-------|---------|------|
| Ollama (local) | ⚡⚡ | ⭐⭐⭐ | FREE |
| LM Studio (local) | ⚡ | ⭐⭐⭐⭐ | FREE |
| OpenAI (cloud) | ⚡⚡ | ⭐⭐⭐⭐⭐ | $$$ |
See CONFIGURATION.md for setup instructions for each provider.
All memory operations require user_id and model_id parameters for data isolation and tracking.
This ensures memories stay isolated per user and each memory is attributable to the model that created it.
By default, user_id and model_id are required. You can change this in memory_config.json:
```json
{
  "tool_requirements": {
    "require_user_id": true,
    "require_model_id": true,
    "default_user_id": "default_user",
    "default_model_id": "default_model"
  }
}
```
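To switch modes programmatically, you can rewrite just the `tool_requirements` block. A sketch using only the standard library; adjust the path if your installation differs:

```python
import json
from pathlib import Path

# Path to the memory system's config file, per the layout shown earlier
cfg_path = Path.home() / ".ai_memory" / "memory_config.json"

cfg = json.loads(cfg_path.read_text()) if cfg_path.exists() else {}
reqs = cfg.setdefault("tool_requirements", {})

# Relax strict mode: fall back to defaults when ids are omitted
reqs["require_user_id"] = False
reqs["require_model_id"] = False
reqs.setdefault("default_user_id", "default_user")
reqs.setdefault("default_model_id", "default_model")

cfg_path.parent.mkdir(parents=True, exist_ok=True)
cfg_path.write_text(json.dumps(cfg, indent=2))
```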
- `require_user_id`/`require_model_id`: `true` → Strict mode (recommended for production, security-focused, or multi-user systems)
- `require_user_id`/`require_model_id`: `false` → Use defaults instead (simpler for single-user/single-model setups)

To make your AI automatically provide these values, add this to its system prompt:
```
When using memory system tools (store_memory, search_memories, etc.),
ALWAYS include these parameters:
- user_id='your_user_identifier' (e.g., 'nate_user_1')
- model_id='your_model_name' (e.g., 'llama-2:7b' or 'gpt-4')

If the actual values are unknown, use safe defaults:
- user_id='default_user'
- model_id='default_model'
```
This isolates memories per user and tracks which AI model generated each memory.
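The isolation idea can be sketched with a toy in-memory store. This is purely illustrative (it is not the real `AIMemorySystem`, and the real system uses semantic search rather than substring matching), but it shows how `(user_id, model_id)` keys partition the data:

```python
from collections import defaultdict

class ToyMemoryStore:
    """Illustrates (user_id, model_id) scoping; NOT the real AIMemorySystem."""

    def __init__(self):
        # One bucket of memories per (user_id, model_id) pair
        self._store = defaultdict(list)

    def store_memory(self, content, user_id, model_id):
        self._store[(user_id, model_id)].append(content)

    def search_memories(self, query, user_id, model_id):
        # Toy substring match; the real system uses semantic search
        return [m for m in self._store[(user_id, model_id)]
                if query.lower() in m.lower()]

store = ToyMemoryStore()
store.store_memory("User likes Python", user_id="alice", model_id="gpt-4")
store.store_memory("User likes Go", user_id="bob", model_id="gpt-4")

print(store.search_memories("python", user_id="alice", model_id="gpt-4"))  # ['User likes Python']
print(store.search_memories("python", user_id="bob", model_id="gpt-4"))    # []
```

Each user's memories are invisible to other users, and the same user gets separate buckets per model.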
With user_id and model_id:
```python
# Memories are stored with full isolation
await system.store_memory(
    "User likes Python",
    user_id="alice",
    model_id="gpt-4",
)

# Search returns only this user's memories for this model
results = await system.search_memories(
    "programming",
    user_id="alice",
    model_id="gpt-4",
)
```
Without strict requirements (if disabled):

```python
# Uses defaults from memory_config.json
await system.store_memory("User likes Python")  # user_id="default_user", model_id="default_model"
```
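The fallback behavior can be sketched as follows. `resolve_ids` is a hypothetical helper written for illustration, not part of the library; the real logic lives inside the memory system:

```python
def resolve_ids(user_id=None, model_id=None, config=None):
    """Mimic the fallback described above: explicit ids win, else config defaults."""
    config = config or {"tool_requirements": {
        "require_user_id": False,
        "require_model_id": False,
        "default_user_id": "default_user",
        "default_model_id": "default_model",
    }}
    reqs = config["tool_requirements"]
    if user_id is None:
        if reqs["require_user_id"]:
            raise ValueError("user_id is required in strict mode")
        user_id = reqs["default_user_id"]
    if model_id is None:
        if reqs["require_model_id"]:
            raise ValueError("model_id is required in strict mode")
        model_id = reqs["default_model_id"]
    return user_id, model_id

print(resolve_ids())                  # ('default_user', 'default_model')
print(resolve_ids("alice", "gpt-4"))  # ('alice', 'gpt-4')
```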
See API.md for complete parameter documentation.
Primary deployment method - Deep integration for sophisticated memory management:
Install `ai_memory_short_term.py` as an OpenWebUI Function.

Installation:

1. Add the `ai_memory_short_term.py` file as a Function in OpenWebUI
2. It runs as an Inlet (runs before the model response)

Use with any MCP-compatible AI assistant (Claude, custom integrations, etc.):
```bash
# Via mcpo
python -m ai_memory_mcp_server

# Or make streamable for OpenWebUI's alternative integration
# (OpenWebUI supports both plugin and streamable MCP methods)
```
Use memory capabilities directly in your Python code:
```python
import asyncio

from ai_memory_core import AIMemorySystem

async def main():
    system = AIMemorySystem()
    await system.store_memory("Important information", user_id="user1", model_id="model1")
    results = await system.search_memories("query", user_id="user1", model_id="model1")

asyncio.run(main())
```
Ready-to-use examples:
```bash
python examples/basic_usage.py         # Store and search memories
python examples/advanced_usage.py      # Conversation tracking and tool logging
python examples/performance_tests.py   # Benchmark operations
```
Full API reference: API.md
This is a significantly enhanced version of traditional memory systems:
| Feature | Traditional | AI Memory System |
|---------|-------------|------------------|
| Memory Extraction | Manual/Static | LLM-powered intelligent extraction |
| Filtering | Simple keyword matching | Multi-layer semantic + relevance scoring |
| Memory Injection | All available memories | Smart filtering - only |