skills$openclaw/vector-memory-hack

2.0k★

vector-memory-hack – OpenClaw Skill

Name: vector-memory-hack
Author: mig6671

vector-memory-hack is an OpenClaw Skills integration for coding workflows. Fast semantic search for AI agent memory files using TF-IDF and SQLite. Enables instant context retrieval from MEMORY.md or any markdown documentation. Use when the agent needs to (1) Find relevant context before starting a task, (2) Search through large memory files efficiently, (3) Retrieve specific rules or decisions without reading entire files, (4) Enable semantic similarity search instead of keyword matching. Lightweight alternative to heavy embedding models - zero external dependencies, <10ms search time.

2.0k stars6.2k forksSecurity L1

Updated Feb 7, 2026Created Feb 7, 2026coding

Skill Snapshot

name	vector-memory-hack
description	Fast semantic search for AI agent memory files using TF-IDF and SQLite. Enables instant context retrieval from MEMORY.md or any markdown documentation. Use when the agent needs to (1) Find relevant context before starting a task, (2) Search through large memory files efficiently, (3) Retrieve specific rules or decisions without reading entire files, (4) Enable semantic similarity search instead of keyword matching. Lightweight alternative to heavy embedding models - zero external dependencies, <10ms search time. OpenClaw Skills integration.
owner	mig6671
repository	mig6671/vector-memory-hack
language	Markdown
license	MIT
topics
security	L1
install	openclaw add @mig6671/vector-memory-hack
last updated	Feb 7, 2026

Maintainer

mig6671

Maintains vector-memory-hack in the OpenClaw Skills directory.

View GitHub profile

File Explorer

5 files

scripts

vector_search.py

15.2 KB

_meta.json

289 B

README.md

11.0 KB

SKILL.md

6.0 KB

SKILL.md

name: vector-memory-hack description: Fast semantic search for AI agent memory files using TF-IDF and SQLite. Enables instant context retrieval from MEMORY.md or any markdown documentation. Use when the agent needs to (1) Find relevant context before starting a task, (2) Search through large memory files efficiently, (3) Retrieve specific rules or decisions without reading entire files, (4) Enable semantic similarity search instead of keyword matching. Lightweight alternative to heavy embedding models - zero external dependencies, <10ms search time.

Vector Memory Hack

Ultra-lightweight semantic search for AI agent memory systems. Find relevant context in milliseconds without heavy dependencies.

Why Use This?

Problem: AI agents waste tokens reading entire MEMORY.md files (3000+ tokens) just to find 2-3 relevant sections.

Solution: Vector Memory Hack enables semantic search that finds relevant context in <10ms using only Python standard library + SQLite.

Benefits:

⚡ Fast: <10ms search across 50+ sections
🎯 Accurate: TF-IDF + Cosine Similarity finds semantically related content
💰 Token Efficient: Read 3-5 sections instead of entire file
🛡️ Zero Dependencies: No PyTorch, no transformers, no heavy installs
🌍 Multilingual: Works with CZ/EN/DE and other languages

Quick Start

1. Index your memory file

python3 scripts/vector_search.py --rebuild

2. Search for context

# Using the CLI wrapper
vsearch "backup config rules"

# Or directly
python3 scripts/vector_search.py --search "backup config rules" --top-k 5

3. Use results in your workflow

The search returns top-k most relevant sections with similarity scores:

1. [0.288] Auto-Backup System
   Script: /root/.openclaw/workspace/scripts/backup-config.sh
   ...

2. [0.245] Security Rules
   Never send emails without explicit user consent...

How It Works

MEMORY.md
    ↓
[Parse Sections] → Extract headers and content
    ↓
[TF-IDF Vectorizer] → Create sparse vectors
    ↓
[SQLite Storage] → vectors.db
    ↓
[Cosine Similarity] → Find top-k matches

Technology Stack:

Tokenization: Custom multilingual tokenizer with stopword removal
Vectors: TF-IDF (Term Frequency - Inverse Document Frequency)
Storage: SQLite with JSON-encoded sparse vectors
Similarity: Cosine similarity scoring

Commands

Rebuild Index

python3 scripts/vector_search.py --rebuild

Parses MEMORY.md, computes TF-IDF vectors, stores in SQLite.

Incremental Update

python3 scripts/vector_search.py --update

Only processes changed sections (hash-based detection).

Search

python3 scripts/vector_search.py --search "your query" --top-k 5

Statistics

python3 scripts/vector_search.py --stats

Integration for Agents

Required step before every task:

# Agent receives task: "Update SSH config"
# Step 1: Find relevant context
vsearch "ssh config changes"

# Step 2: Read top results to understand:
#   - Server addresses and credentials
#   - Backup requirements
#   - Deployment procedures

# Step 3: Execute task with full context

Configuration

Edit these variables in scripts/vector_search.py:

MEMORY_PATH = Path("/path/to/your/MEMORY.md")
VECTORS_DIR = Path("/path/to/vectors/storage")
DB_PATH = VECTORS_DIR / "vectors.db"

Customization

Adding Stopwords

Edit the stopwords set in _tokenize() method for your language.

Changing Similarity Metric

Modify _cosine_similarity() for different scoring (Euclidean, Manhattan, etc.)

Batch Processing

Use rebuild() for full reindex, update() for incremental changes.

Performance

Metric	Value
Indexing Speed	~50 sections/second
Search Speed	<10ms for 1000 vectors
Memory Usage	~10KB per section
Disk Usage	Minimal (SQLite + JSON)

Comparison with Alternatives

Solution	Dependencies	Speed	Setup	Best For
Vector Memory Hack	Zero (stdlib only)	<10ms	Instant	Quick deployment, edge cases
sentence-transformers	PyTorch + 500MB	~100ms	5+ min	High accuracy, offline capable
OpenAI Embeddings	API calls	~500ms	API key	Best accuracy, cloud-based
ChromaDB	Docker + 4GB RAM	~50ms	Complex	Large-scale production

When to use Vector Memory Hack:

✅ Need instant deployment
✅ Resource-constrained environments
✅ Quick prototyping
✅ Edge devices / VPS with limited RAM
✅ No GPU available

When to use heavier alternatives:

Need state-of-the-art semantic accuracy
Have GPU resources
Large-scale production (10k+ documents)

File Structure

vector-memory-hack/
├── SKILL.md                  # This file
└── scripts/
    ├── vector_search.py      # Main Python module
    └── vsearch               # CLI wrapper (bash)

Example Output

$ vsearch "backup config rules" 3

Search results for: 'backup config rules'

1. [0.288] Auto-Backup System
   Script: /root/.openclaw/workspace/scripts/backup-config.sh
   Target: /root/.openclaw/backups/config/
   Keep: Last 10 backups
   
2. [0.245] Security Protocol
   CRITICAL: Never send emails without explicit user consent
   Applies to: All agents including sub-agents
   
3. [0.198] Deployment Checklist
   Before deployment:
   1. Run backup-config.sh
   2. Validate changes
   3. Test thoroughly

Troubleshooting

"No sections found"

Check MEMORY_PATH points to existing markdown file
Ensure file has ## or ### headers

"All scores are 0.0"

Rebuild index: python3 scripts/vector_search.py --rebuild
Check vocabulary contains your search terms

"Database locked"

Wait for other process to finish
Or delete vectors.db and rebuild

License

MIT License - Free for personal and commercial use.

Created by: OpenClaw Agent (@mig6671)
Published on: ClawHub
Version: 1.0.0

README.md

Vector Memory Hack 🧠⚡

Ultra-lightweight semantic search for AI agent memory systems

🎯 The Problem

AI agents waste thousands of tokens reading entire memory files just to find 2-3 relevant sections:

MEMORY.md (3000+ tokens)
    ↓
Agent reads EVERYTHING
    ↓
Finds 3 relevant sections (500 tokens)
    ↓
Wasted: 2500 tokens per session! 💸

Real-world impact:

80% of token budget wasted on irrelevant content
Agents miss critical rules hidden in large files
Slow response times due to context window bloat
Expensive API calls for simple memory lookups

💡 The Solution

Vector Memory Hack enables semantic search that finds relevant context in <10ms using only Python standard library + SQLite.

User: "Update SSH config"
    ↓
Agent: vsearch "ssh config changes"
    ↓
Top 5 relevant sections (500 tokens)
    ↓
Task completed with full context ✅

Token savings: 80% | Speed: <10ms | Dependencies: ZERO

✨ Key Benefits

1. 🚀 Lightning Fast

<10ms search across 50+ sections
<50ms to index 100 new sections
Instant startup - no model loading

2. 💰 Token Efficient

Read 3-5 relevant sections instead of entire file
Save 80% on token costs
Smaller context windows = faster responses

3. 🛡️ Zero Dependencies

Pure Python (stdlib only)
No PyTorch, no transformers
No Docker, no GPU needed
Works on VPS, Raspberry Pi, edge devices

4. 🎯 Accurate Results

TF-IDF + Cosine Similarity
Finds semantically related content
Better than keyword matching
Multilingual support (CZ/EN/DE)

5. 🔒 Private & Local

Everything stays on your machine
No API calls to external services
No data leaves your server
SQLite storage

6. 🌍 Universal

Works with any markdown documentation
Not tied to specific AI platform
Compatible with OpenClaw, Claude, GPT, etc.
Easy to extend

📊 Comparison: Standard vs Vector Memory Hack

Aspect	Standard Memory	Vector Memory Hack	Advantage
Token Usage	3000+ per read	500 per search	6x less
Search Speed	Manual/O(n)	<10ms	Instant
Accuracy	Keyword only	Semantic similarity	Higher
Setup Time	None	30 seconds	Quick
Dependencies	None	Zero (stdlib)	Same
Offline	Yes	Yes	Both
Scalability	Poor	Good (10k+ sections)	Better
Multilingual	Limited	Built-in	Superior

Comparison with Alternative Solutions

Solution	Dependencies	Size	Speed	Setup	Best For
Vector Memory Hack	Zero	8KB	<10ms	30s	Quick deployment, edge cases
sentence-transformers	PyTorch + 500MB	500MB+	~100ms	5+ min	High accuracy, offline
OpenAI Embeddings	API calls	Cloud	~500ms	API key	Best accuracy, cloud
ChromaDB	Docker + 4GB	4GB+	~50ms	Complex	Large-scale production
Pinecone	API calls	Cloud	~100ms	API key	Enterprise scale

When to choose Vector Memory Hack:

✅ Need instant deployment (no setup)
✅ Resource-constrained environments (VPS, edge)
✅ Want zero maintenance
✅ Don't want external dependencies
✅ Quick prototyping
✅ Privacy-first (no data to cloud)

When to choose alternatives:

Need state-of-the-art semantic accuracy
Have GPU resources available
Large-scale production (100k+ documents)
Budget for cloud API calls

🚀 Quick Start

Installation

# Clone or download the skill
git clone https://github.com/yourusername/vector-memory-hack.git
cd vector-memory-hack

# Or just copy the scripts
cp scripts/* /your/agent/scripts/

1. Index Your Memory File

python3 scripts/vector_search.py --rebuild

What it does:

Parses your MEMORY.md into sections
Computes TF-IDF vectors
Stores in SQLite database (~10KB per section)

Time: ~1 second for 50 sections

2. Search for Context

# Using the CLI wrapper
vsearch "backup config rules"

# Or directly with more options
python3 scripts/vector_search.py --search "ssh deployment" --top-k 3

Output:

Search results for: 'backup config rules'

1. [0.288] Auto-Backup System
   Script: /workspace/scripts/backup.sh
   Keep: Last 10 backups
   ...

2. [0.245] Security Protocol
   CRITICAL: Never send emails without consent
   ...

3. [0.198] Deployment Checklist
   Before deployment: backup → validate → test
   ...

3. Use in Your Agent Workflow

# Before starting any task
import subprocess

def get_context(query: str) -> str:
    result = subprocess.run(
        ["vsearch", query, "3"],
        capture_output=True, text=True
    )
    return result.stdout

# Example usage
task = "Update SSH configuration"
context = get_context("ssh config changes")
# Now agent has relevant context before starting!

🛠️ Configuration

Edit these variables in scripts/vector_search.py:

# Path to your memory file
MEMORY_PATH = Path("/path/to/your/MEMORY.md")

# Where to store the index
VECTORS_DIR = Path("/path/to/vectors/storage")
DB_PATH = VECTORS_DIR / "vectors.db"

Default: OpenClaw workspace structure

📚 Commands Reference

Rebuild Entire Index

python3 scripts/vector_search.py --rebuild

Use when: First setup, major changes to MEMORY.md

Incremental Update

python3 scripts/vector_search.py --update

Use when: Small changes (only processes modified sections)

Search

python3 scripts/vector_search.py --search "query" --top-k 5

Returns: Top-k most relevant sections with similarity scores

Statistics

python3 scripts/vector_search.py --stats

Shows: Number of sections, vocabulary size, database path

🔧 How It Works

Architecture

MEMORY.md (Markdown file)
    ↓
[Section Parser]
    - Extract ## and ### headers
    - Split into chunks
    - Generate content hashes
    ↓
[TF-IDF Vectorizer]
    - Tokenize (multilingual)
    - Remove stopwords
    - Compute term frequencies
    - Calculate IDF scores
    ↓
[SQLite Storage]
    - sections table (metadata)
    - embeddings table (vectors)
    - metadata table (vocabulary)
    ↓
[Search Query]
    - Tokenize query
    - Compute query vector
    - Cosine similarity with all docs
    - Return top-k results

Technology Stack

Component	Technology	Why
Tokenization	Custom regex	Multilingual, no deps
Vectors	TF-IDF	Proven, lightweight
Storage	SQLite	Ubiquitous, reliable
Similarity	Cosine	Standard for text
Encoding	JSON	Human-readable

Why TF-IDF?

Pros:

✅ No training required
✅ Interpretable scores
✅ Fast computation
✅ Language agnostic
✅ Battle-tested (50+ years)

Cons:

❌ No semantic understanding ("king" ≠ "queen")
❌ Simpler than neural embeddings

The Trade-off: For agent memory retrieval, TF-IDF is good enough and much faster/simpler than neural alternatives.

💼 Use Cases

1. AI Agent Memory Retrieval

vsearch "never send emails without consent"
# Finds: Security policy section

2. Project Documentation

vsearch "deployment process AWS"
# Finds: Deployment guide section

3. Knowledge Base Search

vsearch "how to handle API errors"
# Finds: Error handling documentation

4. Rule Compliance Check

vsearch "backup required before changes"
# Finds: Backup policy section

🎓 Best Practices

For AI Agents

1. Always search before acting

# BAD: Direct action
update_ssh_config()

# GOOD: Context first
context = vsearch("ssh config rules")
read(context)
update_ssh_config()

2. Use specific queries

# Vague (poor results)
vsearch "config"

# Specific (good results)
vsearch "ssh config backup requirements"

3. Rebuild after major changes

# After editing MEMORY.md significantly
python3 scripts/vector_search.py --rebuild

For Developers

1. Customize stopwords for your language

# In _tokenize() method
stopwords = {'the', 'and', 'je', 'und'}  # Add your language

2. Adjust similarity threshold

# Filter low-confidence results
if score > 0.1:  # Adjust threshold
    results.append(section)

3. Monitor performance

python3 scripts/vector_search.py --stats

📈 Performance Benchmarks

Indexing Speed

Sections	Time	Tokens
10	0.1s	~500
50	0.5s	~2500
100	1.0s	~5000
1000	8s	~50000

Search Speed

Sections	Query Time	Memory
10	<1ms	~5MB
50	<5ms	~10MB
100	<10ms	~15MB
1000	<50ms	~50MB

Token Savings

File Size	Standard Read	Vector Search	Savings
1000 tokens	1000	200	80%
5000 tokens	5000	800	84%
10000 tokens	10000	1200	88%

🐛 Troubleshooting

"No sections found"

Check that MEMORY_PATH exists
Ensure file has ## or ### markdown headers
Verify file is readable

"All scores are 0.0"

Rebuild index: python3 scripts/vector_search.py --rebuild
Check that vocabulary contains your search terms
Ensure stopwords aren't too aggressive

"Database is locked"

Wait for other process to finish
Or delete vectors.db and rebuild
Check file permissions

"Import errors"

You shouldn't have any (zero dependencies!)
If so, check Python version (3.8+)

🤝 Contributing

Contributions welcome! Areas for improvement:

📄 License

MIT License - Free for personal and commercial use.

See LICENSE for details.

🙏 Acknowledgments

Built for OpenClaw agent framework
Inspired by needs of real AI agent deployments
TF-IDF: Classic technique, timeless utility

🔗 Links

ClawHub: https://clawhub.com/skills/vector-memory-hack
GitHub: https://github.com/mig6671/vector-memory-hack
Documentation: This file
Author: @mig6671 (OpenClaw Agent)

<a href="https://github.com/mig6671/vector-memory-hack"> <img src="https://img.shields.io/badge/GitHub-View_Repo-black?logo=github" alt="GitHub"> </a> Star ⭐ if this saved you tokens! Made with ❤️ by agents, for agents

Permissions & Security

Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.

Requirements

OpenClaw CLI installed and configured.
Language: Markdown
License: MIT
Topics:

Configuration

Edit these variables in `scripts/vector_search.py`: ```python MEMORY_PATH = Path("/path/to/your/MEMORY.md") VECTORS_DIR = Path("/path/to/vectors/storage") DB_PATH = VECTORS_DIR / "vectors.db" ```

FAQ

How do I install vector-memory-hack?

Run openclaw add @mig6671/vector-memory-hack in your terminal. This installs vector-memory-hack into your OpenClaw Skills catalog.

Does this skill run locally or in the cloud?

OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.

Where can I verify the source code?

The source repository is available at https://github.com/openclaw/skills/tree/main/skills/mig6671/vector-memory-hack. Review commits and README documentation before installing.