Question 1

Does my code leave my machine?

Accepted Answer

No. All embeddings, AST parsing, and graph storage run locally in ~/.eidos/. There are zero cloud dependencies. Your code never leaves your computer.

Question 2

Which AI tools does it work with?

Accepted Answer

Claude Code, Gemini CLI, Qwen, Aider, llm, sgpt, mods, Open Interpreter, Continue, Cursor, Claude Desktop, and any OpenAI-compatible tool.

Question 3

How is this different from claude-mem?

Accepted Answer

EidosCore replaces claude-mem, memory, graphify, and caveman with one tool. It adds automatic context injection, token-budgeted assembly, session continuity (QMS), self-learning retrieval, and 14 MCP tools. Published on npm with 106 automated tests.

Question 4

Is it really free?

Accepted Answer

Yes. MIT licensed. Local-first. No cloud dependency. No hidden costs, no API keys needed for the core functionality.

Question 5

How much does it actually save?

Accepted Answer

In our debugging benchmark: 1,245 words down to 57 words — a 95.6% reduction. At 50 prompts/day, that's roughly $90/month per developer in token costs saved.

Question 6

What is an AI memory engine?

Accepted Answer

An AI memory engine is a tool that gives AI coding assistants persistent memory across sessions. It indexes your codebase, retrieves relevant context, compresses it, and injects it into every prompt automatically — saving 95-98% of tokens.

Question 7

How does the Model Context Protocol (MCP) work with Eidos?

Accepted Answer

Eidos provides a full MCP server with 14 tools. This lets Claude Desktop, Cursor, Continue, and other MCP-compatible clients query Eidos memory directly — search code, assemble context, and remember decisions.

Question 8

Can I use Eidos Memory with ChatGPT or GPT-4?

Accepted Answer

Yes. Eidos works with any OpenAI-compatible tool through its universal CLI adapter. Use 'eidos wrap' with any tool that accepts prompts via stdin.

Content Type	Original Tokens	Compressed Tokens	Reduction
Full function body	200-500	50 (skeleton)	75-90%
10 conversation turns	500-1,000	40-60 (micro-summary)	94-96%
File diff vs. old version	200	10-30 (patch only)	85-95%
Daily context (all above)	5,000-10,000	150-300	97-98%
Typical session	20,000+	200-400	98-99%

How to Reduce AI Token Usage by 95%

The Problem: You're Paying for Repetition

Where Tokens Actually Go

1. File contents (40-60% of tokens)

2. Architecture explanations (20-30% of tokens)

3. Conversation history (10-20% of tokens)

The Solution: Persistent AI Memory

Before (without memory):

After (with memory):

Real Numbers

How to Set It Up

Other Ways to Reduce Token Usage

The Bottom Line