Question 1

What is Compresh?

Accepted Answer

Compresh is context-compression and episodic-memory middleware for LLM APIs. It sits between your app and your provider, reconstructing prior context instead of resending the whole conversation. One line to integrate: change your base URL.

Question 2

How does compression work?

Accepted Answer

As a conversation deepens, Compresh reconstructs what mattered — decisions, facts, corrections — instead of retransmitting the full history each turn. The classification that guides it is internal; the model receives reconstructed context, never the tags. Deeper conversations see more savings.

Question 3

Will it affect response quality?

Accepted Answer

In a 360-item real-world benchmark replayed as one long session, Compresh cut input tokens by about 66% with no measurable quality loss versus sending the full history, and the memory layer added no token overhead. Short conversations see minimal compression. Deeper episodic recall is under active benchmarking.

Question 4

Is my data safe?

Accepted Answer

Yes. API keys are encrypted with Fernet. No conversation content is stored permanently — context is processed in-flight and the internal compression state is transient. Compresh never logs message content.

Question 5

Can I use free or local models?

Accepted Answer

Yes. Free and local models (Ollama, LM Studio, etc.) work through Compresh with no savings-share deduction. The Starter tier needs a $10 minimum top-up ($7.50 with the 25% discount) and a $5 minimum balance to keep free-model access.

Question 6

What's the pricing?

Accepted Answer

Starter: $0 service fee, $10 minimum top-up ($7.50 with the permanent 25% top-up discount), no card on file. Every verified account gets $30 in credit, and accounts using the MCP server or OpenClaw hook include a 5-day free TUL 2.0 trial. Paid models carry a 30% savings-share. Pro subscriptions: Quarterly $18 (20% share), Semi-annual $33 (16% share), Annual $60 (12% share).

Question 7

What if Compresh doesn't recognize my model as free or paid?

Accepted Answer

Compresh detects model pricing from provider responses; local usage is auto-detected. For models it has no pricing data on yet, it falls back to OpenRouter's lowest published tariff as a conservative default. Misclassifications can be disputed via support.

Question 8

Can I self-host?

Accepted Answer

The core proxy is open-source — fork the repository, configure your own keys, and deploy anywhere. Self-hosting means you handle infrastructure, updates, and scaling yourself.

Question 9

Which models are supported?

Accepted Answer

Any OpenAI-compatible model, all Anthropic Claude models, and 200+ models via OpenRouter.

Question 10

What about injection attacks?

Accepted Answer

Compresh includes a 3-layer injection detection system — regex pattern matching, heuristic analysis, and ML-based classification — supporting 19 languages, running on every request before it is forwarded to your provider.

FAQ