Question 1

What counts as a request on the free tier?

Accepted Answer

Every proxy call to /api/proxy/*, regardless of cache hit or miss. Shadow-mode requests count too. That's why the free tier is sized for 10K/month rather than 1K.

Question 2

What happens at 50K requests on Pro?

Accepted Answer

Overage kicks in at $0.50 per additional 1K requests. Hard cap at 500K/month; above that we route you to Enterprise. You get a usage alert at 80% of any threshold.

Question 3

How is 'documented savings' calculated for Enterprise?

Accepted Answer

Sum of upstream costs we avoided based on cache hits times the published per-million-token price of the model that would otherwise have served the request. The 15% comes out monthly; full audit log and raw numbers are exported with the invoice.

Question 4

Can I self-host?

Accepted Answer

Yes, with a license key (Pro or above). The Terraform module deploys the gateway into your own Vercel, AWS, or GCP account, so your data stays in your VPC. Contact enterprise@semanticguard.dev.

Question 5

Annual discount?

Accepted Answer

10% off Pro paid annually. Enterprise contracts are custom.

	SemanticGuard	OpenAI prompt cache	Anthropic prompt caching
Pricing model	Free tier + $49/mo Pro + 15% of savings Enterprise	Free, applied automatically on exact-match cache hits	90% discount on cached input tokens (5-min TTL)
What gets cached	Same wording, different wording, and multi-turn context, caught by multi-layer verification	Exact prompt prefix only	Explicitly-marked cache blocks only
Cross-provider	Yes. Cache a GPT-4o response, serve it to a Claude-Opus paraphrase	No (OpenAI only)	No (Anthropic only)
Cache correctness audit	100% on our public benchmark, AI-judged	No public correctness number	No public correctness number
Observability	Dashboard + Prometheus + per-request tracing	Token counters in API response	Token counters in API response
Per-tenant kill switch + audit log	Yes	N/A (provider-side)	N/A (provider-side)

Pay for what you save.

Choose a tier

Free

Pro

Enterprise

How we compare to native provider caching

Pricing FAQ

Not sure which tier fits?