Question 1

Is SemanticGuard a Cloudflare AI Gateway alternative?

Accepted Answer

Only partially. Cloudflare AI Gateway is a first-party gateway on Cloudflare's network with logging, rate limiting, and exact-match caching. SemanticGuard is a semantic cache with verified correctness that runs on any host. If you are all-in on Cloudflare and mostly need identical-prompt caching plus rate limits, Cloudflare AI Gateway is the better starting point. If you need paraphrase-aware caching with a published correctness number, or you run off Cloudflare, SemanticGuard is the better fit.

Question 2

What is the difference between exact-match and semantic caching?

Accepted Answer

Exact-match caching only hits when the request bytes are identical. Semantic caching recognizes when two prompts mean the same thing even if wording differs, so support bots, RAG, and Q&A workloads see many more hits. SemanticGuard verifies correctness on every served semantic hit; every cache return is measured against the disclosed benchmark.
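
To make the difference concrete, here is a minimal sketch in Python. It is not SemanticGuard's or Cloudflare's implementation. The names `ExactCache`, `SemanticCache`, `toy_embed`, and the 0.8 threshold are all assumptions made for illustration, and the "embedding" is a plain bag-of-words count, which only tolerates changes in case, punctuation, and word order. A real semantic cache would use a learned embedding model to catch genuine paraphrases.

```python
import hashlib
import math
import re
from collections import Counter
from typing import List, Optional, Tuple


def exact_key(prompt: str) -> str:
    # Exact-match key: a hash of the raw request bytes.
    return hashlib.sha256(prompt.encode("utf-8")).hexdigest()


def toy_embed(prompt: str) -> Counter:
    # Toy stand-in for an embedding model (assumption): lowercase word counts.
    return Counter(re.findall(r"[a-z0-9]+", prompt.lower()))


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ExactCache:
    def __init__(self) -> None:
        self._store: dict = {}

    def put(self, prompt: str, response: str) -> None:
        self._store[exact_key(prompt)] = response

    def get(self, prompt: str) -> Optional[str]:
        # Hits only when the prompt bytes are identical.
        return self._store.get(exact_key(prompt))


class SemanticCache:
    def __init__(self, threshold: float = 0.8) -> None:
        self.threshold = threshold  # hypothetical similarity cutoff
        self._entries: List[Tuple[Counter, str]] = []

    def put(self, prompt: str, response: str) -> None:
        self._entries.append((toy_embed(prompt), response))

    def get(self, prompt: str) -> Optional[str]:
        # Return the most similar stored response if it clears the threshold.
        query = toy_embed(prompt)
        best_score, best_response = 0.0, None
        for vector, response in self._entries:
            score = cosine(query, vector)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None
```

With `"How do I reset my password?"` stored in both caches, a lookup for `"how do i reset my password"` misses in `ExactCache` because the bytes differ, but hits in `SemanticCache` because the token vectors match. An unrelated prompt such as `"What is your refund policy?"` misses in both. This sketch omits the verification step that the answer above describes for served semantic hits.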

Question 3

Can I use Cloudflare AI Gateway and SemanticGuard together?

Accepted Answer

Yes. They live at different layers. Use Cloudflare AI Gateway at your network edge for rate limits, logging, and per-key budgets. Route through SemanticGuard for the cache layer with paraphrase-aware hits and verified correctness. Cache misses still reach the provider through Cloudflare, so its rate limits and logging continue to apply to them.
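
The layering can be sketched roughly as follows. Everything here is hypothetical: `GatewayStub` and `CacheLayerStub` are stand-ins written for this illustration, not real Cloudflare or SemanticGuard APIs, and the cache uses a simple whitespace- and case-normalized key rather than real semantic matching. The point is only the request path: cache first, gateway on a miss.

```python
from typing import Callable, Dict, List


class GatewayStub:
    """Hypothetical gateway: logs and rate-limits every upstream call."""

    def __init__(self, provider: Callable[[str], str], max_requests: int) -> None:
        self.provider = provider
        self.max_requests = max_requests
        self.log: List[str] = []

    def forward(self, prompt: str) -> str:
        # Enforce a simple request budget before calling the provider.
        if len(self.log) >= self.max_requests:
            raise RuntimeError("rate limit exceeded")
        self.log.append(prompt)
        return self.provider(prompt)


class CacheLayerStub:
    """Hypothetical cache layer placed in front of the gateway."""

    def __init__(self, gateway: GatewayStub) -> None:
        self.gateway = gateway
        self._store: Dict[str, str] = {}

    def complete(self, prompt: str) -> str:
        # Normalized key as a stand-in for semantic matching (assumption).
        key = " ".join(prompt.lower().split())
        if key in self._store:
            return self._store[key]  # cache hit: the gateway is never called
        response = self.gateway.forward(prompt)  # cache miss: goes via the gateway
        self._store[key] = response
        return response
```

In this arrangement only cache misses count against the gateway's budget and appear in its log, which is the behavior the answer above describes.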

Question 4

What does 'verified correctness' actually mean?

Accepted Answer

Every candidate cache hit goes through multi-layer verification before being served. We publish a benchmark with 100% measured cache correctness on the disclosed workloads at https://www.semanticguard.dev/benchmark, including the judge model, sample size, and methodology.

Dimension	Cloudflare AI Gateway	SemanticGuard	Better fit
Primary job	First-party AI gateway for Cloudflare traffic: logging, rate limiting, exact-match cache	Intelligent caching with verified correctness across any host	Both fit
Cache type	Exact-match: identical prompts hit, paraphrases do not	Semantic: catches paraphrases and reworded questions with correctness verified on every served hit	SemanticGuard
Correctness measurement on cache hits	Cache returns are trusted as-is; no published correctness measurement	100% measured on public benchmark, methodology disclosed at /benchmark	SemanticGuard
Works off Cloudflare	Best when your traffic already runs on Cloudflare Workers or Pages	Any host: Vercel, AWS, GCP, self-hosted, local dev. Cloud-agnostic	SemanticGuard
Rate limiting and per-key quotas	Core strength: request quotas, per-key budgets, protocol-native	Per-tenant billing quotas; not a general-purpose rate limiter	Cloudflare AI Gateway
Edge presence and cold-start latency	Runs on Cloudflare's global network; extremely low overhead if you are already on Cloudflare	Vercel Edge Runtime for hot paths; comparable regional latency	Cloudflare AI Gateway
Shadow mode (see savings before enabling)	N/A	Default. Install and watch "would have saved $X" for a week before flipping cache on	SemanticGuard
Self-host in your own cloud tenant	Runs on Cloudflare's platform by design	One-click install deploys the proxy into your own Vercel account. Prompts and cache stay in your tenant	SemanticGuard
Pricing model at scale	Usage-based on Cloudflare's platform pricing	$49/mo Pro, or 15% of documented savings on Enterprise ($500/mo minimum). Pays for itself when caching works	SemanticGuard
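
As a rough worked example of the Enterprise pricing row, the sketch below assumes the $500/mo minimum acts as a floor on the 15% fee, so the monthly charge is the larger of the two. That reading is an assumption, not a statement of the actual billing terms, and the function names are hypothetical.

```python
ENTERPRISE_RATE = 0.15      # 15% of documented savings (from the table)
ENTERPRISE_MINIMUM = 500.0  # $500/mo minimum (from the table)


def enterprise_fee(documented_savings: float) -> float:
    # Assumed reading: the fee is 15% of savings, but never below the minimum.
    return max(ENTERPRISE_MINIMUM, ENTERPRISE_RATE * documented_savings)


def net_savings(documented_savings: float) -> float:
    # What is left after paying the Enterprise fee.
    return documented_savings - enterprise_fee(documented_savings)


def break_even_savings() -> float:
    # Savings level at which 15% of savings equals the monthly minimum.
    return ENTERPRISE_MINIMUM / ENTERPRISE_RATE
```

Under this assumption, $10,000 of documented monthly savings gives a fee of $1,500 and a net of $8,500, while $2,000 of savings gives the $500 minimum and a net of $1,500. The percentage starts to exceed the minimum at roughly $3,333 of monthly savings.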
