We aggregated 100+ free AI models into one endpoint

The free-LLM landscape is real but scattered: Groq, Gemini, Cloudflare, Mistral, OVHcloud and a dozen others each give away genuine capacity, behind a dozen signup pages, a dozen key formats, and a dozen incompatible APIs. Over two days we discovered, tested, and merged every legitimate permanent-free provider we could into UnoRouter. The result: 134 free model rows from 15 providers behind one OpenAI-compatible endpoint and one key.

What we added

Fifteen free providers, one at a time: Groq, Gemini, Cerebras, SambaNova, Mistral, Cloudflare Workers AI (two accounts), GitHub Models, Z.ai, OVHcloud, AI Horde, Pollinations, Cohere, Jina, NVIDIA NIM, and OpenRouter. That is 134 free model rows: Llama, gpt-oss, Qwen, Mistral, GLM, Nemotron and more, plus 30 free embedding models and 13 free image and audio models. Every one is probed end to end for HTTP, streaming, and tool calls before it goes live, the same authenticity and harness checks we run on paid models.

Free for a reason

These models are genuinely free, and that is exactly why they have limits. The provider sets those limits, not us, and we cannot raise them. Each upstream enforces its own rate limits: requests per minute, daily token quotas, Cloudflare neuron budgets, volunteer-queue priority. Hit a cap and that provider returns 429 until it resets. A free key that worked this morning can be exhausted by this afternoon. Free tier is best-effort throughput, not a guarantee. If your workload needs predictable latency and no surprise 429s, use a paid model.

Why aggregate them at all

Because the alternative is fifteen accounts. Each provider has its own signup, its own key format, its own base URL, and its own quirks: Z.ai speaks the Zhipu V4 path, Cloudflare carries the account id in the URL, AI Horde wants an anonymous key, GitHub gates models behind a token scope. We absorbed all of that so you call them the way you call everything else: one OpenAI-compatible endpoint, one key, a model name. The honest rule we hold ourselves to: one real account per provider, caps accepted, nothing farmed, nothing pooled. We expose the free tier as a gift, not as a resale of someone else's quota.

How we soften the limits

BLOG.POSTS.FREE_MODELS_AGGREGATED.P_FAILOVER

What we did not do

We did not add reverse proxies that re-serve OpenAI or Claude flagships without permission. We did not pull in personal-key aggregators whose tokens are non-transferable, or pool-of-pools services that farm and rotate other people's keys. Those exist and they are tempting and they are exactly the gray-market mess this gateway is meant to replace. Every provider on the list gives its free tier away on purpose, under its own terms. If a source could not pass that bar, it is not here.

Try it

BLOG.POSTS.FREE_MODELS_AGGREGATED.CTA