OpenRouter Free Models (2026): Best Free APIs & Setup Guide
OpenRouter free models in 2026: best free LLMs, API key setup, limits, and when to use OpenRouter vs direct OpenAI or Anthropic.
Short answer: OpenRouter is the fastest way to call many LLMs through one API key, including free or :free model routes when providers allow it. Budget 15 minutes to sign up, copy a key, and send your first request. Skill level: you can run curl or paste Python into a notebook.
We last verified the models page and a live API call on June 2, 2026. Free IDs change; treat the table below as a pattern, not a permanent list.
Last updated: June 2, 2026.
What you need
| Item | Notes |
|---|---|
| OpenRouter account | openrouter.ai sign-up (email or OAuth) |
| API key | Dashboard → Keys → Create |
| Optional credits | Some “free” promos still need a positive balance for abuse prevention |
| HTTP client | curl, Python openai SDK, or your IDE |
| Model slug | Copy from the catalog, e.g. deepseek/deepseek-v4-flash or names ending in :free |

Quick comparison: free vs paid routing
| Route type | Cost signal | Best for | Watch out for |
|---|---|---|---|
| :free or $0 listed models | $0 per token on catalog | Learning, drafts, personal scripts | Rate limits, sudden removal |
| Cheap paid (Flash, Mistral) | Cents per million tokens | Batch codegen, summaries | Tool-call quality varies |
| Frontier paid (GPT-5.5, Opus) | Dollars per million tokens | Final patches, agents | Cost climbs quickly; use narrowly, where it is still cheaper than human hours lost to a wrong answer |
For how those models compare on coding, see DeepSeek V4 vs ChatGPT vs Claude and the latest AI models hub.
Step 1: Create an account and key
- Go to openrouter.ai and sign in.
- Open Keys in the dashboard.
- Click Create Key, name it (example: aitoolsradar-dev), copy once.
- Export locally:
export OPENROUTER_API_KEY="sk-or-v1-xxxxxxxx"
Expected result: The keys page shows your new key with create date. Revoke any key you pasted into a screenshot by mistake.
Common mistake: Committing the key to GitHub. Use .env and add .env to .gitignore.
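A minimal sketch of reading the key from the environment instead of source code. The helper names `load_openrouter_key` and `mask_key` are ours for illustration, not part of any SDK; only the `OPENROUTER_API_KEY` variable name comes from the step above.

```python
import os
from typing import Mapping, Optional


def load_openrouter_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the OpenRouter key from the environment, or raise a clear error."""
    source = os.environ if env is None else env
    key = source.get("OPENROUTER_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENROUTER_API_KEY is not set; export it or load it from .env"
        )
    return key


def mask_key(key: str) -> str:
    """Keep only a short prefix so the value is safe to print in logs."""
    return key[:8] + "..." if len(key) > 8 else "***"
```

Failing fast on a missing key gives a clearer message than a 401 from the API later, and printing only `mask_key(...)` keeps the secret out of logs and screenshots.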
Step 2: Find free models on the catalog
- Open Models in the dashboard (or visit the public models page).
- Sort or filter by price ascending.
- Look for :free suffixes or $0 input and output on the row you want.
- Copy the model ID string exactly, including provider prefix.
Expected result: You have one slug ready for model in JSON, such as deepseek/deepseek-v4-flash:free or openrouter/free (auto-picks a free route) when offered.
Common mistake: Using an old blog slug after the provider renamed the checkpoint. The catalog wins over blog posts.
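The same filter can be sketched in code. This assumes the catalog is available as JSON at `https://openrouter.ai/api/v1/models`, shaped like `{"data": [{"id": ..., "pricing": {"prompt": "0", "completion": "0"}}]}`; check both the URL and the field names against the current OpenRouter docs before relying on them.

```python
import json
import urllib.request

# Assumed catalog endpoint; confirm against the OpenRouter docs.
MODELS_URL = "https://openrouter.ai/api/v1/models"


def is_free(model: dict) -> bool:
    """True when the ID ends in :free or both token prices parse to zero."""
    if str(model.get("id", "")).endswith(":free"):
        return True
    pricing = model.get("pricing") or {}
    try:
        return float(pricing["prompt"]) == 0.0 and float(pricing["completion"]) == 0.0
    except (KeyError, TypeError, ValueError):
        return False  # missing or malformed pricing: do not assume free


def free_model_ids(models: list) -> list:
    """Sorted IDs of every catalog entry that looks free."""
    return sorted(m["id"] for m in models if "id" in m and is_free(m))


def fetch_models(url: str = MODELS_URL) -> list:
    """Download the catalog; assumes a JSON body shaped like {"data": [...]}."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        return json.load(resp).get("data", [])
```

Calling `free_model_ids(fetch_models())` should print the current candidates; treat the result as a snapshot, since free routes rotate.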
Step 3: Send a test request with curl
curl https://openrouter.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $OPENROUTER_API_KEY" \
  -H "Content-Type: application/json" \
  -H "HTTP-Referer: https://aitoolsradar.org" \
  -H "X-Title: AI Tools Radar Test" \
  -d '{
    "model": "REPLACE_WITH_FREE_MODEL_ID",
    "messages": [
      {"role": "user", "content": "Reply with exactly: openrouter ok"}
    ]
  }'
Swap REPLACE_WITH_FREE_MODEL_ID for the slug you copied.
Expected result: JSON with choices[0].message.content containing your reply.
Common mistake: Forgetting the Authorization header. You get 401 with a short error body.
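If you script the check, a small parser makes the success and failure cases explicit. This is a sketch: the `choices[0].message.content` path comes from the expected result above, while the error shape `{"error": {"code": ..., "message": ...}}` is an assumption to verify against a real failed response.

```python
def extract_reply(payload: dict) -> str:
    """Return the assistant text, or raise with whatever error detail is present."""
    if "error" in payload:
        err = payload["error"]
        # Assumed error shape: {"error": {"code": ..., "message": ...}}
        if isinstance(err, dict):
            raise RuntimeError(f"API error {err.get('code')}: {err.get('message')}")
        raise RuntimeError(f"API error: {err}")
    choices = payload.get("choices") or []
    if not choices:
        raise RuntimeError("Response has no choices; log the full body")
    content = choices[0].get("message", {}).get("content")
    return (content or "").strip()
```

Pass it the decoded JSON body from the curl call, for example `extract_reply(json.loads(body))`.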
Step 4: Use the OpenAI Python SDK
Install once:
pip install openai
Script:
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)
resp = client.chat.completions.create(
    model="REPLACE_WITH_FREE_MODEL_ID",
    messages=[
        {"role": "user", "content": "Write a one-line Python hello world."}
    ],
    extra_headers={
        "HTTP-Referer": "https://aitoolsradar.org",
        "X-Title": "AI Tools Radar Script",
    },
)
print(resp.choices[0].message.content)
Expected result: Terminal prints code or text from the free model.
Common mistake: Pointing the default OpenAI client at api.openai.com while using an OpenRouter model slug. Base URL must be OpenRouter.
Step 5: Route drafts cheap, finals expensive
A pattern we use in June 2026:
- Classify the job: draft vs final, public vs confidential.
- Draft with a free or Flash model (deepseek-v4-flash, Llama free route, Qwen coder when listed at $0).
- Final with gpt-5.5 or claude-opus-4-8 on OpenRouter or direct API.
- Log model per request in your app logs for cost audits.
Example two-step Python shape:
DRAFT_MODEL = "REPLACE_WITH_FREE_OR_FLASH_ID"
FINAL_MODEL = "openai/gpt-5.5"  # verify slug on catalog

# chat(model, prompt) is a helper you supply, e.g. a thin wrapper around
# client.chat.completions.create from Step 4 that returns the reply text.

def draft(prompt: str) -> str:
    return chat(DRAFT_MODEL, prompt)

def finalize(draft_text: str, rubric: str) -> str:
    return chat(
        FINAL_MODEL,
        f"Improve this answer.\nRubric: {rubric}\n\n{draft_text}",
    )
Expected result: Lower monthly bill with acceptable quality on internal tools.
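The "classify the job" step can be made explicit with a small policy function. This is a hypothetical sketch: the `Job` type and `pick_model` are our names, and sending confidential work to the paid route is our assumption. Your policy may require a direct API for that case instead.

```python
from dataclasses import dataclass

DRAFT_MODEL = "REPLACE_WITH_FREE_OR_FLASH_ID"
FINAL_MODEL = "openai/gpt-5.5"  # verify slug on catalog


@dataclass(frozen=True)
class Job:
    stage: str          # "draft" or "final"
    confidential: bool  # True when the prompt holds client or private data


def pick_model(job: Job) -> str:
    """Assumed policy: only non-confidential drafts go to the cheap route."""
    if job.stage not in ("draft", "final"):
        raise ValueError(f"unknown stage: {job.stage!r}")
    if job.stage == "draft" and not job.confidential:
        return DRAFT_MODEL
    return FINAL_MODEL
```

Keeping the decision in one function also gives you a single place to log the chosen model per request.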
OpenRouter pricing table (June 2026 signals)
Verify live before you publish a budget doc.
| Model class | Example catalog pattern | Input / output (typical) | Free? |
|---|---|---|---|
| Free promos | *:free suffix | $0 / $0 when listed | Yes, while listed |
| DeepSeek V4-Flash | deepseek/deepseek-v4-flash | Very low | Rarely free; cheap paid |
| DeepSeek V4-Pro | deepseek/deepseek-v4-pro | Low vs US frontier | No |
| Mistral Large | mistralai/mistral-large-* | Mid | No |
| GPT-5.5 | openai/gpt-5.5 | High | No |
| Claude Opus 4.8 | anthropic/claude-opus-4-8 | High | No |
| Llama 3.x instruct | meta-llama/* | Low to mid | Sometimes :free |
OpenRouter also shows per-request fees and context pricing on each model card. Long prompts on 1M-context models can still cost real money even when per-token rates look tiny.
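To see why long prompts add up, here is a back-of-envelope estimator. The prices in the usage note are made-up placeholders, not catalog values; read the real per-million rates and any per-request fee off the model card.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
    request_fee: float = 0.0,
) -> float:
    """Cost of one request given per-million-token prices in USD."""
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost + request_fee
```

With hypothetical rates of $0.10 per million input tokens and $0.30 per million output tokens, an 800,000-token prompt with a 2,000-token reply costs about $0.08 per call, so 500 such calls in a month come to roughly $40.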
Use OpenRouter in Cursor (optional)
- Open Cursor Settings → Models (wording may vary by version).
- Add an OpenAI-compatible custom provider if available, or use the OpenRouter base URL field when present.
- Base URL: https://openrouter.ai/api/v1
- API key: your OPENROUTER_API_KEY
- Model: paste the catalog slug.
Expected result: Inline chat uses the slug you entered.
Common mistake: Assuming every free model supports tool calling the way GPT-5.5 does. Run one tool-heavy prompt as a smoke test.
When to skip OpenRouter
- Your legal team requires a direct DPA with OpenAI or Anthropic only.
- You need a fixed model version for twelve months with no catalog renames.
- You run high-volume production traffic where router markup matters at scale and direct contracts are cheaper.
When you skip, still read the latest AI models hub for capability context.
Troubleshooting
| Problem | Fix |
|---|---|
| 401 Unauthorized | Check OPENROUTER_API_KEY export and Bearer header |
| 402 or credit errors | Add a small balance in Billing |
| Model not found | Copy slug from catalog again; retire old DeepSeek IDs before June 2026 sunset |
| Slow free tier | Queue is busy; retry off-peak or switch to cheap paid Flash |
| Empty or garbled tool JSON | Move agent steps to GPT-5.5 or Opus for tool calls |
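Several of these fixes amount to "try another route". A minimal fallback sketch, assuming you pass in your own `call(model)` function (for example a wrapper around the Step 4 client) that returns text or raises on failure; `first_success` is a name we made up for illustration.

```python
from typing import Callable, Sequence, Tuple


def first_success(
    models: Sequence[str], call: Callable[[str], str]
) -> Tuple[str, str]:
    """Try each model in order; return (model, reply) from the first that works."""
    errors = []
    for model in models:
        try:
            return model, call(model)
        except Exception as exc:  # broad on purpose: SDK error types vary
            errors.append(f"{model}: {exc}")
    raise RuntimeError("all routes failed: " + "; ".join(errors))
```

Order the list from free to cheap paid so a busy free tier degrades to a small bill instead of an outage, and log which model answered.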
Prompt templates for free-model workflows
Summarize logs (cheap)
Summarize this log in five bullets: first error, likely cause, suggested fix.
Do not invent file names not present in the log.
[paste log]
Draft only disclaimer
Draft an answer. Mark uncertain claims with [verify].
I will run a second pass on a frontier model.
Free model patterns we see in June 2026
The catalog rotates. These patterns repeat even when exact slugs change:
| Pattern | Example slug shape | Good for | Limits |
|---|---|---|---|
| OpenRouter free router | openrouter/free | Quick tests without picking a model | Rotates providers; behavior varies |
| DeepSeek V4 Flash :free | deepseek/deepseek-v4-flash:free | Coding drafts, log triage | Rate limits; verify tool calling |
| OpenAI OSS free | openai/gpt-oss-120b:free | Reasoning-heavy drafts | 131K context on card |
| NVIDIA Nemotron :free | nvidia/nemotron-3-super-120b-a12b:free | Agent-style tasks | Large MoE; check latency |
| Google Gemma 4 :free | google/gemma-4-31b-it:free | Multimodal + text Q&A | Check context window on card |
| Meta Llama instruct :free | meta-llama/llama-3.x-*-instruct:free | Brainstorming when listed | Often rotated off free tier |
How we pick a free route in practice
- Open the models page and filter price ascending.
- Copy three candidates and run the same task on each (one coding prompt).
- Log latency, refusal rate, and answer quality in a spreadsheet.
- Promote the winner to .env as DRAFT_MODEL for two weeks.
- Re-run step 1 every Monday; free routes disappear without email notice.
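The comparison run above can be sketched as a small timing harness. It assumes a `call(model, prompt)` function you provide that returns the reply text. It records latency and reply length only; refusal rate and answer quality still need a human pass.

```python
import time
from typing import Callable, Iterable, List


def time_candidates(
    models: Iterable[str],
    prompt: str,
    call: Callable[[str, str], str],
    clock: Callable[[], float] = time.perf_counter,
) -> List[dict]:
    """Run one prompt against each model and return rows sorted by latency."""
    rows = []
    for model in models:
        start = clock()
        reply = call(model, prompt)
        rows.append(
            {"model": model, "latency_s": clock() - start, "chars": len(reply)}
        )
    return sorted(rows, key=lambda row: row["latency_s"])
```

Each row maps directly to a spreadsheet line; add your own quality score column before picking a winner.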
Node.js and TypeScript snippet
For small internal tools:
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY!,
});
const model = process.env.DRAFT_MODEL ?? "REPLACE_WITH_FREE_MODEL_ID";
const completion = await client.chat.completions.create({
  model,
  messages: [{ role: "user", content: "List three risks of using free LLM APIs." }],
});
console.log(completion.choices[0]?.message?.content);
Expected result: Console prints a short list. Swap model at runtime for A/B tests.
Common mistake: Hardcoding a free slug in ten repos. Centralize DRAFT_MODEL in one secrets manager.
Security and key hygiene
- Rotate keys after any contractor offboarding or leaked screenshot.
- Scope keys per environment (dev, staging, prod) in the OpenRouter dashboard when available.
- Never send customer PII through free models to “save money.” Free tiers still leave your network.
- Log retention: OpenRouter may retain metadata per their policy. Read it before HIPAA or EU client work.
- Compare with direct OpenAI/Anthropic enterprise DPAs when legal asks “who are the subprocessors?”
We are not lawyers. When in doubt, block the router and use an approved direct API.
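One way to back the PII rule with code is a pre-send guard. This is a deliberately crude sketch: an email regex is only a stand-in for real PII detection, and `is_free_route` and `check_prompt` are hypothetical helpers, not OpenRouter features.

```python
import re

# Rough email pattern; real PII detection needs far more than this.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def is_free_route(model: str) -> bool:
    """True for slugs ending in :free and for the openrouter/free router."""
    return model.endswith(":free") or model == "openrouter/free"


def check_prompt(model: str, prompt: str) -> None:
    """Raise before sending if a free route would receive an email address."""
    if is_free_route(model) and EMAIL_RE.search(prompt):
        raise ValueError(
            "prompt appears to contain an email address; "
            "do not send it to a free route"
        )
```

Call `check_prompt(model, prompt)` right before the API request so a blocked prompt never leaves your process.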
Monitor spend before it surprises you
- Open Activity or Usage in the OpenRouter dashboard weekly.
- Set a credit alert if the UI offers it, or keep a manual $20 top-up cap for experiments.
- Tag requests in your app logs: model, route, user_id, feature.
- Graph tokens per feature. Free-to-paid promotion often happens when one feature's token usage spikes 10x.
- If spend jumps, check for an agent loop calling gpt-5.5 every thirty seconds before you blame the free tier.
Rule we use at AI Tools Radar: Free models for internal drafts only. Anything client-facing goes through a paid frontier route with a human spot-check.
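The per-feature tracking can be sketched with plain dictionaries. The `feature` field matches the log tags listed above; the `tokens` field and the week-over-week comparison are our assumptions about how your logs are shaped.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def tokens_by_feature(records: Iterable[dict]) -> Dict[str, int]:
    """Sum the (assumed) tokens field per feature tag."""
    totals: Dict[str, int] = defaultdict(int)
    for record in records:
        totals[record["feature"]] += record["tokens"]
    return dict(totals)


def spiking_features(
    last_week: Dict[str, int], this_week: Dict[str, int], factor: float = 10.0
) -> List[str]:
    """Features whose token total grew by at least `factor` week over week."""
    return sorted(
        feature
        for feature, total in this_week.items()
        if last_week.get(feature, 0) > 0 and total >= factor * last_week[feature]
    )
```

New features with no baseline are skipped by design; review those by hand the first week they appear.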
OpenRouter vs Together vs Groq vs Fireworks (one paragraph each)
Together is strong when you already host fine-tunes or want open models on their cloud. You lose the single catalog OpenRouter provides.
Groq wins on speed for the Llama and Mixtral models its chips support. Great for latency demos, but it does not always carry the full model list OpenRouter does.
Fireworks is another dev favorite for fast open-weight inference. Same story: compare catalog, not brand loyalty.
OpenRouter’s edge is one integration surface for experiments. Pick a second provider only when a specific model is not listed or enterprise pricing beats the router at your volume.
Variations
- Together, Groq, Fireworks: Same draft/final pattern without OpenRouter if your team already has credits there.
- Local Llama: Zero API cost if you have GPU time; slower setup.
- Direct DeepSeek API: Skip router markup when you only need one vendor; see DeepSeek comparison.
- LiteLLM proxy: Wrap OpenRouter behind LiteLLM if you want one internal gateway for ten microservices.
Verdict
Use OpenRouter free models when you are learning, prototyping, or shaving cost on non-critical drafts. Add paid credits when you ship agents, tool loops, or client-facing features. Pair with GPT-5.5 or Claude for finals when a free route drifts on code quality.
We keep a weekly eye on the catalog in our June Week 1 radar and bump this guide when major free routes change. Pair with GPT-5.5 for Excel (2026) for spreadsheet finals and Make Money with AI Tools (2026) if you sell API-assisted freelance work.
Changelog
- 2026-06-02: Fact-check. Updated verification date; refreshed June 2026 free-model examples (openrouter/free, deepseek-v4-flash:free, Gemma/Nemotron/OSS routes). Confirmed openai/gpt-5.5 on OpenRouter catalog.
- 2026-05-29: Initial publish. Documented account setup, curl and Python examples, pricing table patterns, Cursor notes, draft/final routing, eight FAQs.
Frequently asked
8 questions
Does OpenRouter have free models?
Yes. OpenRouter lists models with a free suffix tag or zero input/output pricing on the models page. Availability changes as providers add or remove promotions. Always read the live models catalog before you bake a free ID into production.
How do I get an OpenRouter API key?
Create an account at openrouter.ai, open Keys in the dashboard, and generate a key. Store it in an environment variable such as OPENROUTER_API_KEY. Do not commit keys to git. Add a small credit balance if a model requires it even when per-token price shows zero.
Is OpenRouter the same as OpenAI API?
No. OpenRouter exposes an OpenAI-compatible chat completions shape, but model IDs point at many vendors (DeepSeek, Meta, Google, Anthropic, OpenAI, Mistral, and others). You swap the base URL and model string. Billing goes through OpenRouter credits unless you use provider-specific routing options shown in their docs.
What is the best free model on OpenRouter for coding?
In June 2026 we reach for free or ultra-cheap routes on DeepSeek V4-Flash, Meta Llama class coders, and Qwen coder variants when they appear with free tags or near-zero pricing. Re-check the catalog weekly. Use a paid frontier model for final review on production code.
Why did my OpenRouter free request fail?
Common causes are exhausted rate limits, a deprecated model ID, missing credits on the account, or the provider pausing free inference. Retry with a different free ID or add $5 of credits. Log the full error body. OpenRouter returns vendor hints in JSON.
Can I use OpenRouter in Cursor?
Yes. Point Cursor or other OpenAI-compatible clients at https://openrouter.ai/api/v1 with your OpenRouter key and a model slug from their catalog. Some features need a paid model for tool calling reliability. Test tool use on your exact model before you depend on it for agents.
OpenRouter vs direct Anthropic or OpenAI API?
Use OpenRouter when you want one SDK and many models for experiments, free drafts, and quick A/B tests. Use direct APIs when you need enterprise DPAs, fixed model versions, or support tickets with a single vendor. Hybrid setups are normal in 2026.
How much does OpenRouter cost if I am not on free models?
You pay per model list price plus OpenRouter fees shown at checkout. DeepSeek and Mistral routes are usually the cheapest paid coding paths. GPT-5.5 and Claude Opus routes cost more per token. The dashboard usage page is the source of truth for your spend.