OpenRouter API dashboard showing free model list and API key setup

Guide

OpenRouter Free Models: Full List + Rate Limits Explained

Complete list of OpenRouter free models with rate limits explained. Covers coding performance, API key setup, model rotation, and fixing common errors.

AI Tools Radar Editorial May 29, 2026 Updated June 13, 2026 11 min read

Short answer: OpenRouter is the fastest way to call many LLMs through one API key, including free or :free model routes when providers allow it. Budget about 15 minutes to sign up, copy a key, and send your first request. Skill level: you can run curl or paste Python into a notebook.

We last verified the models page and a live API call on June 13, 2026. Free IDs change, so treat the table below as a pattern rather than a permanent list.

Last updated: June 13, 2026.

Quick list: free models on OpenRouter (June 2026)

Model	Slug pattern	Type	Best for
Qwen3-Coder	`qwen/qwen3-coder:free`	Free	Coding (our #1 free pick)
Poolside Laguna	`poolside/laguna-xs.2:free`	Free	Coding agents, inline suggestions
NVIDIA Nemotron 3 Ultra	`nvidia/nemotron-3-ultra-550b-a55b:free`	Free	1M-context agent tasks
Google Gemma 4	`google/gemma-4-31b-it:free`	Free	Multimodal + text Q&A
OpenAI OSS 120B	`openai/gpt-oss-120b:free`	Free	Reasoning-heavy drafts
Nous Hermes	`nousresearch/hermes-3-llama-3.1-405b:free`	Free	Generalist, uncensored drafts
Z AI GLM	`z-ai/glm-4.5-air:free`	Free	Lightweight general tasks
Meta Llama 3.3 70B	`meta-llama/llama-3.3-70b-instruct:free`	Free	Fallback (may rotate off free)
MiniMax M3	`minimax/minimax-m3`	Paid ($0.30/$1.20)	Cheapest 1M-context coding
Kimi K2.7 Code	`moonshotai/kimi-k2.7-code`	Paid ($0.95/$4.00)	Premium open-weight coding
DeepSeek V4-Pro	`deepseek/deepseek-v4-pro`	Paid (low)	Best value frontier model
GPT-5.5	`openai/gpt-5.5`	Paid (high)	Final patches, agent loops
Claude Opus 4.8	`anthropic/claude-opus-4-8`	Paid (high)	Production code, complex reasoning

Rate limits apply. Free models throttle under load. Add $5 of credits to avoid 402 errors. Slugs change weekly — always verify on openrouter.ai/models.

What you need

Item	Notes
OpenRouter account	openrouter.ai sign-up (email or OAuth)
API key	Dashboard → Keys → Create
Optional credits	Some “free” promos still need a positive balance for abuse prevention
HTTP client	`curl`, Python `openai` SDK, or your IDE
Model slug	Copy from the catalog, e.g. `qwen/qwen3-coder:free` or names ending in `:free`

OpenRouter models page with free-tier and pricing filters on openrouter.ai — OpenRouter models catalog showing free and paid routes. June 5, 2026 capture.

Quick comparison: free vs paid routing

Route type	Cost signal	Best for	Watch out for
`:free` or $0 listed models	$0 per token on catalog	Learning, drafts, personal scripts	Rate limits, sudden removal
Cheap paid (Flash, Mistral)	Cents per million tokens	Batch codegen, summaries	Tool-call quality varies
Frontier paid (GPT-5.5, Opus)	Dollars per million tokens	Final patches, agents	Still cheaper than wrong human hours if used narrowly

For how those models compare on coding, see DeepSeek V4 vs ChatGPT vs Claude and the latest AI models hub.

Step 1: Create an account and key

Go to openrouter.ai and sign in.
Open Keys in the dashboard.
Click Create Key, name it (for example, aitoolsradar-dev), and copy it once.
Export locally:

export OPENROUTER_API_KEY="sk-or-v1-xxxxxxxx"

Expected result: The keys page shows your new key with its create date. Revoke any key you pasted into a screenshot by mistake.

Common mistake: Committing the key to GitHub. Use .env and add .env to .gitignore.

Step 2: Find free models on the catalog

Open Models in the dashboard (or visit the public models page).
Sort or filter by price ascending.
Look for :free suffixes or $0 input and output on the row you want.
Copy the model ID string exactly, including the provider prefix.

Expected result: You have one slug ready for model in JSON, such as qwen/qwen3-coder:free or openrouter/free (auto-picks a free route) when offered.

Common mistake: Using an old blog slug after the provider renamed the checkpoint. The catalog wins over blog posts.

Step 3: Send a test request with curl

curl https://openrouter.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $OPENROUTER_API_KEY" \
  -H "Content-Type: application/json" \
  -H "HTTP-Referer: https://aitoolsradar.org" \
  -H "X-Title: AI Tools Radar Test" \
  -d '{
    "model": "REPLACE_WITH_FREE_MODEL_ID",
    "messages": [
      {"role": "user", "content": "Reply with exactly: openrouter ok"}
    ]
  }'

Swap REPLACE_WITH_FREE_MODEL_ID for the slug you copied.

Expected result: JSON with choices[0].message.content containing your reply.

Common mistake: Forgetting the Authorization header. You will get a 401 with a short error body.

Step 4: Use the OpenAI Python SDK

Install once:

pip install openai

Script:

import os
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

resp = client.chat.completions.create(
    model="REPLACE_WITH_FREE_MODEL_ID",
    messages=[
        {"role": "user", "content": "Write a one-line Python hello world."}
    ],
    extra_headers={
        "HTTP-Referer": "https://aitoolsradar.org",
        "X-Title": "AI Tools Radar Script",
    },
)

print(resp.choices[0].message.content)

Expected result: Terminal prints code or text from the free model.

Common mistake: Pointing the default OpenAI client at api.openai.com while using an OpenRouter model slug. The base URL must be OpenRouter.

Step 5: Route drafts cheap, finals expensive

Here is a pattern we use in June 2026:

Classify the job: draft vs final, public vs confidential.
Draft with a free model (moonshotai/kimi-k2.6:free for general work, qwen/qwen3-coder:free for code, poolside/laguna-xs.2:free for agentic coding, or any :free slug that handles your task).
Final with gpt-5.5 or claude-opus-4-8 on OpenRouter or direct API.
Log model per request in your app logs for cost audits.

Example two-step Python shape:

DRAFT_MODEL = "REPLACE_WITH_FREE_OR_FLASH_ID"
FINAL_MODEL = "openai/gpt-5.5"  # verify slug on catalog

def draft(prompt: str) -> str:
    return chat(DRAFT_MODEL, prompt)

def finalize(draft_text: str, rubric: str) -> str:
    return chat(
        FINAL_MODEL,
        f"Improve this answer.\nRubric: {rubric}\n\n{draft_text}",
    )

Expected result: Lower monthly bill with acceptable quality on internal tools.

OpenRouter pricing table (June 2026 signals)

Verify live before you publish a budget doc.

Model class	Example catalog pattern	Input / output (typical)	Free?
Free promos	`*:free` suffix	$0 / $0 when listed	Yes, while listed
Qwen3-Coder	`qwen/qwen3-coder:free`	$0 when `:free` is active	Yes, while listed
DeepSeek V4-Pro	`deepseek/deepseek-v4-pro`	Low vs US frontier	No
Mistral Large	`mistralai/mistral-large-*`	Mid	No
GPT-5.5	`openai/gpt-5.5`	High	No
Claude Opus 4.8	`anthropic/claude-opus-4-8`	High	No
Moonshot Kimi K2.6	`moonshotai/kimi-k2.6`	Mid (lost `:free` June 13)	No
Moonshot Kimi K2.7 Code	`moonshotai/kimi-k2.7-code`	Mid (new June 12)	No
MiniMax M3	`minimax/minimax-m3`	Low (promo $0.30/$1.20)	No

OpenRouter also shows per-request fees and context pricing on each model card. Long prompts on 1M-context models can still cost real money even when per-token rates look tiny.

Use OpenRouter in Cursor (optional)

Open Cursor Settings → Models (wording may vary by version).
Add an OpenAI-compatible custom provider if available, or use the OpenRouter base URL field when present.
Base URL: https://openrouter.ai/api/v1
API key: your OPENROUTER_API_KEY
Model: paste the catalog slug.

Expected result: Inline chat uses the slug you entered.

Common mistake: Assuming every free model supports tool calling the way GPT-5.5 does. Run one tool-heavy prompt as a smoke test.

When to skip OpenRouter

Your legal team requires a direct DPA with OpenAI or Anthropic only.
You need a fixed model version for twelve months with no catalog renames.
You run high-volume production traffic where router markup matters at scale and direct contracts are cheaper.

When you skip, still read the latest AI models hub for capability context.

Troubleshooting

Problem	Fix
401 Unauthorized	Check `OPENROUTER_API_KEY` export and Bearer header
402 or credit errors	Add a small balance in Billing
Model not found	Copy slug from catalog again; retire old DeepSeek IDs before June 2026 sunset
Slow free tier	Queue is busy; retry off-peak or switch to cheap paid Flash
Empty or garbled tool JSON	Move agent steps to GPT-5.5 or Opus for tool calls

Prompt templates for free-model workflows

Summarize logs (cheap)

Summarize this log in five bullets: first error, likely cause, suggested fix.
Do not invent file names not present in the log.

[paste log]

Draft only disclaimer

Draft an answer. Mark uncertain claims with [verify].
I will run a second pass on a frontier model.

Free model patterns we see in June 2026

The catalog rotates. These patterns repeat even when exact slugs change:

Pattern	Example slug shape	Good for	Limits
OpenRouter free router	`openrouter/free`	Quick tests without picking a model	Rotates providers; behavior varies
Qwen3-Coder `:free`	`qwen/qwen3-coder:free`	Best free option for code	1M context; verify tool calling
Moonshot Kimi K2.7 Code	`moonshotai/kimi-k2.7-code`	Premium open-weight coding: preserve_thinking, multimodal	$0.95/$4.00 per M; 256K context
MiniMax M3	`minimax/minimax-m3`	Cheapest 1M-context coding + multimodal open weights	$0.30/$1.20 promo; check license
Poolside Laguna `:free`	`poolside/laguna-xs.2:free`, `poolside/laguna-m.1:free`	Coding agents, inline suggestions	262K context; new on catalog
OpenAI OSS free	`openai/gpt-oss-120b:free`, `openai/gpt-oss-20b:free`	Reasoning-heavy drafts	131K context on card
NVIDIA Nemotron `:free`	`nvidia/nemotron-3-super-120b-a12b:free`, `nvidia/nemotron-3-ultra-550b-a55b:free`	Agent-style tasks, safety checks	Large MoE; check latency
Google Gemma 4 `:free`	`google/gemma-4-31b-it:free`, `google/gemma-4-26b-a4b-it:free`	Multimodal + text Q&A	Check context window on card
Z AI GLM `:free`	`z-ai/glm-4.5-air:free`	Lightweight general tasks	131K context
Nous Hermes `:free`	`nousresearch/hermes-3-llama-3.1-405b:free`	Generalist, uncensored drafts	131K context
Meta Llama instruct `:free`	`meta-llama/llama-3.3-70b-instruct:free`, `meta-llama/llama-3.2-3b-instruct:free`	Fallback only	Rotates off free tier quickly; less relevant in mid-2026

Moonshot Kimi chat interface showing long-document handling on kimi.moonshot.cn — Moonshot Kimi K2.6, our pick for best all-around free model on OpenRouter in June 2026. Screenshot from vendor site, captured June 5, 2026. UI and pricing may change.

How we pick a free route in practice

Open the models page and filter price ascending.
Copy three candidates with the same task (one coding prompt).
Log latency, refusal rate, and answer quality in a spreadsheet.
Promote the winner to .env as DRAFT_MODEL for two weeks.
Re-run step 1 every Monday; free routes disappear without email notice.

Node.js and TypeScript snippet

For small internal tools:

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY!,
});

const model = process.env.DRAFT_MODEL ?? "REPLACE_WITH_FREE_MODEL_ID";

const completion = await client.chat.completions.create({
  model,
  messages: [{ role: "user", content: "List three risks of using free LLM APIs." }],
});

console.log(completion.choices[0]?.message?.content);

Expected result: Console prints a short list. Swap model at runtime for A/B tests.

Common mistake: Hardcoding a free slug in ten repos. Centralize DRAFT_MODEL in one secrets manager.

Security and key hygiene

Rotate keys after any contractor offboarding or leaked screenshot.
Scope keys per environment (dev, staging, prod) in the OpenRouter dashboard when available.
Never send customer PII through free models to “save money.” Free tiers still leave your network.
Log retention: OpenRouter may retain metadata per their policy. Read it before HIPAA or EU client work.
Compare with direct OpenAI/Anthropic enterprise DPAs when legal asks “who is subprocessors?”

We are not lawyers. When in doubt, block the router and use an approved direct API.

Monitor spend before it surprises you

Open Activity or Usage in the OpenRouter dashboard weekly.
Set a credit alert if the UI offers it, or keep a manual $20 top-up cap for experiments.
Tag requests in your app logs: model, route, user_id, feature.
Graph tokens per feature. Free-to-paid promotion often happens when one feature 10x spikes tokens.
If spend jumps, check for an agent loop calling gpt-5.5 every thirty seconds before you blame the free tier.

Rule we use at AI Tools Radar: Free models for internal drafts only. Anything client-facing goes through a paid frontier route with a human spot-check.

OpenRouter vs Together vs Groq (one paragraph each)

Together is strong when you already host fine-tunes or want open models on their cloud. You lose the single catalog OpenRouter provides.

Groq wins on speed for supported Llama/Mixtral chips. Great for latency demos, not always the full model list OpenRouter carries.

Fireworks is another dev favorite for fast open-weight inference. Same story: compare catalog, not brand loyalty.

OpenRouter’s edge is one integration surface for experiments. Pick a second provider only when a specific model is not listed or enterprise pricing beats the router at your volume.

Variations

Together, Groq, Fireworks: Same draft/final pattern without OpenRouter if your team already has credits there.
Local Llama: Zero API cost if you have GPU time; slower setup.
Direct DeepSeek API: Skip router markup when you only need one vendor; see DeepSeek comparison.
LiteLLM proxy: Wrap OpenRouter behind LiteLLM if you want one internal gateway for ten microservices.

Verdict

Use OpenRouter free models when you are learning, prototyping, or shaving cost on non-critical drafts. Add paid credits when you ship agents, tool loops, or client-facing features. Pair with GPT-5.5 or Claude for finals when a free route drifts on code quality.

We keep a weekly eye on the catalog in our June Week 1 radar and bump this guide when major free routes change. Pair with GPT-5.5 for Excel (2026) for spreadsheet finals and Make Money with AI Tools (2026) if you sell API-assisted freelance work.

Changelog

2026-06-13: Live catalog check. Moonshot Kimi K2.6 removed from free list (lost :free tag). Added Kimi K2.7 Code and MiniMax M3 as paid open-weights options. Updated free model FAQ recommendation. Linked to full reviews for both new models.
2026-06-05: Live catalog check. Removed deepseek/deepseek-v4-flash:free (no longer :free). Added Qwen3-Coder, Poolside Laguna, Moonshot Kimi K2.6, GLM-4.5-Air, Nous Hermes, and expanded Nemotron/OSS/Gemma/Llama free slugs. Updated FAQ and pricing table.
2026-06-02: Fact-check. Updated verification date; refreshed June 2026 free-model examples (openrouter/free, deepseek-v4-flash:free, Gemma/Nemotron/OSS routes). Confirmed openai/gpt-5.5 on OpenRouter catalog.
2026-05-29: Initial publish. Documented account setup, curl and Python examples, pricing table patterns, Cursor notes, draft/final routing, eight FAQs.

Frequently asked

8 questions

Does OpenRouter have free models?

Yes. OpenRouter lists models with a free suffix tag or zero input/output pricing on the models page. Availability changes as providers add or remove promotions. Always read the live models catalog before you bake a free ID into production.

How do I get an OpenRouter API key?

Create an account at openrouter.ai, open Keys in the dashboard, and generate a key. Store it in an environment variable such as OPENROUTER_API_KEY. Do not commit keys to git. Add a small credit balance if a model requires it even when per-token price shows zero.

Is OpenRouter the same as OpenAI API?

No. OpenRouter exposes an OpenAI-compatible chat completions shape, but model IDs point at many vendors (DeepSeek, Meta, Google, Anthropic, OpenAI, Mistral, and others). You swap the base URL and model string. Billing goes through OpenRouter credits unless you use provider-specific routing options shown in their docs.

What is the best free model on OpenRouter for coding?

In June 2026 our first pick for free coding is **Qwen3-Coder** (`:free`). **Poolside Laguna** (`:free`) works well for coding agents and inline suggestions. **NVIDIA Nemotron 3 Ultra** (`:free`) handles 1M-context agent tasks at zero cost while the promo lasts. Note: Moonshot Kimi K2.6 lost its `:free` tag as of June 13. Use the new **Kimi K2.7 Code** (paid, $0.95/$4.00 per M tokens) or **MiniMax M3** (paid, $0.30/$1.20 promo) for frontier coding at open-weights pricing. Re-check the catalog weekly. Use a paid frontier model for final review on production code.

Why did my OpenRouter free request fail?

Common causes are exhausted rate limits, a deprecated model ID, missing credits on the account, or the provider pausing free inference. Retry with a different free ID or add $5 of credits. Log the full error body. OpenRouter returns vendor hints in JSON.

Can I use OpenRouter in Cursor?

Yes. Point Cursor or other OpenAI-compatible clients at https://openrouter.ai/api/v1 with your OpenRouter key and a model slug from their catalog. Some features need a paid model for tool calling reliability. Test tool use on your exact model before you depend on it for agents.

OpenRouter vs direct Anthropic or OpenAI API?

Use OpenRouter when you want one SDK and many models for experiments, free drafts, and quick A/B tests. Use direct APIs when you need enterprise DPAs, fixed model versions, or support tickets with a single vendor. Hybrid setups are normal in 2026.

How much does OpenRouter cost if I am not on free models?

You pay per model list price plus OpenRouter fees shown at checkout. DeepSeek and Mistral routes are usually the cheapest paid coding paths. GPT-5.5 and Claude Opus routes cost more per token. The dashboard usage page is the source of truth for your spend.