Free Qwen3 30B-A3B API on Cloudflare

Free Qwen3 30B-A3B API via Cloudflare Workers AI — efficient MoE reasoning and 100+ languages within 10,000 free Neurons/day.

About the Model

Why use Qwen3 30B-A3B?

Qwen3 30B-A3B is a mixture-of-experts model that activates roughly 3B of its 30B parameters per token, so it delivers strong reasoning, coding, and 100+ language support at the inference cost of a much smaller model.

Capabilities

Supports thinking and non-thinking modes, function calling, and broad multilingual tasks. Because only a fraction of its parameters is active per token, it is efficient enough for high-throughput apps.
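
In thinking mode, Qwen3 models wrap their reasoning in <think>...</think> tags ahead of the final answer. As a minimal sketch, assuming the reply text you receive contains that reasoning inline (this page does not confirm how Workers AI returns it), a hypothetical helper named split_thinking could separate the two parts:

```python
import re

# Qwen3 wraps thinking-mode reasoning in <think>...</think>. Whether a given
# provider returns that block inline in the reply is an assumption here.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(reply: str) -> tuple[str, str]:
    """Return (reasoning, answer) from a raw model reply.

    If no <think> block is present, reasoning is empty and the reply is
    returned unchanged apart from surrounding whitespace.
    """
    reasoning = "\n".join(m.strip() for m in _THINK_RE.findall(reply))
    answer = _THINK_RE.sub("", reply).strip()
    return reasoning, answer
```

Showing only the answer to end users while logging the reasoning separately is one way such a helper might be used.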

Why it's free

Requests to the /ai/run endpoint draw on Cloudflare Workers AI's free allocation of 10,000 Neurons per day.

How to Access for Free (via Cloudflare Workers AI)

Free access on Cloudflare Workers AI

Call Qwen3 30B-A3B through Cloudflare's global edge using the unified /ai/run REST endpoint. Every Cloudflare account includes 10,000 Neurons per day for free, so MoE inference for reasoning and multilingual apps costs nothing until that daily allocation is used up.
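
To get a feel for how far the daily allocation stretches, the arithmetic can be sketched as below. The per-request Neuron cost is a hypothetical placeholder: the real figure depends on the model and on how many tokens each request consumes, and it is not stated on this page.

```python
DAILY_FREE_NEURONS = 10_000  # free daily allocation stated above


def requests_per_day(neurons_per_request: float, budget: int = DAILY_FREE_NEURONS) -> int:
    """Return how many whole requests fit inside the daily Neuron budget."""
    if neurons_per_request <= 0:
        raise ValueError("neurons_per_request must be positive")
    return int(budget // neurons_per_request)


# Hypothetical figure: suppose one request costs about 8 Neurons.
print(requests_per_day(8))  # 1250
```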

Authentication (BYOK)

Bring your own credentials: put your Cloudflare Account ID in the request URL and send an API token with Workers AI access in the Authorization: Bearer <token> header.
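
A minimal sketch of how those two credentials fit into a request, using the URL pattern and model ID from the examples on this page. The helper name build_request is hypothetical and only illustrates where each value goes:

```python
API_BASE = "https://api.cloudflare.com/client/v4"
MODEL = "@cf/qwen/qwen3-30b-a3b-fp8"  # model ID used in the examples on this page


def build_request(account_id: str, token: str, model: str = MODEL) -> tuple[str, dict[str, str]]:
    """Return the (url, headers) pair for a call to the /ai/run endpoint."""
    if not account_id or not token:
        raise ValueError("both account_id and token are required")
    # The Account ID goes in the URL path; the token goes in the header.
    url = f"{API_BASE}/accounts/{account_id}/ai/run/{model}"
    headers = {"Authorization": f"Bearer {token}"}
    return url, headers
```

Keeping both values in environment variables rather than in source code avoids committing them by accident.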

Code Examples

curl
curl https://api.cloudflare.com/client/v4/accounts/$CF_ACCOUNT_ID/ai/run/@cf/qwen/qwen3-30b-a3b-fp8 \
  -H "Authorization: Bearer $CF_API_TOKEN" \
  -d '{"messages":[{"role":"user","content":"Give me a regex for a valid email."}]}'
python
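import os

import requests

# Read credentials from the environment so they never appear in source code.
account = os.environ["CF_ACCOUNT_ID"]
token = os.environ["CF_API_TOKEN"]

resp = requests.post(
    f"https://api.cloudflare.com/client/v4/accounts/{account}/ai/run/@cf/qwen/qwen3-30b-a3b-fp8",
    headers={"Authorization": f"Bearer {token}"},
    json={"messages": [{"role": "user", "content": "Give me a regex for a valid email."}]},
    timeout=60,  # avoid hanging forever on a stalled connection
)
resp.raise_for_status()  # surface HTTP errors such as a rejected token
print(resp.json()["result"]["response"])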
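
The last line of the Python example assumes the reply text sits at result.response. As a hedged sketch, a more defensive reader could check the API's success flag first and fall back to an OpenAI-style choices list. The function name extract_text is hypothetical, and the fallback shape is an assumption rather than something this page confirms:

```python
from typing import Any


def extract_text(payload: dict[str, Any]) -> str:
    """Pull the reply text out of a decoded /ai/run JSON body."""
    if not payload.get("success", True):
        # Assumed envelope: failures are reported in an "errors" list.
        raise RuntimeError(f"Workers AI call failed: {payload.get('errors')}")
    result = payload.get("result") or {}
    if "response" in result:  # shape used by the example on this page
        return result["response"]
    # Assumed fallback: an OpenAI-style chat-completion shape.
    choices = result.get("choices") or []
    if choices:
        return choices[0]["message"]["content"]
    raise KeyError("no reply text found in result")
```

With the example above, this would be called as extract_text(resp.json()).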