Free QwQ 32B API on Cloudflare

Free QwQ 32B API via Cloudflare Workers AI — reasoning-focused model strong at math and code within 10,000 free Neurons/day.

About the Model

Why use QwQ 32B?

QwQ 32B is Qwen's reasoning-specialized model, designed to think step-by-step and excel at math, logic, and coding problems that trip up standard chat models.

Capabilities

Best for hard analytical tasks where deliberate, chain-of-thought reasoning beats fast answers.

Why it's free

Covered by Cloudflare Workers AI's 10,000 free Neurons per day via the /ai/run endpoint.

How to Access for Free (via Cloudflare Workers AI)

Free access on Cloudflare Workers AI

Call QwQ 32B through Cloudflare's global edge using the unified /ai/run REST endpoint. Every Cloudflare account includes 10,000 Neurons per day for free — ideal for step-by-step reasoning on math and code at no cost.

Authentication (BYOK)

Use your own Cloudflare Account ID and an API token with Workers AI access via Authorization: Bearer <token>.

Try it in your browser

Pick a model, paste your own free key, and run. Your key is sent once to call the provider and never stored on our servers.

The model's reply will stream here.

Code Examples

curl
curl https://api.cloudflare.com/client/v4/accounts/$CF_ACCOUNT_ID/ai/run/@cf/qwen/qwq-32b \
  -H "Authorization: Bearer $CF_API_TOKEN" \
  -d '{"messages":[{"role":"user","content":"If a train travels 60 km in 45 minutes, what is its speed in km/h?"}]}'
python
import os, requests

account = os.environ["CF_ACCOUNT_ID"]
token = os.environ["CF_API_TOKEN"]

resp = requests.post(
    f"https://api.cloudflare.com/client/v4/accounts/{account}/ai/run/@cf/qwen/qwq-32b",
    headers={"Authorization": f"Bearer {token}"},
    json={"messages": [{"role": "user", "content": "If a train travels 60 km in 45 minutes, what is its speed in km/h?"}]},
)
print(resp.json()["result"]["response"])