Chutes.ai

Name: Chutes.ai
Rating: 87

online

Decentralized inference on Bittensor. Pay-per-token, some free.

LLMest. 2024 · Decentralized (Bittensor)

/ 100 APIVault Score

Get API Key Read Docs

// At a glance

Free Tier

Some free models · $0–$0.30/MTok · no card

// Free tier details

Available Models

Llama 3.3 70BDeepSeek-R1Qwen3Mistral

Monthly Requests

Select models free; others $0–$0.30/MTok

No credit card needed

No phone verification

// Quick start

300">"text-purple-400">from openai 300">"text-purple-400">import OpenAI

client = OpenAI(
    api_key=300">"YOUR_CHUTES_KEY",
    base_url=300">"https://llm.chutes.ai/v1",
)

response = client.chat.completions.create(
    model=300">"unsloth/Llama-3.3-70B-Instruct",
    messages=[{300">"role": 300">"user", 300">"content": 300">"Hello."}],
)

print(response.choices[0].message.content)

// Overview

Chutes runs open-source models on donated and idle GPU capacity via the Bittensor network. Pricing is $0–$0.30/MTok depending on model — among the cheapest available. Some models remain free; the early-access 200 req/day free program ended Feb 2026.

// Pros

Cheapest inference on the market ($0–$0.30/MTok)
Decentralized — not dependent on one cloud
OpenAI-compatible endpoint

// Cons

Free 200 req/day program ended Feb 2026
Speed varies based on donated GPU availability
No uptime SLA

// Score breakdown

Reliability (35%) (from 1m ago health check)100/100

Free Tier Generosity (30%) (computed from quota, no-CC, no-phone fields)85/100

Documentation (20%) (human rating)78/100

Popularity (15%) (GitHub stars (log-normalised), or manual baseline)70/100

Methodology: apivault.directory/methodology

// Best for

Ultra-budget inferenceBittensor ecosystemDeepSeek / Llama at near-zero cost

// Recent changes

Feb 27, 2026Early Access 200 req/day free program discontinuedfree tier changed