Groq's LPU-based service offers some of the fastest free inference available, but it has rate limits and a smaller model selection. The providers below are speed-focused alternatives.
One API, 100+ models. Pay-as-you-go with no markup.
Open-weights frontier model. Industry-low pricing.
Open-source models at scale. $5 free credits.
Blazing fast OSS inference. $1 free credits.
2,000+ tokens/sec on Llama 70B. Free tier available.
RDU-powered inference. Llama 405B for free.