Groq's LPU (Language Processing Unit) delivers hundreds of tokens per second, making it the fastest hosted inference for open-source LLMs. The free tier is generous, stable, and production-ready for low-latency applications.
// Pros
Insane inference speed (500+ tokens/sec)
OpenAI-compatible API
Generous free tier with daily resets
// Cons
Smaller model selection vs OpenRouter
Rate limits can hit during peak hours
// Score breakdown
Reliability (35%) (from 3m ago health check)100/100
Free Tier Generosity (30%) (computed from quota, no-CC, no-phone fields)100/100
Documentation (20%) (human rating)100/100
Popularity (15%) (GitHub stars (log-normalised), or manual baseline)46/100