Huggingface

Community Evals: Because we're done trusting black-box leaderboards over the community

• Evaluation metrics saturated; MMLU >91%, GSM8K >94%, yet real‑world tasks still fail. • Inconsistent benchmark scores across papers, model cards, and platforms create no single t

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

• Open source is now the dominant strategy for Chinese AI firms, driving large‑scale deployment and integration. • DeepSeek leads Hugging Face followers, while Qwen ranks fourth, s