Releases

Model launches and the analysis that follows. Each post covers what shipped, why it matters, benchmark results, and practical testing notes.

3 posts

Showing 3 of 3

Release Aug 12, 2026

OpenAI GPT-5

Benchmark breakdown, practical testing notes, and competitive positioning for GPT-5. Where it leads, where it doesn't, and what it means for the frontier.

OpenAI GPT-5
Release Jul 28, 2026

Alibaba Qwen 3

Full benchmark analysis of the Qwen 3 family. Strong multilingual results, competitive coding, open weights.

Alibaba Qwen Qwen 3
Release Jun 30, 2026

Google Gemini 2.5 Pro

DeepMind's latest flagship. Native multimodal performance, massive context, and where it sits relative to the pack.

Google DeepMind Gemini 2.5