AI & LLMs
Llama 4 Scout outperforms GPT-4o on multi-step reasoning
Meta's compact Llama 4 variant hits SOTA on reasoning tasks using only 17B active parameters via a 16-expert mixture-of-experts (MoE) design.
An open-weight model at this quality means local inference without API costs, a real alternative to hosted APIs in production.
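The arithmetic behind "17B active parameters" is worth spelling out: with MoE, only the shared layers plus the routed experts run per token, so active parameters are a fraction of the total. The shared/per-expert sizes below are invented for illustration, not Scout's real breakdown; the routing count per token is also an assumption.

```python
# Illustrative MoE parameter arithmetic (all sizes are hypothetical,
# chosen only so the active count lands near the reported 17B).
num_experts = 16
experts_per_token = 1        # assumption: one routed expert per token
shared_params_b = 12.0       # hypothetical shared layers, in billions
per_expert_params_b = 5.0    # hypothetical per-expert FFN, in billions

# Total = shared + all experts; active = shared + routed experts only.
total_b = shared_params_b + num_experts * per_expert_params_b
active_b = shared_params_b + experts_per_token * per_expert_params_b
print(total_b, active_b)  # 92.0 17.0
```

The gap between the two numbers is the whole MoE bargain: you pay for total parameters in memory, but only the active slice in compute per token.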
Claude 4 Opus ships with 200K context and computer use 2.0
Anthropic releases Claude 4 Opus with native computer use, 200K token window, and significantly improved code generation benchmarks.
The context size alone unlocks whole-codebase reasoning — this changes what you can hand off to AI agents.
Mixture-of-Experts is now the default LLM architecture
Analysis of the top 20 new models released in Q1 2025 shows 17 use MoE. Dense transformers are becoming legacy.
Understanding MoE routing is now table stakes for anyone deploying or fine-tuning foundation models.
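The routing step that the takeaway above calls "table stakes" is small enough to sketch in full: a gating network scores every expert per token, the top-k scores are softmaxed, and the output is the probability-weighted sum of just those experts. This is a minimal NumPy sketch of generic top-k gating, not any specific model's router.

```python
import numpy as np

def moe_route(x, gate_w, expert_ws, top_k=2):
    """Top-k MoE routing: the gate picks k experts for this token and
    combines their outputs, weighted by renormalized gate probabilities."""
    logits = x @ gate_w                       # one score per expert
    top = np.argsort(logits)[-top_k:]         # indices of the k best experts
    probs = np.exp(logits[top] - logits[top].max())
    probs /= probs.sum()                      # softmax over selected experts only
    return sum(p * (x @ expert_ws[i]) for p, i in zip(probs, top))

rng = np.random.default_rng(0)
d, num_experts = 8, 16
x = rng.standard_normal(d)                    # one token embedding
gate_w = rng.standard_normal((d, num_experts))
expert_ws = rng.standard_normal((num_experts, d, d))
y = moe_route(x, gate_w, expert_ws)
print(y.shape)
```

Real routers add load-balancing losses and capacity limits on top of this, but the core deploy-time behavior (only k of the 16 expert matmuls run per token) is exactly what the sketch shows.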
OpenAI o3-mini API goes GA with structured output support
The reasoning model now supports JSON schema enforcement, making it practical for production pipelines that need deterministic output shapes.
Reasoning + structured output finally makes LLMs reliable enough to drive business logic without a validation layer.
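Concretely, "JSON schema enforcement" means you attach a schema to the request and the API constrains the model's output to match it. The sketch below shows the shape of such a request body; the schema itself is a made-up invoice-extraction example, and the model name is taken from the story above rather than verified against current API docs.

```python
import json

# Hypothetical extraction schema; "additionalProperties": False and the
# required list are what make the output shape deterministic.
invoice_schema = {
    "type": "object",
    "properties": {
        "customer": {"type": "string"},
        "total_cents": {"type": "integer"},
        "line_items": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {"sku": {"type": "string"},
                               "qty": {"type": "integer"}},
                "required": ["sku", "qty"],
                "additionalProperties": False,
            },
        },
    },
    "required": ["customer", "total_cents", "line_items"],
    "additionalProperties": False,
}

# Request body shape for structured outputs (a sketch, not a live call).
request_body = {
    "model": "o3-mini",
    "messages": [{"role": "user", "content": "Extract the order details."}],
    "response_format": {
        "type": "json_schema",
        "json_schema": {"name": "invoice", "strict": True,
                        "schema": invoice_schema},
    },
}

print(json.dumps(request_body, indent=2)[:60])
```

With `strict` schemas the parsing step downstream reduces to `json.loads` plus your business logic, which is what makes these models practical for pipelines.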