Discussion about this post

Ilia Karelin

Runway’s new model seems to be crazy good.

But also, DeepSeek is pricing their model like that? How many times is their model cheaper than Gemini or Sonnet? Crazy!

Neural Foundry

Excellent breakdown of the sparse-attention architecture in DeepSeek V3.2. What's particularly interesting about their post-training approach is how they're essentially distilling multiple specialized RL models into one unified system, which hints at a broader shift in how we might build general-purpose reasoning models going forward. The pricing war between DeepSeek and the Western labs will likely push model costs down faster than quality improvements can justify premium tiers.
