AI & LLMs
Claude Fable 5, DiffusionGemma 26B-A4B, Kimi K2.7 Code, NVIDIA 550B inference, Cohere North Mini Code
Jun 16, 2026 · 3m · claude-fable-5, kimi-k2-7-code
AI & LLMs
Kimi K2.7 Code: Moonshot's Open-Weight Code Model
Jun 14, 2026 · 3m · open-weight-models, code-generation
AI & LLMs
GLM-5.1 Community Drop: SWE-Bench Pro Scores Rival Closed Frontier Models
Jun 12, 2026 · 4m · open-weight-models, glm-5.1
AI & LLMs
June 2026 Model Release Analysis: Nemotron 3 Ultra 550B, Gemma 4 12B, Qwen3.7 Plus, MiniMax-M3
Jun 10, 2026 · 6m · nemotron-3-ultra, gemma-4-12b
AI & LLMs
GPT-4o mini and gpt-oss variants: weekly model, API, and tooling operational update
Jun 9, 2026 · 6m · openai-gpt-4o, gpt-oss-120b
AI & LLMs
Claude Sonnet 4.6 Default Midtier: 1M-Token Beta Context, Agent Improvements, and Operational Guidance
Jun 8, 2026 · 6m · claude-sonnet-4-6, anthropic
AI & LLMs
Claude Opus 4.7: What Platform Teams Must Track — Open Checkpoints, Agent Tooling, Inference Runtimes
Jun 6, 2026 · 6m · claude-opus-4-7, inference-runtimes
AI & LLMs
Opus 4.8, Gemma 4 (12B), MiniMax M3 1M-Token: Open-Weight & Enterprise AI Update
Jun 5, 2026 · 6m · llms, open-weight-models
AI & LLMs
Open-model benchmarks, agent tooling, and inference-efficiency trends shaping AI engineering (Late 2025–Early 2026)
Jun 2, 2026 · 6m · ai-llms, inference-efficiency
AI & LLMs
Designing Robust Multi-Provider LLM Platforms: Routing, RAG, and Inference Scaling
May 29, 2026 · 6m · ai-architecture, llm-platforms
AI & LLMs
Inference-Time Scaling, MoE, and Open-Weight LLMs: Practical Guide (2026)
May 27, 2026 · 6m · open-source-llms, inference-optimization
AI & LLMs
Open-weight MoE & Long-Context LLMs Powering Agentic Code Workflows (2025–26)
May 25, 2026 · 6m · open-llms, mixture-of-experts