Key Points
- Kimi.ai: Released and open‑sourced the Kimi‑K2.7‑Code model, reporting gains over K2.6 of +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite, plus less overthinking (a 30% reduction in a metric that is cut off in the source post).
- Google Research: Launched Gemini‑SQL2, a text‑to‑SQL capability powered by Gemini 3.1 Pro, and reports state‑of‑the‑art results on the BIRD benchmark.
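- OpenAI: Began rolling out the ability to save Codex rate‑limit resets for later use, starting Go, Plus, Pro, and Business users with one free reset. For the next two weeks, Plus and Pro users can also invite up to three friends to try Codex; when an invited friend sends a first Codex message, both get another banked reset.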
- Claude: Made dynamic workflows in Claude Code generally available, letting the model orchestrate parallel sub‑agents for complex tasks like codebase‑wide bug hunts and verify work before returning results.
Account Highlights
- Kimi.ai (@Kimi_Moonshot): 1 update.
- Google Research (@GoogleResearch): 1 update.
- OpenAI (@OpenAI): 2 updates.
- Claude (@claudeai): 1 update.
Updates by Account
Kimi.ai (@Kimi_Moonshot)
🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced!
🔷 Improved coding & agent performance over K2.6: +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite.
🔷 Reasoning efficiency: Less overthinking, with 30% lower https://t.co/jFS7I40avs
Google Research (@GoogleResearch)
🚀 Introducing Gemini-SQL2, our breakthrough text-to-SQL capability powered by Gemini 3.1 Pro! We've achieved state-of-the-art results on the highly competitive BIRD benchmark, translating natural language into execution-ready SQL queries. 🧵👇 https://t.co/HfO2ZW2pih
OpenAI (@OpenAI)
We heard you wanted to use Codex rate limit resets on your own time.
Starting today, we’re rolling out the ability to save rate limit resets to use later.
We’re starting Go, Plus, Pro, and Business users with one free reset: https://t.co/gucyTi04wc
For the next two weeks, Plus and Pro users can invite up to three friends to try Codex.
When a friend sends their first Codex message, you’ll both get another banked reset.
Claude (@claudeai)
Dynamic workflows in Claude Code are now generally available.
For complex tasks like codebase-wide bug hunts, Claude writes its own orchestration and runs subagents in parallel, verifying the work before it reaches you.
Read more: https://t.co/nbNpvkfRBZ