Tag: Pricing

AI Models 3 min read

Latest from X - 2026-06-30 to 2026-07-02

Google Research: Introduced TabFM, a foundation model for tabular data classification and regression that can generate high‑quality predictions on unseen tables in a single forward pass. ClaudeDevs: Raised Claude Platform API rate limits for all users, removed spend‑based tiers, and gave the latest Sonnet and Haiku models five‑times higher limits at the top tier. Google AI: Launched two workflow updates - one model for ultra‑fast image generation and another to instantly animate those images, both at a fraction of the usual cost, highlighted by Nano Banana 2 Lite. GitHub Changelog: Announced that Gemini 2.5 Pro and Gemini 3 Flash will be deprecated in all Copilot experiences on July 31 2026, urging migration to Gemini 3.1 Pro or Gemini 3.5 Flash beforehand. Z.ai: Rolled out ZCode, the official development environment for GLM‑5.2, offering 1.5× usage quota for GLM Coding Plan subscribers, BYOK support, and cross‑platform availability on macOS, Windows, and Linux. GitHub: Made Kimi Moonshot’s Kimi K2.7 Code generally available in GitHub Copilot as the first open‑weight model selectable in the model picker, noting lower cost with performance comparable to top‑tier models.

July 03, 2026

Google Research X Updates Claude

AI Models 4 min read

Previewing GPT‑5.6 Sol, Terra, and Luna

OpenAI has begun a limited preview of the GPT‑5.6 family: **Sol** (flagship), **Terra** (balanced, 2× cheaper than GPT‑5.5), and **Luna** (fast, low‑cost). Sol introduces a new “max” reasoning mode and an “ultra” mode that uses sub‑agents. Early benchmarks show state‑of‑the‑art performance on coding (Terminal‑Bench 2.1), biology (GeneBench v1), and cybersecurity (ExploitBench, ExploitGym). The models ship with OpenAI’s most robust safety stack to date, but the preview may block or delay some requests.

June 26, 2026

ChatGPT Claude Codex

AI Models 2 min read

Latest from X - 2026-06-26

OpenAI: OpenAI announced a limited preview of the GPT‑5.6 family—Sol, Terra and Luna. Sol is the new flagship, described as a step‑function improvement over GPT‑5.5. Terra delivers performance competitive with GPT‑5.5 at roughly 2× lower cost, and Luna is the most cost‑efficient model, offering strong capability at the lowest price. The company plans to make the three models generally available in the coming weeks, but the preview is currently limited to a small group of trusted partners in Codex and the API at the request of the U.S. government.

June 26, 2026

OpenAI GPT X Updates

AI Models 4 min read

GLM‑5.2 Brings 1M‑Token Context to Coding Agents

GLM‑5.2 is Z.ai’s newest open‑source model designed for long‑horizon coding tasks. It supports a solid 1 million‑token context, introduces flexible “effort” levels for balancing speed and capability, and uses the IndexShare architecture to cut per‑token FLOPs by 2.9×. Benchmarks show it outperforms its predecessor (GLM‑5.1) and ranks as the strongest open‑source coding model, closing the gap to leading closed‑source systems.

June 21, 2026

Claude Claude Code Codex

Coding 2 min read

Latest from X - 2026-06-12 to 2026-06-13

OpenCode: Kimi 2.7 Code is now available in Go with image support, optimized for coding and priced similarly to 2.6. Kimi.ai: Builders using the Kimi K2.7 Code API can earn 20%–30% extra quota by topping up $100+ before July 2, with one bonus per account. ollama: The Kimi‑K2.7‑Code model is now hosted on Ollama’s US cloud on NVIDIA B300 GPUs, keeping data private and never used for training; try it with “ollama launch claude --model”. Z.ai: GLM‑5.2, the new flagship model, is now accessible to all GLM Coding Plan users—including Lite, Pro, Max, and Team tiers.

June 13, 2026

Kimi OpenCode X Updates

Open Source 3 min read

Latest from X - 2026-06-09 to 2026-06-11

GitHub announces that Agentic Workflows are now in public preview, offering intelligent automations with guardrails, observability, and cost controls. GitHub Changelog notes the workflows can now use the built‑in GITHUB_TOKEN instead of personal access tokens, improving security and simplicity. OpenCode reports that DeepSeek V4 Pro, Fable 5, and the North Mini Code model (256K context, fully open source) are now available on its platform. OpenRouter launches an Activity explorer that shows real‑time spending, token usage, cache hit rates, agents, and trends for models like Fable. RyanLee shares that his high‑performance MSA kernel library is open‑source and that the M3 weights are expected to be released on Friday, with a link to the GitHub paper.

June 11, 2026

GitHub Open Source X Updates

AI Models 3 min read

Gemini 3.5 Live Translate Brings Near Real‑Time Speech Translation to Developers

Google announced Gemini 3.5 Live Translate, an audio model that streams speech‑to‑speech translation in real time for more than 70 languages. It is available now in public preview via the Gemini Live API, in private preview for Google Meet, and globally in the Google Translate mobile app.

June 09, 2026

Gemini Coding Agents

Coding 3 min read

Latest from X - 2026-06-04

NVIDIA AI announced the release of Nemotron 3 Ultra, a 550 B MoE model that speeds inference fivefold, lowers agentic task costs up to 30 % and excels at coding, deep research, and long‑horizon planning. OpenCode noted that Nemotron 3 Ultra is now free with 1 M context and fully open source. Ollama said the model is available on its cloud platform, offering launch commands for Claude, Hermes and OpenClaw. OpenAI introduced a new memory system for ChatGPT that automatically tracks important details, doubles memory capacity for Plus and Pro users in the US, and lets users review and steer remembered content via a summary. Cursor added an interactive context‑usage report in its canvas, breaking down token distribution across prompts, tools, rules and skills.

June 04, 2026

NVIDIA AI X Updates OpenCode

Coding 3 min read

Qwen 3.7‑Plus: Multimodal Coding Agent with Vision‑Language Upgrade

Qwen 3.7‑Plus is a new multimodal agent model that adds vision capabilities to the strong text backbone of Qwen 3.7. It can read screens, interact with GUIs, and generate code from visual references while keeping the coding and tool‑use strengths of its predecessor. Benchmarks show notable gains in several coding‑related tasks, especially in terminal‑based and spreadsheet benchmarks.

June 02, 2026

Coding Agents Benchmarks

AI Models 6 min read

Latest from X - 2026-06-01 to 2026-06-02

Qwen: introduces Qwen3.7-Plus, a multimodal agent model that unifies vision and language with both GUI and CLI operation and serves as a coding and productivity assistant. OpenAI: frontier models and Codex are now generally available on AWS via Amazon Bedrock, extending enterprise security, compliance, and governance workflows. xAI: Composer 2.5 is now inside Grok Build, described as a fast, highly intelligent model for long‑running tasks and complex instructions. LangChain: highlights Fleet for secure agent access to private resources and adds LangSmith LLM Gateway spend limits that return a 402 error when caps are hit. Google Antigravity: is becoming a scientific workbench with a Science Skills bundle that runs complex workflows like protein analysis using Alpha* models and dozens of databases; Google Gemma: releases the first gemma‑skills iteration, enabling agents to build with Gemma, use MTP for speed, pick model size, and locate up‑to‑date resources. ClaudeDevs: resets 5‑hour and weekly rate limits for Pro/Max plans and fixes excessive parallel subagents; Cursor: raises usage limits for Teams and adds a Premium seat with 5× usage at 3× cost; Visual Studio Code: demos orchestrating agents via the VS Code Agents window; NVIDIA: adds real‑time AI media tools including Synthetic Video Detector (up to 92% accuracy, 22 ms latency), RTX Video Super Resolution and Frame Generation; Vercel: enables remote execution of Conductor’s parallel coding agents on fast Sandboxes; Perplexity: launches Search as Code, a new architecture that writes Python to call its search stack directly, now default in the Perplexity Agent API.

June 02, 2026

Qwen X Updates OpenAI

Coding 1 min read

Latest from X - 2026-05-29 (ClaudeDevs)

ClaudeDevs (@ClaudeDevs) announced an update to Opus 4.8, which now allows for the addition of system instructions mid-conversation without breaking the prompt cache. This improvement is expected to reduce the cost and latency of API requests. The update aims to enhance the efficiency of the system.

May 30, 2026

ClaudeDevs X Updates Coding

AI Models 2 min read

Claude Code Releases

Claude Code has released version 2.1.152, which includes several new features and bug fixes. The update applies review findings to the working tree after a review, simplifies code, and improves the user experience.

May 29, 2026

Claude Claude Code Agents

AI Models 4 min read

Latest from X - 2026-05-29

OpenRouter now supports "apply_patch," a server tool that lets models propose file edits using V4A diffs through the Responses API. The model generates a patch, and OpenRouter validates the diff syntax server-side. This feature allows for more efficient and accurate file editing. xAI has released grok-build-0.1 in public beta via the xAI API. This model powers the Grok Build CLI and excels at agentic coding, priced at $1/m input and $2/m output. Google AI has released an episode of Release Notes featuring the architects of Gemini, including @JeffDean, @koraykv, @OriolVinyalsML, and @NoamShazeer. They discuss their journey and the people behind the model. LangChain has released LangSmith LLM Gateway, which enforces spend limits and redacts PII before requests reach the model. They also announced Deep Agents v0.6, which makes harness profiles a first-class abstraction, allowing for production-grade performance at lower costs. NVIDIA has announced a new era of PC, but the details are unclear. OpenAI has launched Rosalind Biodefense to help trusted builders develop new biodefense and pandemic preparedness capabilities. They are also expanding trusted access to GPT-Rosalind for select U.S. government and allied partners.

May 29, 2026

OpenRouter X Updates Grok

AI Models 2 min read

MiniMax Drops State-of-the-Art AI Agent Model - Then Quietly Changes the License

MiniMax has released a new AI agent model, MiniMax M2.7, which matches top closed models in benchmarks. However, the model's license has been changed to restrict commercial use, sparking controversy among developers.

May 27, 2026

MiniMax Coding Agents

Coding 2 min read

GitHub Spark: AI-Powered App Development

GitHub Spark is a platform for building intelligent apps with AI capabilities. It allows users to create full-stack applications using natural language, visual tools, or code, with instant previews and one-click deployment.

May 27, 2026

Claude Claude Code Coding

AI Models 2 min read

GitHub Models for AI Development

GitHub Models is a new feature on GitHub that allows developers to manage and compare AI prompts, access multiple leading models through a single API key, and move from testing to production within the same environment.

May 27, 2026

Claude Claude Code Coding