Model Signal logo Model Signal Fast, verified AI updates
Coding

Latest from X - 2026-06-04

3 min read

Key Points

  • NVIDIA AI announced the release of Nemotron 3 Ultra, a 550 B MoE model that speeds inference fivefold, lowers agentic task costs up to 30 % and excels at coding, deep research, and long‑horizon planning.
  • OpenCode noted that Nemotron 3 Ultra is now free with 1 M context and fully open source.
  • Ollama said the model is available on its cloud platform, offering launch commands for Claude, Hermes and OpenClaw.
  • OpenAI introduced a new memory system for ChatGPT that automatically tracks important details, doubles memory capacity for Plus and Pro users in the US, and lets users review and steer remembered content via a summary.
  • Cursor added an interactive context‑usage report in its canvas, breaking down token distribution across prompts, tools, rules and skills.

Account Highlights

Updates by Account

NVIDIA AI (@NVIDIAAI)

Today we're shipping Nemotron 3 Ultra.

A 550B MoE frontier-intelligence open model built for long-running agents.

It delivers 5x faster inference and lowers the cost of complex agentic tasks by up to 30% versus other open frontier models. https://t.co/FEXqvfzQFO

Ultra excels at complex tasks like coding and deep research.

Long-running agents spend their time planning, using tools, recovering from failures, and deciding what to do next.

The model’s hybrid Mamba-Transformer MoE architecture enables more reasoning cycles within the same https://t.co/3YMvngUShw

Nemotron 3 Ultra delivers leading accuracy for agentic tasks, including agent productivity, coding, and long horizon planning. https://t.co/j2g21JTrL3

Beyond benchmark performance, Ultra can work through large codebases, reason across long chains of tool calls, and synthesize information gathered from hundreds of sources. https://t.co/itDu34WVHk

OpenCode (@opencode)

Nemotron 3 Ultra is now free on OpenCode

text · 1M context · fully open source

NVIDIA's latest open source model

ollama (@ollama)

NVIDIA’s Nemotron 3 Ultra is available on Ollama’s cloud!

Try it 👇

Claude Code:

ollama launch claude --model nemotron-3-ultra:cloud

Hermes Agent:

ollama launch hermes --model nemotron-3-ultra:cloud

OpenClaw:

ollama launch openclaw --model nemotron-3-ultra:cloud https://t.co/weiKLF1FQD https://t.co/zX7gQ6MoaY

OpenAI (@OpenAI)

The new memory system will keep track of important details automatically. If you prefer the legacy saved memories experience, you can switch back in settings.

The new memory system is rolling out to Plus and Pro users in the US today, along with 2x more memory.

To access it on

With the new memory system, you can review and steer what ChatGPT remembers through a memory summary, with more visibility and control over how context is used. https://t.co/kXMAds0g3q

Cursor (@cursor_ai)

Cursor can now show your agent's context usage as an interactive report in a canvas.

The context explorer breaks down where tokens go across the system prompt, tool definitions, rules, skills, and more. https://t.co/FccilnZzOz

Sources

  1. https://x.com/NVIDIAAI/status/2062521325076299981
  2. https://x.com/NVIDIAAI/status/2062521332563087775
  3. https://x.com/NVIDIAAI/status/2062521339693445395
  4. https://x.com/NVIDIAAI/status/2062521374090940721
  5. https://x.com/opencode/status/2062570516586573998
  6. https://x.com/ollama/status/2062591290743853291
  7. https://x.com/OpenAI/status/2062567561276100809
  8. https://x.com/OpenAI/status/2062567559673856346
  9. https://x.com/cursor_ai/status/2062611886370337103