Key Points
- NVIDIA AI announced the release of Nemotron 3 Ultra, a 550 B MoE model that speeds inference fivefold, lowers agentic task costs up to 30 % and excels at coding, deep research, and long‑horizon planning.
- OpenCode noted that Nemotron 3 Ultra is now free with 1 M context and fully open source.
- Ollama said the model is available on its cloud platform, offering launch commands for Claude, Hermes and OpenClaw.
- OpenAI introduced a new memory system for ChatGPT that automatically tracks important details, doubles memory capacity for Plus and Pro users in the US, and lets users review and steer remembered content via a summary.
- Cursor added an interactive context‑usage report in its canvas, breaking down token distribution across prompts, tools, rules and skills.
Account Highlights
- NVIDIA AI (@NVIDIAAI): 4 updates.
- OpenCode (@opencode): 1 update.
- ollama (@ollama): 1 update.
- OpenAI (@OpenAI): 2 updates.
- Cursor (@cursor_ai): 1 update.
Updates by Account
NVIDIA AI (@NVIDIAAI)
Today we're shipping Nemotron 3 Ultra.
A 550B MoE frontier-intelligence open model built for long-running agents.
It delivers 5x faster inference and lowers the cost of complex agentic tasks by up to 30% versus other open frontier models. https://t.co/FEXqvfzQFO
Ultra excels at complex tasks like coding and deep research.
Long-running agents spend their time planning, using tools, recovering from failures, and deciding what to do next.
The model’s hybrid Mamba-Transformer MoE architecture enables more reasoning cycles within the same https://t.co/3YMvngUShw
Nemotron 3 Ultra delivers leading accuracy for agentic tasks, including agent productivity, coding, and long horizon planning. https://t.co/j2g21JTrL3
Beyond benchmark performance, Ultra can work through large codebases, reason across long chains of tool calls, and synthesize information gathered from hundreds of sources. https://t.co/itDu34WVHk
OpenCode (@opencode)
Nemotron 3 Ultra is now free on OpenCode
text · 1M context · fully open source
NVIDIA's latest open source model
ollama (@ollama)
NVIDIA’s Nemotron 3 Ultra is available on Ollama’s cloud!
Try it 👇
Claude Code:
ollama launch claude --model nemotron-3-ultra:cloud
Hermes Agent:
ollama launch hermes --model nemotron-3-ultra:cloud
OpenClaw:
ollama launch openclaw --model nemotron-3-ultra:cloud https://t.co/weiKLF1FQD https://t.co/zX7gQ6MoaY
OpenAI (@OpenAI)
The new memory system will keep track of important details automatically. If you prefer the legacy saved memories experience, you can switch back in settings.
The new memory system is rolling out to Plus and Pro users in the US today, along with 2x more memory.
To access it on
With the new memory system, you can review and steer what ChatGPT remembers through a memory summary, with more visibility and control over how context is used. https://t.co/kXMAds0g3q
Cursor (@cursor_ai)
Cursor can now show your agent's context usage as an interactive report in a canvas.
The context explorer breaks down where tokens go across the system prompt, tool definitions, rules, skills, and more. https://t.co/FccilnZzOz
Sources
- https://x.com/NVIDIAAI/status/2062521325076299981
- https://x.com/NVIDIAAI/status/2062521332563087775
- https://x.com/NVIDIAAI/status/2062521339693445395
- https://x.com/NVIDIAAI/status/2062521374090940721
- https://x.com/opencode/status/2062570516586573998
- https://x.com/ollama/status/2062591290743853291
- https://x.com/OpenAI/status/2062567561276100809
- https://x.com/OpenAI/status/2062567559673856346
- https://x.com/cursor_ai/status/2062611886370337103