AI News & Analysis
Curated takes on AI industry news with perspective
Codex Adds Windows Computer Use + ChatGPT Mobile Windows Connections for On-the-Go Steering
OpenAI says Codex now supports Computer use on Windows, so it can test apps, debug flows, and review work on your Windows machine. Codex in the ChatGPT mobile app can also connect to Windows machines, letting you steer tasks from your phone.
Claude Code adds dynamic workflows (research preview) for large parallel agent runs
Dynamic workflows in Claude Code let Claude write orchestration scripts, fan out work across tens to hundreds of parallel subagents, verify results, and resume long-running jobs.
Claude Opus 4.8 Fast Mode: 2.5x Faster Output Tokens in Research Preview
Anthropic launched Fast mode for Claude Opus 4.8 in research preview, promising 2.5x faster output token speeds with the same Opus-level intelligence. It is available now in Claude Code for developers with extra usage enabled, and on the Claude Platform API through an account manager or a waitlist form.
OpenAI's Secure MCP Tunnel: Connect Private MCP Servers Over Outbound-Only HTTPS
OpenAI published a Secure MCP Tunnel guide that connects private and on-prem MCP servers to ChatGPT and Codex without opening inbound firewall ports. A tunnel-client polls OpenAI for work over outbound HTTPS, forwards JSON-RPC requests to the local server, and posts responses back through the same tunnel.
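The poll, forward, and post-back cycle described above can be sketched in a few lines. This is an illustration of the pattern only, not OpenAI's tunnel-client: `FakeRelay`, `local_mcp_server`, `run_tunnel_once`, and the `echo` method are hypothetical names, and an in-memory queue stands in for the outbound HTTPS connection.

```python
import json
import queue
from typing import Callable, Optional


class FakeRelay:
    """Hypothetical in-memory stand-in for the remote end of the tunnel."""

    def __init__(self) -> None:
        self.pending: "queue.Queue[str]" = queue.Queue()
        self.responses: list[str] = []

    def poll(self) -> Optional[str]:
        # A real client would make an outbound HTTPS request here.
        try:
            return self.pending.get_nowait()
        except queue.Empty:
            return None

    def post(self, body: str) -> None:
        # A real client would send the response back over the same tunnel.
        self.responses.append(body)


def local_mcp_server(request: dict) -> dict:
    """Toy local server: handles one made-up method, rejects everything else."""
    if request.get("method") == "echo":
        return {"jsonrpc": "2.0", "id": request["id"], "result": request.get("params")}
    return {
        "jsonrpc": "2.0",
        "id": request.get("id"),
        "error": {"code": -32601, "message": "Method not found"},
    }


def run_tunnel_once(relay: FakeRelay, handler: Callable[[dict], dict]) -> bool:
    """Poll for one request, forward it locally, post the response back.

    Returns False when there was no work to do.
    """
    raw = relay.poll()
    if raw is None:
        return False
    response = handler(json.loads(raw))
    relay.post(json.dumps(response))
    return True
```

Because the client only ever initiates connections, nothing in this loop requires an inbound port on the private network, which is the property the guide emphasizes.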
Claude Code ships a security-guidance plugin for in-session vulnerability checks
Anthropic shipped an official security-guidance plugin for Claude Code. It runs automatic vulnerability checks while Claude edits files, at the end of each turn, and when Claude runs commits or pushes through its Bash tool.
Claude Agent SDK Gets a Monthly Credit on Paid Claude Plans Starting June 15, 2026
Anthropic is bundling a dedicated monthly credit for programmatic Claude usage into Pro, Max, Team, and Enterprise plans starting June 15, 2026. The credit covers Claude Agent SDK projects, `claude -p` non-interactive Claude Code, Claude Code GitHub Actions, and third-party apps built on the Agent SDK, with amounts ranging from $20 on Pro to $200 on Max 20x and seat-based Enterprise Premium.
OpenAI Codex adds Locked computer use on Mac (keep Computer Use running after lock)
Codex can now keep using Mac apps after your screen locks. Locked computer use is a narrow, Codex-only unlock path with a short-lived authorization window and safeguards like relocking on local input.
Claude Managed Agents Add Self-Hosted Sandboxes (Public Beta) and MCP Tunnels (Research Preview)
Anthropic says Claude Managed Agents can now run tool execution in a sandbox you control (public beta) and connect to private MCP servers via MCP tunnels (research preview). The update targets enterprise security requirements by keeping execution and private services within an organization’s perimeter.
Cursor Launches Composer 2.5: Better Long-Running Agent Work, New Pricing Tiers, and 2× Included Usage This Week
Cursor has released Composer 2.5, calling it its most powerful Composer model yet. Cursor says it is more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions. The launch post also says included usage is doubled for the first week and lists token pricing for Standard and Fast modes.
Claude Code 2.1.142: `claude agents` Gains Session Flags, Fast Mode Defaults to Opus 4.7, MCP Tool Timeout Honored
Anthropic shipped Claude Code 2.1.142 on May 14, 2026. The release adds eight session-configuration flags to `claude agents` (`--add-dir`, `--settings`, `--mcp-config`, `--plugin-dir`, `--permission-mode`, `--model`, `--effort`, `--dangerously-skip-permissions`), flips fast mode's default model from Opus 4.6 to Opus 4.7, and fixes `MCP_TOOL_TIMEOUT` not raising the per-request fetch timeout for remote HTTP/SSE MCP servers -- a regression that capped tool calls at 60 seconds regardless of configuration.
Claude Code Weekly Limits +50% Through July 13, 2026: Live for Pro, Max, Team, and Seat-Based Enterprise
Anthropic increased Claude Code weekly limits by 50% on May 13, 2026, through July 13, 2026. The lift is live now for Pro, Max, Team, and seat-based Enterprise users. It comes about a week after Anthropic doubled five-hour rate limits on the same plans and removed peak-hours limit reduction for Pro and Max -- so the practical headroom for heavy Claude Code users is materially larger than it was in early May.
Codex in the ChatGPT Mobile App: Preview Lets You Run, Review, and Steer Codex from Your Phone
OpenAI put Codex in the ChatGPT mobile app as a preview on May 14, 2026. You start work, review outputs, steer execution, and approve next steps from your phone while Codex keeps running on a laptop, Mac mini, or devbox. The preview is rolling out on iOS and Android across all plans (including Free and Go) in supported regions, with Windows host support coming soon.
Codex Chrome Extension: How Codex Drives a Signed-In Browser for LinkedIn, Salesforce, Gmail, and Internal Tools
OpenAI's Codex Chrome extension lets the agent use Chrome for browser tasks that need signed-in state -- LinkedIn, Salesforce, Gmail, internal tools. It is available in the Codex app in all regions except the EU and UK at launch. Setup is Codex > Plugins > add Chrome > install extension > approve permissions, and you invoke it with @Chrome. By default Codex asks before each new website; allowlist/blocklist and elevated-risk options live in Computer Use settings.
Codex Hooks and Programmatic Access Tokens: How OpenAI Is Making Codex Easier to Automate Around Your Code
OpenAI is positioning Codex as an automatable platform. Hooks let you inject scripts at key points in the agent loop -- validators, secret scanning, conversation logging, per-repo behavior. Programmatic access tokens give Business and Enterprise teams scoped credentials they can use in CI, release jobs, and internal automations, created from the ChatGPT admin console with finite expirations and revocation. Both are live in the official Codex developer docs.
Cursor Bugbot Adds Effort Levels: Default, High, and Custom (Usage-Based Billing Required)
Cursor's May 11, 2026 changelog introduces three effort levels for Bugbot PR review -- Default (current behavior, optimized for speed), High (more reasoning, more bugs found, more expensive), and Custom (natural-language rules that decide effort per review). Customization requires usage-based billing. Cursor cites 0.7 bugs per run at Default versus 0.95 at High, with 79% of Default findings resolved at merge.
Codex CLI 0.130.0 Adds `remote-control`, Richer Plugin Sharing Metadata, and Better App-Server Thread Paging
OpenAI shipped Codex CLI 0.130.0 in May 2026. The release adds a new `codex remote-control` command for starting a headless, remotely controllable app-server, improves app-server clients with paging options for large threads (unloaded/summary/full turn items), expands plugin sharing with link metadata and discoverability controls, adds Bedrock auth support for AWS console-login credentials from `aws login` profiles, and fixes several app-server/thread reliability issues.
Gemini Interactions API: Steps Schema, `response_format`, and a June 8, 2026 Legacy Sunset
Google is rolling out breaking changes to the Gemini v1beta Interactions API that replace the `outputs` array with a `steps` array, remove `response_mime_type` in favor of a polymorphic `response_format`, and introduce new streaming event types. For REST users, the new schema becomes the default on May 26, 2026, and legacy behavior is removed on June 8, 2026; older Python/JS SDKs (1.x) also break on June 8.
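During the transition window, client code that parses raw REST responses may see either shape. A minimal compatibility sketch, assuming the response is already decoded into a dict and that the array sits at the top level under `steps` (new) or `outputs` (legacy); the helper name `interaction_items` is hypothetical and the shape of the individual items is deliberately left opaque:

```python
from typing import Any


def interaction_items(response: dict[str, Any]) -> list[Any]:
    """Return the item array from either the new or the legacy schema.

    Prefers `steps` when present and falls back to `outputs`.
    Item contents are passed through untouched.
    """
    if "steps" in response:
        return list(response["steps"])
    return list(response.get("outputs", []))
```

A shim like this only covers the renamed array; the `response_format` change and the new streaming event types need their own handling.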
Claude Code 2.1.133: `worktree.baseRef` Default Returns to `origin/<default>`, MCP OAuth Proxy Honored Across the Whole Flow
Anthropic shipped Claude Code 2.1.133 on May 7, 2026. The headline is a worktree-base behavior change: a new `worktree.baseRef` setting (`fresh` | `head`) defaults to `fresh`, which moves `EnterWorktree`'s base back to `origin/<default>` after several months of branching from local `HEAD`. The release also routes `HTTP(S)_PROXY` / `NO_PROXY` / mTLS through the entire MCP OAuth flow (discovery, dynamic client registration, token exchange, refresh), exposes effort level to hooks via `$CLAUDE_EFFORT`, adds Linux sandbox path overrides, and fixes a refresh-token race that was 401-ing parallel sessions.
Codex CLI 0.129.0 Adds Modal Vim Composer, Redesigned Resume/Fork Picker, and a `/hooks` Browser
OpenAI shipped Codex CLI 0.129.0 on May 7, 2026. The release brings modal Vim editing to the TUI composer via `/vim`, a redesigned resume/fork picker with raw scrollback and workspace-aware `/diff`, a new `/hooks` browser with before/after compaction support, expanded plugin management with workspace sharing and access controls, theme-aware status lines, and Codex Apps auth surfaced through Guardian. It also includes a long list of bug fixes across Linux/Windows sandboxes, MCP, and TUI input handling.
Cursor adds enterprise model controls, soft spend limits, and richer usage analytics
Cursor’s May 4, 2026 update adds granular model/provider access controls for Enterprise admins, introduces soft spend limits with automated alerts, and expands usage analytics so admins can break consumption down by product surface (including Cloud Agents, Bugbot, and Security Review).
Warp Goes Open Source: AGPL Client, MIT UI Framework, and a New `settings.toml`
On April 27, 2026 (changelog v0.2026.04.27.15.32) Warp open-sourced its client at github.com/warpdotdev/warp under AGPL v3, with the `warpui` UI framework crates released under MIT. The same release adds a TOML settings file editable from the settings page or by asking Warp's agent. The server stays closed-source. OpenAI is the founding sponsor.
Codex CLI 0.128.0 Lands Persisted `/goal` Workflows: Ralph-Style Agents That Don't Stop Until Done
OpenAI shipped Codex CLI 0.128.0 in April 2026 with a new `/goal` system that keeps a goal alive across turns and runs the agent until it is achieved. The release adds app-server APIs and model tools for goals, runtime continuation, and TUI controls to create, pause, resume, and clear goals. Felipe Coury credits co-worker Eric Traut (the Pyright lead) for the design, and frames it as Codex's take on the Ralph loop pattern.
Cursor SDK Lands in Public Beta: Programmatic Agents in TypeScript with Local and Cloud Runtimes
Cursor launched the Cursor SDK in public beta on April 29, 2026, exposing the same agent runtime that powers Cursor desktop, CLI, and web behind a TypeScript package. `@cursor/sdk` lets you spawn agents against local files, Cursor-hosted VMs, or self-hosted workers, stream results, and bill on standard token-based pricing -- moving Cursor from an editor surface to a programmable platform.
Anthropic's Claude Code Post-Mortem: Three Engineering Missteps Behind the Spring 2026 Quality Decline
Anthropic published a post-mortem on April 23, 2026 explaining the Claude Code quality regression that ran from early March through mid-April: a March 4 default-effort downgrade from high to medium, a March 26 caching change that wiped reasoning history every turn, and an April 16 verbosity prompt that capped responses at 25 words between tool calls. All three were resolved by April 20, the API was unaffected, and Anthropic reset usage limits for all subscribers.
Cursor 3.2 Adds /multitask Async Subagents, Worktrees Polish, and Multi-Root Workspaces
Cursor 3.2 shipped on April 24, 2026 with three changes that pull cross-repo, parallel agent work into the default flow: a new `/multitask` command that fans a request out to async subagents instead of queueing, an improved worktrees experience that runs isolated branch work in the background, and multi-root workspaces so a single agent session can target frontend, backend, and shared-library folders at once.
GPT-5.5 Is Here: State-of-the-Art Agentic Coding, 1M Context, and a New Pro Tier
OpenAI launched GPT-5.5 on April 23, 2026 -- its smartest model yet, with state-of-the-art scores on Terminal-Bench 2.0 (82.7%), GDPval (84.9%), and OSWorld-Verified (78.7%), per-token latency on par with GPT-5.4, and a new GPT-5.5 Pro tier for harder work. It is available in ChatGPT and Codex today, with API access at $5/M input and $30/M output coming soon.
Claude Design Launches: Anthropic Labs Turns Opus 4.7 Into a Prototype, Deck, and Wireframe Surface
Anthropic launched Claude Design on April 17, 2026 -- a research preview from Anthropic Labs that turns a prompt, uploaded image, or codebase into polished prototypes, pitch decks, and mockups. Powered by Claude Opus 4.7 vision, it learns your team's design system, exports to Canva, PDF, PPTX, or HTML, and packages finished designs for Claude Code handoff.
OpenAI Codex Goes 'For Almost Everything': Mac Computer Use, Browser Comment Mode, and Thread Automations Explained
OpenAI shipped a major Codex update on April 16, 2026 that pushes the product past coding into general work. Three changes matter: Codex can now drive your Mac apps directly, an in-app browser captures both screenshots and DOM elements through 'comment mode', and Codex threads can run continuously to watch Slack, email, and PRs. Here is how each works, the workflows that justify each one, and where Codex now sits relative to Perplexity Personal Computer and Claude Code Routines.
Cursor Self-Documentation: New Subagent-Powered Help Reads Cursor's Own Docs in Real Time
Cursor shipped a self-documentation feature on April 17, 2026: when you ask Cursor about its own features, capabilities, or settings, it now spawns a subagent that fetches the current Cursor docs and updates before answering. The change closes the most annoying gap in AI coding tools -- the model's training cutoff lagging the product's release cadence -- and is a small but telling preview of where AI tool documentation is heading across the industry.
Claude Opus 4.7 Is Here: State-of-the-Art Coding, xhigh Effort, and a New Cyber Safeguards Tier
Anthropic launched Claude Opus 4.7 on April 16, 2026 -- a notable improvement on Opus 4.6 in advanced software engineering, with the same pricing, a new xhigh effort level, /ultrareview in Claude Code, higher-resolution vision, and the first deployment of cyber safeguards from the Mythos Preview track.
Inside Claude Code's Rebuilt Desktop: Parallel Agents, Drag-Drop Panes, Side Chat
Anthropic rebuilt the Claude Code desktop app on April 14, 2026 around parallel agent workflows. The new app adds a multi-session sidebar, drag-and-drop pane layout, an in-app file editor, integrated terminal, side chat for asides, three view modes, and SSH support on macOS -- making the orchestrator role the default.
Claude Code Routines: Schedule, API, and GitHub-Trigger Your AI Agents
Claude Code Routines is Anthropic's new way to run saved Claude Code configurations automatically -- by schedule, API call, or GitHub event. Routines run on Anthropic's cloud infrastructure with a prompt, repo, and MCP connectors. They are available in research preview on Pro, Max, Team, and Enterprise plans.
AI Tools Landscape: What Changed in Early 2026
The first quarter of 2026 brought three major shifts to the AI tools landscape: MCP became a mainstream standard adopted by most coding tools, AI coding assistants matured beyond autocomplete into full workflow partners, and the first generation of truly autonomous AI agents started shipping in production environments.