AI News & Analysis

Curated takes on AI industry news with perspective

openaiJul 16, 2026

OpenAI Introduces GPT-Red: An Internal Automated Red-Teaming Model for Prompt Injection

OpenAI published details on GPT-Red, an internal automated safety red-teaming model designed to find prompt injection vulnerabilities at scale and strengthen defenses before broader deployment.

cursorJul 12, 2026

Cursor Adds GPT-5.6 Sol, Terra, and Luna

Cursor says the GPT-5.6 model family (Sol, Terra, Luna) is now available in Cursor and selectable from the model picker, with a published CursorBench score for Sol.

cursorJul 11, 2026

Cursor Adds Side Chats: Durable Agent Threads You Can @-Mention Back Into the Main Conversation

Cursor introduced Side chats: separate, durable agent conversations that run alongside a main chat and can be @-mentioned to pull context back.

openaiJul 10, 2026

OpenAI Launches ChatGPT Work: A Codex-Sibling Agent for Long-Running Deliverables

OpenAI introduced ChatGPT Work, a new agent in ChatGPT powered by Codex and GPT-5.6 that takes action across apps and files and stays on a project for hours to turn a goal into finished work. Work handles research and deliverables; Codex stays the dedicated software-development agent.

anthropicJul 10, 2026

Claude Adds Reflect: A Monthly Recap and Usage Dashboard for How You Use Claude

Anthropic introduced Reflect, a beta dashboard in Claude's Settings that shows a monthly recap of when you use Claude most and what you worked on, with quiet hours and break nudges. It runs only when memory is on and excludes incognito chats and health integration content.

anthropicJul 8, 2026

Claude Cowork is coming to mobile and web (beta rollout starting with Max)

Anthropic says Claude Cowork will expand beyond desktop to mobile and web, starting with a beta rollout for Max plan users and expanding to more plans over the next several weeks.

openaiJul 7, 2026

OpenAI adds GPT-Realtime-2.1-mini to the API with reasoning and tool use

OpenAI says GPT-Realtime-2.1-mini is now available in the API, bringing reasoning and tool use to its Realtime mini lineup at the same cost as GPT-Realtime-mini. The announcement came from the OpenAI Developers account; platform docs did not yet list the model at publish time, so treat details beyond the announcement as unconfirmed.

anthropicJul 5, 2026

Claude Code Artifacts Expand to Pro and Max Plans (Private by Default)

Anthropic's ClaudeDevs says Claude Code Artifacts are now available on Pro and Max plans. The Claude Code docs now list Pro/Max/Team/Enterprise as eligible, with Pro/Max artifacts remaining private to the individual user.

anthropicJul 4, 2026

Anthropic raises Claude API rate limits and updates how tiers work

Anthropic says it has raised Claude Platform API rate limits for all users and simplified usage tiers. The company's developer announcement says tiers are no longer based on API spend, and points users to updated Claude Platform rate limit documentation for current per-model RPM and token throughput limits.

openaiJul 3, 2026

OpenAI introduces GeneBench-Pro, a research-level benchmark for agentic computational biology

On June 30, 2026, OpenAI announced GeneBench-Pro: a research-level benchmark meant to measure how well AI agents navigate messy biological data and make the judgment calls real computational biology depends on. OpenAI says GeneBench-Pro contains 129 questions and is open-sourcing 10 representative case studies as a public package on Hugging Face under the MIT License.

claudeJun 30, 2026

Claude Desktop for Linux Beta: Ubuntu and Debian, With Caveats

Anthropic shipped a beta of the Claude desktop app for Linux on Ubuntu 22.04+ and Debian 12+ (x86_64 or arm64). It gives the same Chat, Cowork, and Claude Code experience as macOS and Windows on all paid plans. Install via Anthropic's apt repo to get updates -- the app does not self-update, and a raw .deb install gets no updates. Computer Use and Dictation are not in the Linux beta yet, and only Debian-based distros are supported today.

claudeJun 30, 2026

Claude Science Beta: An AI Workbench for Reproducible Research

Anthropic launched Claude Science in public beta -- a research environment, not a model. It runs analyses, queries 60+ scientific databases, and traces every step from data wrangling to publication, with code, environment, and conversation provenance attached to each artifact. It runs on your own infrastructure (laptop, HPC, GPU clusters) and submits jobs over SSH, Slurm, or Modal. Available now on macOS and Linux for Pro, Max, Team, and Enterprise plans.

claudeJun 30, 2026

Claude Sonnet 5 Launch: Anthropic's Most Agentic Sonnet, Now the Default Tier

Anthropic launched Claude Sonnet 5 on June 30, 2026, calling it its most agentic Sonnet yet -- it plans, drives browsers and terminals, and runs autonomously. It is the default for Free and Pro, available across all plans, in Claude Code, and on the API as claude-sonnet-5, with introductory pricing of $2 per million input and $10 per million output tokens through August 31, 2026.

cursorJun 30, 2026

Cursor for iOS: Cloud Agents Go Mobile-First in Public Beta

Cursor shipped a native iOS app in public beta on all paid plans. It launches always-on cloud agents that run in isolated VMs with full dev environments, work asynchronously toward merge-ready PRs, and report back via Live Activities and push notifications. You can also remote-control agents on your computer, pick any frontier model, use voice and slash commands, review diffs and demos, leave follow-ups, and merge PRs from the phone. Composer 2.5 runs are 75% off in the app through July 5, 2026.

codexJun 29, 2026

Codex Permission Profiles: Least-Privilege Controls for Local Agent Work

OpenAI shipped Codex permission profiles in beta -- reusable, inheritable policies that replace the coarse sandbox_mode/sandbox_workspace_write combo. A profile binds OS-enforced filesystem read/write/deny rules (down to **/*.env) to per-domain network and Unix-socket rules. Enterprise admins get fail-closed allowlists via requirements.toml. Profiles govern local sandboxed command execution only, not MCP servers, app connectors, browser, or cloud.

anthropicJun 28, 2026

Claude Code Adds Artifacts: Live, Shareable Pages for PR Walkthroughs and Dashboards

Anthropic introduced Artifacts in Claude Code, letting Team and Enterprise orgs turn an in-progress Claude Code session into a live web page that updates as the session progresses and can be shared privately within the organization.

openaiJun 27, 2026

OpenAI Previews GPT-5.6: Sol, Terra, and Luna in Limited Preview

OpenAI announced a limited preview of the GPT-5.6 family: Sol, a next-generation frontier flagship OpenAI calls a step function better than GPT-5.5; Terra, a balanced model competitive with GPT-5.5 at 2x lower cost; and Luna, its most cost-efficient model. Access starts with trusted partners in Codex and the API.

openaiJun 26, 2026

OpenAI Ships a New GPT-5.5 Instant in ChatGPT: Better Intent, Constraints, and Shopping/Local Recs

OpenAI says a new version of GPT-5.5 Instant is rolling out in ChatGPT with better intent understanding, more reliable handling of complex constraints, and improved shopping and local recommendations. Per OpenAI, it reaches paid users first and free users the next day. GPT-5.5 Instant is ChatGPT's default for logged-in users.

cursorJun 25, 2026

You Can Now Delegate Tasks to Cursor From Inside Notion

Notion used the Cursor SDK to embed coding agents directly in its workspace. You can now tag Cursor in a doc, mention it in a thread, or assign it a database issue, and Cursor plans, builds, tests, verifies its own work, and opens a PR. Notion says it built the integration in a few weeks on the Cursor SDK.

claude-designJun 19, 2026

Claude Design's `/design-sync` Makes Claude Design and Claude Code a Two-Way Workflow

Anthropic's `/design-sync` pulls your design system into Claude Design so everything Claude builds starts from your real components, and keeps work synced as you move between Claude Design and Claude Code. You can start in either surface, hand a finished design off to Claude Code, and continue from existing work instead of a screenshot.

codexJun 19, 2026

OpenAI Codex Adds Record & Replay: Turn a Demonstrated Mac Workflow Into a Reusable Skill

OpenAI shipped Record & Replay in Codex app 26.616, a macOS feature that turns a workflow you demonstrate into a reusable skill. It builds on Computer Use, which you or your administrator must enable, and is unavailable at launch in the European Economic Area, the United Kingdom, and Switzerland.

cursorJun 17, 2026

Cursor announces Origin: code storage and Git hosting (waitlist)

Cursor says it's launching Origin, a new code storage and Git hosting product built for teams and agents, with availability planned for fall 2026 and a waitlist open now.

claude-fable-5Jun 13, 2026

Anthropic Abruptly Suspends Fable 5 and Mythos 5 Access After US Government Directive

Anthropic says a US government export control directive ordered it to suspend all access to Claude Fable 5 and Mythos 5 by any foreign national, inside or outside the US. To comply, Anthropic is disabling both models for all customers. It says access to every other Anthropic model is unaffected, and that it is working to restore Fable 5 and Mythos 5 as soon as possible.

openaiJun 12, 2026

OpenAI: Responses API web search can now return image results

OpenAI added image results to the Responses API web_search tool, letting apps retrieve web-grounded visuals (with source links) alongside regular text results.

cursorJun 11, 2026

Cursor Bugbot gets faster and cheaper, adds /review command + incremental review

Cursor's June 10, 2026 update claims Bugbot PR reviews now finish ~3x faster (about 90s average vs ~5 minutes), cost ~22% less per run, and find ~10% more bugs per review. Cursor also added a /review command so you can run Bugbot and Security Review before pushing, plus an option to review only what changed since the last review.

openaiJun 9, 2026

ChatGPT 'Dreaming': OpenAI's New Memory Architecture Curates What It Remembers in the Background

OpenAI rolled out a more capable, compute-efficient ChatGPT memory architecture built on 'dreaming' -- a background process that curates memories by referencing chat history without prompting. It carries context forward better, follows preferences across conversations, and updates memories as time passes. Plus and Pro users in the US first, with Free and international users following.

claude-fable-5Jun 9, 2026

Claude Fable 5 and Mythos 5: Mythos-Class Capability Goes General, With Caveats

Anthropic launched Claude Fable 5, a Mythos-class model made safe for general use and available today as claude-fable-5, plus Claude Mythos 5 for vetted cyberdefenders via Project Glasswing. Pricing is $10 per million input and $50 per million output tokens, with free subscription access ending June 23 and a mandatory 30-day data-retention policy on all Mythos-class traffic.

codexJun 9, 2026

Codex for Every Role: Role-Specific Plugins, Codex Sites, and Annotations Beyond Code

OpenAI is pushing Codex past software development with three releases: six role-specific plugins bundling 62 apps and 110 skills, Codex Sites that turn analysis into shareable hosted web apps in preview for business and enterprise, and annotations that now refine documents, spreadsheets, and presentations -- not just code and websites.

codexJun 6, 2026

Codex Build iOS Apps Plugin: Mirror the Simulator in the Browser and Hot-Reload SwiftUI Previews

OpenAI's Build iOS Apps plugin lets Codex mirror the iOS Simulator in the in-app browser and hot-reload package-backed SwiftUI previews without leaving Codex. It packages Swift and iOS workflows -- designing App Intents and Shortcuts, building and refactoring SwiftUI, auditing performance, and debugging on simulators through XcodeBuildMCP-backed flows. The plugin is open source in OpenAI's plugins repo.

cursorJun 6, 2026

Cursor Shared Canvases: Publish an Agent Canvas and Share It With Your Team via URL

Cursor added shared canvases -- you can now share a canvas from Cursor with your team by generating a link to a live snapshot that teammates open in the browser. Recipients view it read-only in the Cursor Dashboard, so you distribute a working dashboard or report instead of a full chat thread. Shared canvases are available on Pro, Teams, and Enterprise plans.

anthropicJun 5, 2026

Claude Platform's 'ant' CLI Brings the Full Claude API to Your Terminal

Anthropic's Claude Platform documents 'ant', a command-line tool that exposes every Claude API resource as a subcommand. It sends Messages requests, browses responses, version-controls agents and environments, and runs a self-hosted Managed Agents worker, with Claude Code able to drive it natively.

cursorJun 3, 2026

Cursor 3.5 brings Automations into the Agents Window (plus multi-repo and no-repo automations)

Cursor 3.5 moves Automations into the Agents Window, adds multi-repo and no-repo automations, and introduces five new no-repo templates (with a 7-day 50% promo on agent runs for new automations).

openaiJun 2, 2026

Codex Adds Windows Computer Use + ChatGPT Mobile Windows Connections for On-the-Go Steering

OpenAI says Codex can now use Computer use on Windows to test apps, debug flows, and review work on your Windows machine, and that Codex in the ChatGPT mobile app can connect to Windows machines so you can steer tasks from your phone.

anthropicMay 31, 2026

Claude Code adds dynamic workflows (research preview) for large parallel agent runs

Dynamic workflows in Claude Code let Claude write orchestration scripts, fan out work across tens to hundreds of parallel subagents, verify results, and resume long-running jobs.

claudeMay 30, 2026

Claude Opus 4.8 Fast Mode: 2.5x Faster Output Tokens in Research Preview

Anthropic launched Fast mode for Claude Opus 4.8 in research preview, promising 2.5x faster output token speeds with the same Opus-level intelligence. It is available now in Claude Code for developers with extra usage enabled, and on the Claude Platform API through an account manager or a waitlist form.

codexMay 30, 2026

OpenAI's Secure MCP Tunnel: Connect Private MCP Servers Over Outbound-Only HTTPS

OpenAI published a Secure MCP Tunnel guide that connects private and on-prem MCP servers to ChatGPT and Codex without opening inbound firewall ports. A tunnel-client polls OpenAI for work over outbound HTTPS, forwards JSON-RPC requests to the local server, and posts responses back through the same tunnel.

claudeMay 29, 2026

Claude Code ships a security-guidance plugin for in-session vulnerability checks

Anthropic shipped an official security-guidance plugin for Claude Code. It runs automatic vulnerability checks while Claude edits files, at the end of each turn, and when Claude runs commits or pushes through its Bash tool.

claudeMay 26, 2026

Claude Agent SDK Gets a Monthly Credit on Paid Claude Plans Starting June 15, 2026

Anthropic is bundling a dedicated monthly credit for programmatic Claude usage into Pro, Max, Team, and Enterprise plans starting June 15, 2026. The credit covers Claude Agent SDK projects, `claude -p` non-interactive Claude Code, Claude Code GitHub Actions, and third-party apps built on the Agent SDK, with amounts ranging from $20 on Pro to $200 on Max 20x and seat-based Enterprise Premium.

codexMay 23, 2026

OpenAI Codex adds Locked computer use on Mac (keep Computer Use running after lock)

Codex can now keep using Mac apps after your screen locks. Locked computer use is a narrow, Codex-only unlock path with a short-lived authorization window and safeguards like relocking on local input.

claudeMay 22, 2026

Claude Managed Agents Add Self-Hosted Sandboxes (Public Beta) and MCP Tunnels (Research Preview)

Anthropic says Claude Managed Agents can now run tool execution in a sandbox you control (public beta) and connect to private MCP servers via MCP tunnels (research preview). The update targets enterprise security requirements by keeping execution and private services within an organization's perimeter.

cursorMay 21, 2026

Cursor Launches Composer 2.5: Better Long-Running Agent Work, New Pricing Tiers, and 2x Included Usage This Week

Cursor has released Composer 2.5, calling it its most powerful Composer model yet. Cursor says it's more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions. Cursor's launch post also says included usage is doubled for the first week. Cursor's blog post adds token pricing for Standard vs Fast modes.

claude-codeMay 15, 2026

Claude Code 2.1.142: `claude agents` Gains Session Flags, Fast Mode Defaults to Opus 4.7, MCP Tool Timeout Honored

Anthropic shipped Claude Code 2.1.142 on May 14, 2026. The release adds eight session-configuration flags to `claude agents` (`--add-dir`, `--settings`, `--mcp-config`, `--plugin-dir`, `--permission-mode`, `--model`, `--effort`, `--dangerously-skip-permissions`), flips fast mode's default model from Opus 4.6 to Opus 4.7, and fixes `MCP_TOOL_TIMEOUT` not raising the per-request fetch timeout for remote HTTP/SSE MCP servers -- a regression that capped tool calls at 60 seconds regardless of configuration.

claude-codeMay 15, 2026

Claude Code Weekly Limits +50%: Promo Extended Through July 19, 2026

Anthropic's +50% Claude Code weekly-limits promo, first announced May 13, 2026, is still live: Anthropic extended the window to July 19, 2026 at 11:59 PM PT, past the original July 13 deadline. It covers Pro, Max, Team, and seat-based Enterprise plans across the CLI, IDE extensions, desktop, and web. When it ends, weekly limits return to their standard levels.

codexMay 15, 2026

Codex in the ChatGPT Mobile App: Preview Lets You Run, Review, and Steer Codex from Your Phone

OpenAI put Codex in the ChatGPT mobile app as a preview on May 14, 2026. You start work, review outputs, steer execution, and approve next steps from your phone while Codex keeps running on a laptop, Mac mini, or devbox. Rolling out on iOS and Android across all plans (including Free and Go) in supported regions, with Windows host support coming soon.

codexMay 15, 2026

Codex Chrome Extension: How Codex Drives a Signed-In Browser for LinkedIn, Salesforce, Gmail, and Internal Tools

OpenAI's Codex Chrome extension lets the agent use Chrome for browser tasks that need signed-in state -- LinkedIn, Salesforce, Gmail, internal tools. Available in the Codex app in all regions except EU and UK at launch. Setup is Codex > Plugins > add Chrome > install extension > approve permissions. Invoke with @Chrome. By default Codex asks before each new website; allowlist/blocklist and elevated-risk options live in Computer Use settings.

codexMay 15, 2026

Codex Hooks and Programmatic Access Tokens: How OpenAI Is Making Codex Easier to Automate Around Your Code

OpenAI is positioning Codex as an automatable platform. Hooks let you inject scripts at key points in the agent loop -- validators, secret scanning, conversation logging, per-repo behavior. Programmatic access tokens give Business and Enterprise teams scoped credentials they can use in CI, release jobs, and internal automations, created from the ChatGPT admin console with finite expirations and revocation. Both are live in the official Codex developer docs.

cursorMay 12, 2026

Cursor Bugbot Adds Effort Levels: Default, High, and Custom (Usage-Based Billing Required)

Cursor's May 11, 2026 changelog introduces three effort levels for Bugbot PR review -- Default (current behavior, optimized for speed), High (more reasoning, more bugs found, more expensive), and Custom (natural-language rules that decide effort per review). Customization requires usage-based billing. Cursor cites 0.7 bugs per run at Default versus 0.95 at High, with 79% of Default findings resolved at merge.

codexMay 9, 2026

Codex CLI 0.130.0 Adds `remote-control`, Richer Plugin Sharing Metadata, and Better App-Server Thread Paging

OpenAI shipped Codex CLI 0.130.0 in May 2026. The release adds a new `codex remote-control` command for starting a headless, remotely controllable app-server, improves app-server clients with paging options for large threads (unloaded/summary/full turn items), expands plugin sharing with link metadata and discoverability controls, adds Bedrock auth support for AWS console-login credentials from `aws login` profiles, and fixes several app-server/thread reliability issues.

googleMay 9, 2026

Gemini Interactions API: Steps Schema, `response_format`, and a June 8, 2026 Legacy Sunset

Google is rolling out breaking changes to the Gemini v1beta Interactions API that replace the `outputs` array with a `steps` array, remove `response_mime_type` in favor of a polymorphic `response_format`, and introduce new streaming event types. For REST users, the new schema becomes the default on May 26, 2026, and legacy behavior is removed on June 8, 2026; older Python/JS SDKs (1.x) also break on June 8.

claude-codeMay 8, 2026

Claude Code 2.1.133: `worktree.baseRef` Default Returns to `origin/<default>`, MCP OAuth Proxy Honored Across the Whole Flow

Anthropic shipped Claude Code 2.1.133 on May 7, 2026. The headline is a worktree-base behavior change: a new `worktree.baseRef` setting (`fresh` | `head`) defaults to `fresh`, which moves `EnterWorktree`'s base back to `origin/<default>` after several months of branching from local `HEAD`. The release also routes `HTTP(S)_PROXY` / `NO_PROXY` / mTLS through the entire MCP OAuth flow (discovery, dynamic client registration, token exchange, refresh), exposes effort level to hooks via `$CLAUDE_EFFORT`, adds Linux sandbox path overrides, and fixes a refresh-token race that was 401-ing parallel sessions.

codexMay 8, 2026

Codex CLI 0.129.0 Adds Modal Vim Composer, Redesigned Resume/Fork Picker, and a `/hooks` Browser

OpenAI shipped Codex CLI 0.129.0 on May 7, 2026. The release brings modal Vim editing to the TUI composer via `/vim`, a redesigned resume/fork picker with raw scrollback and workspace-aware `/diff`, a new `/hooks` browser with before/after compaction support, expanded plugin management with workspace sharing and access controls, theme-aware status lines, and Codex Apps auth surfaced through Guardian. Plus a long bug-fix list across Linux/Windows sandboxes, MCP, and TUI input handling.

cursorMay 6, 2026

Cursor adds enterprise model controls, soft spend limits, and richer usage analytics

Cursor's May 4, 2026 update adds granular model/provider access controls for Enterprise admins, introduces soft spend limits with automated alerts, and expands usage analytics so admins can break consumption down by product surface (including Cloud Agents, Bugbot, and Security Review).

warpMay 3, 2026

Warp Goes Open Source: AGPL Client, MIT UI Framework, and a New `settings.toml`

On April 27, 2026 (changelog v0.2026.04.27.15.32) Warp open-sourced its client at github.com/warpdotdev/warp under AGPL v3, with the `warpui` UI framework crates released under MIT. The same release adds a TOML settings file editable from the settings page or by asking Warp's agent. The server stays closed-source. OpenAI is the founding sponsor.

codexMay 1, 2026

Codex CLI 0.128.0 Lands Persisted `/goal` Workflows: Ralph-Style Agents That Don't Stop Until Done

OpenAI shipped Codex CLI 0.128.0 in April 2026 with a new `/goal` system that keeps a goal alive across turns and runs the agent until it is achieved. The release adds app-server APIs and model tools for goals, runtime continuation, and TUI controls to create, pause, resume, and clear goals. Felipe Coury credits co-worker Eric Traut (the Pyright lead) for the design, and frames it as Codex's take on the Ralph loop pattern.

cursorApr 30, 2026

Cursor SDK Lands in Public Beta: Programmatic Agents in TypeScript with Local and Cloud Runtimes

Cursor launched the Cursor SDK in public beta on April 29, 2026, exposing the same agent runtime that powers Cursor desktop, CLI, and web behind a TypeScript package. `@cursor/sdk` lets you spawn agents against local files, Cursor-hosted VMs, or self-hosted workers, stream results, and bill on standard token-based pricing -- moving Cursor from an editor surface to a programmable platform.

claude-codeApr 24, 2026

Anthropic's Claude Code Post-Mortem: Three Engineering Missteps Behind the Spring 2026 Quality Decline

Anthropic published a post-mortem on April 23, 2026 explaining the Claude Code quality regression that ran from early March through mid-April: a March 4 default-effort downgrade from high to medium, a March 26 caching change that wiped reasoning history every turn, and an April 16 verbosity prompt that capped responses at 25 words between tool calls. All three were resolved by April 20, the API was unaffected, and Anthropic reset usage limits for all subscribers.

cursorApr 24, 2026

Cursor 3.2 Adds /multitask Async Subagents, Worktrees Polish, and Multi-Root Workspaces

Cursor 3.2 shipped on April 24, 2026 with three changes that pull cross-repo, parallel agent work into the default flow: a new `/multitask` command that fans a request out to async subagents instead of queueing, an improved worktrees experience that runs isolated branch work in the background, and multi-root workspaces so a single agent session can target frontend, backend, and shared-library folders at once.

gpt-5-5Apr 23, 2026

GPT-5.5 Is Here: State-of-the-Art Agentic Coding, 1M Context, and a New Pro Tier

OpenAI launched GPT-5.5 on April 23, 2026 -- its smartest model yet, with state-of-the-art scores on Terminal-Bench 2.0 (82.7%), GDPval (84.9%), and OSWorld-Verified (78.7%), GPT-5.4 per-token latency, and a new GPT-5.5 Pro tier for harder work. Available in ChatGPT and Codex today, with API at $5/M input and $30/M output coming soon.

claude-designApr 17, 2026

Claude Design Launches: Anthropic Labs Turns Opus 4.7 Into a Prototype, Deck, and Wireframe Surface

Anthropic launched Claude Design on April 17, 2026 -- a research preview from Anthropic Labs that turns a prompt, uploaded image, or codebase into polished prototypes, pitch decks, and mockups. Powered by Claude Opus 4.7 vision, it learns your team's design system, exports to Canva, PDF, PPTX, or HTML, and packages finished designs for Claude Code handoff.

codexApr 17, 2026

OpenAI Codex Goes 'For Almost Everything': Mac Computer Use, Browser Comment Mode, and Thread Automations Explained

OpenAI shipped a major Codex update on April 16, 2026 that pushes the product past coding into general work. Three changes matter: Codex can now drive your Mac apps directly, an in-app browser captures both screenshots and DOM elements through 'comment mode', and Codex threads can run continuously to watch Slack, email, and PRs. Here is how each works, the workflows that justify each one, and where Codex now sits relative to Perplexity Personal Computer and Claude Code Routines.

cursorApr 17, 2026

Cursor Self-Documentation: New Subagent-Powered Help Reads Cursor's Own Docs in Real Time

Cursor shipped a self-documentation feature on April 17, 2026: when you ask Cursor about its own features, capabilities, or settings, it now spawns a subagent that fetches the current Cursor docs and updates before answering. The change closes the most annoying gap in AI coding tools -- the model's training cutoff lagging the product's release cadence -- and is a small but telling preview of where AI tool documentation is heading across the industry.

claude-opus-4-7Apr 16, 2026

Claude Opus 4.7 Is Here: State-of-the-Art Coding, xhigh Effort, and a New Cyber Safeguards Tier

Anthropic launched Claude Opus 4.7 on April 16, 2026 -- a notable improvement on Opus 4.6 in advanced software engineering, with the same pricing, a new xhigh effort level, /ultrareview in Claude Code, higher-resolution vision, and the first deployment of cyber safeguards from the Mythos Preview track.

claude-codeApr 15, 2026

Inside Claude Code's Rebuilt Desktop: Parallel Agents, Drag-Drop Panes, Side Chat

Anthropic rebuilt the Claude Code desktop app on April 14, 2026 around parallel agent workflows. The new app adds a multi-session sidebar, drag-and-drop pane layout, an in-app file editor, integrated terminal, side chat for asides, three view modes, and SSH support on macOS -- making the orchestrator role the default.

claude-codeApr 15, 2026

Claude Code Routines: Schedule, API, and GitHub-Trigger Your AI Agents

Claude Code Routines is Anthropic's new way to run saved Claude Code configurations automatically -- by schedule, API call, or GitHub event. Routines run on Anthropic's cloud infrastructure with a prompt, repo, and MCP connectors. Available in research preview on Pro, Max, Team, and Enterprise plans.

ai-newsMar 4, 2026

AI Tools Landscape: What Changed in Early 2026

The first quarter of 2026 brought three major shifts to the AI tools landscape: MCP became a mainstream standard adopted by most coding tools, AI coding assistants matured beyond autocomplete into full workflow partners, and the first generation of truly autonomous AI agents started shipping in production environments.

OpenAI Introduces GPT-Red: An Internal Automated Red-Teaming Model for Prompt Injection

Cursor Adds GPT-5.6 Sol, Terra, and Luna

Cursor Adds Side Chats: Durable Agent Threads You Can @-Mention Back Into the Main Conversation

OpenAI Launches ChatGPT Work: A Codex-Sibling Agent for Long-Running Deliverables

Claude Adds Reflect: A Monthly Recap and Usage Dashboard for How You Use Claude

Claude Cowork is coming to mobile and web (beta rollout starting with Max)

OpenAI adds GPT-Realtime-2.1-mini to the API with reasoning and tool use

Claude Code Artifacts Expand to Pro and Max Plans (Private by Default)

Anthropic raises Claude API rate limits and updates how tiers work

OpenAI introduces GeneBench-Pro, a research-level benchmark for agentic computational biology

Claude Desktop for Linux Beta: Ubuntu and Debian, With Caveats

Claude Science Beta: An AI Workbench for Reproducible Research

Claude Sonnet 5 Launch: Anthropic's Most Agentic Sonnet, Now the Default Tier

Cursor for iOS: Cloud Agents Go Mobile-First in Public Beta

Codex Permission Profiles: Least-Privilege Controls for Local Agent Work

Claude Code Adds Artifacts: Live, Shareable Pages for PR Walkthroughs and Dashboards

OpenAI Previews GPT-5.6: Sol, Terra, and Luna in Limited Preview

OpenAI Ships a New GPT-5.5 Instant in ChatGPT: Better Intent, Constraints, and Shopping/Local Recs

You Can Now Delegate Tasks to Cursor From Inside Notion

Claude Design's `/design-sync` Makes Claude Design and Claude Code a Two-Way Workflow

OpenAI Codex Adds Record & Replay: Turn a Demonstrated Mac Workflow Into a Reusable Skill

Cursor announces Origin: code storage and Git hosting (waitlist)

Anthropic Abruptly Suspends Fable 5 and Mythos 5 Access After US Government Directive

OpenAI: Responses API web search can now return image results

Cursor Bugbot gets faster and cheaper, adds /review command + incremental review

ChatGPT 'Dreaming': OpenAI's New Memory Architecture Curates What It Remembers in the Background

Claude Fable 5 and Mythos 5: Mythos-Class Capability Goes General, With Caveats

Codex for Every Role: Role-Specific Plugins, Codex Sites, and Annotations Beyond Code

Codex Build iOS Apps Plugin: Mirror the Simulator in the Browser and Hot-Reload SwiftUI Previews

Cursor Shared Canvases: Publish an Agent Canvas and Share It With Your Team via URL

Claude Platform's 'ant' CLI Brings the Full Claude API to Your Terminal

Cursor 3.5 brings Automations into the Agents Window (plus multi-repo and no-repo automations)

Codex Adds Windows Computer Use + ChatGPT Mobile Windows Connections for On-the-Go Steering

Claude Code adds dynamic workflows (research preview) for large parallel agent runs

Claude Opus 4.8 Fast Mode: 2.5x Faster Output Tokens in Research Preview

OpenAI's Secure MCP Tunnel: Connect Private MCP Servers Over Outbound-Only HTTPS

Claude Code ships a security-guidance plugin for in-session vulnerability checks

Claude Agent SDK Gets a Monthly Credit on Paid Claude Plans Starting June 15, 2026

OpenAI Codex adds Locked computer use on Mac (keep Computer Use running after lock)

Claude Managed Agents Add Self-Hosted Sandboxes (Public Beta) and MCP Tunnels (Research Preview)

Cursor Launches Composer 2.5: Better Long-Running Agent Work, New Pricing Tiers, and 2x Included Usage This Week

Claude Code 2.1.142: `claude agents` Gains Session Flags, Fast Mode Defaults to Opus 4.7, MCP Tool Timeout Honored

Claude Code Weekly Limits +50%: Promo Extended Through July 19, 2026

Codex in the ChatGPT Mobile App: Preview Lets You Run, Review, and Steer Codex from Your Phone

Codex Chrome Extension: How Codex Drives a Signed-In Browser for LinkedIn, Salesforce, Gmail, and Internal Tools

Codex Hooks and Programmatic Access Tokens: How OpenAI Is Making Codex Easier to Automate Around Your Code

Cursor Bugbot Adds Effort Levels: Default, High, and Custom (Usage-Based Billing Required)

Codex CLI 0.130.0 Adds `remote-control`, Richer Plugin Sharing Metadata, and Better App-Server Thread Paging

Gemini Interactions API: Steps Schema, `response_format`, and a June 8, 2026 Legacy Sunset

Claude Code 2.1.133: `worktree.baseRef` Default Returns to `origin/<default>`, MCP OAuth Proxy Honored Across the Whole Flow

Codex CLI 0.129.0 Adds Modal Vim Composer, Redesigned Resume/Fork Picker, and a `/hooks` Browser

Cursor adds enterprise model controls, soft spend limits, and richer usage analytics

Warp Goes Open Source: AGPL Client, MIT UI Framework, and a New `settings.toml`

Codex CLI 0.128.0 Lands Persisted `/goal` Workflows: Ralph-Style Agents That Don't Stop Until Done

Cursor SDK Lands in Public Beta: Programmatic Agents in TypeScript with Local and Cloud Runtimes

Anthropic's Claude Code Post-Mortem: Three Engineering Missteps Behind the Spring 2026 Quality Decline

Cursor 3.2 Adds /multitask Async Subagents, Worktrees Polish, and Multi-Root Workspaces

GPT-5.5 Is Here: State-of-the-Art Agentic Coding, 1M Context, and a New Pro Tier

Claude Design Launches: Anthropic Labs Turns Opus 4.7 Into a Prototype, Deck, and Wireframe Surface

OpenAI Codex Goes 'For Almost Everything': Mac Computer Use, Browser Comment Mode, and Thread Automations Explained

Cursor Self-Documentation: New Subagent-Powered Help Reads Cursor's Own Docs in Real Time

Claude Opus 4.7 Is Here: State-of-the-Art Coding, xhigh Effort, and a New Cyber Safeguards Tier

Inside Claude Code's Rebuilt Desktop: Parallel Agents, Drag-Drop Panes, Side Chat

Claude Code Routines: Schedule, API, and GitHub-Trigger Your AI Agents

AI Tools Landscape: What Changed in Early 2026

Get the weekly AI Catchup