AI Developer Digest

Tue, Jun 9, 2026

20 signals that cleared the gate27 min read

The Signal — start here

Today is Anthropic's biggest public release day since Opus 4.8 launched. Claude Fable 5 — the first Mythos-class model available to any developer — ships on June 9 with SWE-bench Verified at 95.0%, a 1M context window, 128k max output, and concrete API changes that require migration planning before adoption. It is not a drop-in replacement for Opus 4.8: adaptive thinking is always on, thinking: {"type": "disabled"} returns a 400, and the model won't run under zero data retention. On the same day: Windows KB5039239 delivers the Aion 1.0 AI runtime to Windows 11 24H2 devices, making on-device inference via Windows Copilot Runtime API generally available for the first time. Claude Code v2.1.170 also shipped with a --safe-mode debugging flag and a /cd command that preserves prompt cache when changing working directories.

Must-reads today

Claude Fable 5 — First Mythos-class model for general use; 95.0% SWE-bench Verified, $10/$50 per MTok; read the migration guide before adopting — thinking.disabled, budget_tokens, and assistant prefill all return 400 errors.

Windows KB5039239 — Aion 1.0 (Instruct + 14B Plan) ships today to Windows 11 24H2; Windows Copilot Runtime API goes live, enabling free on-device inference via ONNX across NPU/GPU/CPU.

Claude Code v2.1.170 — --safe-mode, /cd, and disableBundledSkills land in the same build; noteworthy if you debug Claude Code sessions or manage complex CLAUDE.md setups.

Breaking Changes

●Breaking

Claude Fable 5: `thinking.disabled`, `budget_tokens`, and Assistant Prefill Return 400

What changed

On Claude Fable 5 and Claude Mythos 5, three previously supported parameters are not supported and return HTTP 400 errors: (1) thinking: {"type": "disabled"}, (2) manual extended thinking budget_tokens, and (3) assistant prefill (a pre-populated role: "assistant" message as the last turn). All three worked on Opus 4.8 and earlier models. Additionally, the Messages API now returns stop_reason: "refusal" (as a successful HTTP 200, not an error) when a safety classifier blocks a request — code that only checks for network errors or non-200 responses will silently swallow refusals.

TL;DR

Migrating from Opus 4.8 to Claude Fable 5 requires removing thinking.disabled, removing manual budget_tokens, removing assistant prefill, and explicitly handling stop_reason: "refusal" — all four break silently or hard-fail if unaddressed.

Developer signal

Before switching any production integration to claude-fable-5: (1) Search your codebase for thinking: {"type": "disabled"} and remove or conditionally skip it when targeting Fable 5 — use the effort parameter instead to control thinking depth (e.g., effort: "low" to minimize thinking overhead). (2) Search for budget_tokens in extended thinking configurations and replace with effort. (3) Search for any pattern where role: "assistant" is the last message in the messages array before the API call — this is assistant prefill and will return a 400. (4) Add an explicit check for stop_reason === "refusal" in your response handler. The refusal comes back as HTTP 200 with the stop_reason field set — your error handler will not catch it. A refused request produces no output and is not billed; use the fallbacks parameter (beta, Claude API and Platform on AWS only) to automatically retry the request on another model. (5) Fable 5 requires 30-day data retention and is not available under zero data retention (ZDR) — if your org has ZDR enabled, Fable 5 is not accessible until you contact your account team.

Affects you ifYou have code currently using thinking: {"type": "disabled"} or budget_tokens; you use assistant prefill as a prompting technique; you are evaluating adoption of claude-fable-5 in a ZDR-enabled org; you route on stop_reason values in productionEffortModerate (four specific migration items to address; ZDR constraint may require org-level policy decision)

Anthropic Platform Release Notes | Date: June 9, 2026 | Link: https://platform.claude.com/docs/en/release-notes/overviewhttps://platform.claude.com/docs/en/about-claude/models/introducing-claude-fable-5-and-claude-mythos-5

Model Releases

High

Claude Fable 5 — First Publicly Available Mythos-Class Model

What changed

Anthropic launched claude-fable-5 as its most capable generally available model, and claude-mythos-5 for Project Glasswing participants only. The prior public frontier was claude-opus-4-8; Fable 5 and Mythos 5 share the same underlying model, with Fable 5 running Anthropic's safety classifiers on every request and falling back to Opus 4.8 for requests in high-risk categories (cybersecurity, biology, chemistry, distillation), while Mythos 5 skips the classifiers entirely and is restricted to vetted partners.

TL;DR

Claude Fable 5 launches at $10/$50 per MTok (input/output), 1M token context window, 128k max output tokens, always-on adaptive thinking, with SWE-bench Verified at 95.0%, SWE-bench Pro at 80.3%, MMLU Pro at 91.50%, and LiveCodeBench at 89.78%.

Developer signal

Fable 5 is not a drop-in replacement for Opus 4.8 — review the Breaking Changes section above before upgrading any production integration. Once you've addressed those migration items: (1) API access — use model ID claude-fable-5; it's available today on the Claude API, Claude Platform on AWS, Amazon Bedrock, Vertex AI, and Microsoft Foundry. (2) Thinking — adaptive thinking is always on and cannot be disabled; control depth via the effort parameter (low, medium, high). Default display is "omitted" (thinking blocks exist but are empty); set thinking.display: "summarized" if you want readable summaries of the reasoning. Pass thinking blocks back unchanged in multi-turn conversations. (3) Token counting — Fable 5 uses the tokenizer introduced with Claude Opus 4.7; the same text produces roughly 30% more tokens than with pre-4.7 models. Recalibrate any fixed-length prompts, token budget estimates, and prompt caching breakpoints before deploying at scale. Use the token counting API with model: "claude-fable-5" to measure your actual prompts. (4) Fallback — the fallbacks parameter (beta, Claude API and Claude Platform on AWS only) lets you specify fallback models to retry on when Fable 5 refuses; alternatively, use the SDK middleware for TypeScript, Python, Go, Java, or C#. (5) GitHub Copilot — Fable 5 is available today in GitHub Copilot for Pro+, Max, Business, and Enterprise tiers under usage-based billing at provider list pricing; the 1M context window is available and requires opting in to the extended context size selector.

Affects you ifYou build on the Claude API, AWS Bedrock, Vertex AI, or Microsoft Foundry; you use GitHub Copilot on a paid tier above Individual; you have agentic coding or long-horizon research workflows; you evaluate frontier model capability for production useEffortModerate (migration checklist from Opus 4.8: remove disabled thinking, budget_tokens, and prefill; add refusal handling; retest token counts; check ZDR policy)

Anthropic | Date: June 9, 2026 | Link: https://www.anthropic.com/news/claude-fable-5-mythos-5https://platform.claude.com/docs/en/about-claude/models/introducing-claude-fable-5-and-claude-mythos-5

API & SDK Changes

Medium

Claude Fable 5: New `stop_details.category: "reasoning_extraction"` and `fallbacks` Parameter (Beta)

What changed

Two new API capabilities launch alongside Fable 5. First, stop_details.category on refusal responses gains a new "reasoning_extraction" value (returned when a request is blocked for attempting to reverse-engineer or duplicate model outputs, under Anthropic's ToS); the existing "cyber" and "bio" values are unchanged. Second, a new fallbacks parameter (beta, Messages API on Claude API and Platform on AWS, not supported on Message Batches API) allows you to specify one or more fallback model IDs; if Fable 5 refuses a request, the API automatically retries on the fallback model and returns a fallback_credit that refunds the prompt-cache overhead of switching.

TL;DR

Two new Fable 5 API additions: stop_details.category: "reasoning_extraction" for ToS-refusal classification, and the fallbacks beta parameter for automatic model-level retry on refusal (Claude API + Platform on AWS only).

Developer signal

(1) If your code switches on stop_details.category to route different refusal types to different handling paths, add "reasoning_extraction" to your switch/case. (2) The fallbacks parameter is the recommended pattern for production Fable 5 integrations: specify fallbacks: ["claude-opus-4-8"] (or your preferred fallback) to get automatic retry on the fallback model when Fable 5 refuses, with fallback_credit handling the cache cost of switching. This is beta on the first-party Claude API and Platform on AWS only — not on Bedrock, Vertex, or Foundry at launch, and not supported in the Message Batches API. For other platforms, use the SDK middleware for client-side fallback handling (available for TypeScript, Python, Go, Java, C#).

Affects you ifYou handle stop_details.category in routing logic; you want automatic retry logic when Fable 5 refuses a request; you are using the Messages API on the Claude API or Platform on AWSEffortQuick (fallbacks is an additive parameter — no existing code breaks; just add it to your request body)

Research

Nothing cleared the quality bar this period. arXiv direct fetch returned 403 for June 8–9 submission listings. No papers from recognized labs (DeepMind, Meta FAIR, Stanford, MIT, CMU, AI2) with measurable benchmark numbers and associated code were confirmed within the scan window via search.

Tooling

Medium

Windows KB5039239 Ships — Aion 1.0 and Windows Copilot Runtime API Now Live

What changed

Windows Update KB5039239 shipped today on Windows 11 24H2, delivering the Aion 1.0 model family and expanded Windows Copilot Runtime API to end-user devices for the first time. Previously, Aion 1.0 was announced at Build 2026 (June 2–3) but required the June 9 update to reach devices. Aion 1.0 Instruct is a small on-device language model for everyday text tasks (summarization, rewrite, accessibility); Aion 1.0 Plan is a 14B-parameter reasoning and tool-calling model (32K context) designed for agentic workflows on-device. Both run through the Windows Copilot Runtime API as a unified ONNX inference graph across NPU, GPU, and CPU.

TL;DR

Windows KB5039239 ships today with Aion 1.0 Instruct (text SLM, preview) and Aion 1.0 Plan (14B reasoning model, 32K context, tool-calling) accessible via the Windows Copilot Runtime API as ONNX across NPU/GPU/CPU; also includes Speech Recognition API public preview and Phi Silica GPU expansion.

Developer signal

If you build Windows 11 applications that use AI: (1) KB5039239 is the minimum baseline for Aion 1.0 and the expanded Copilot Runtime API — user devices must have this update installed. Document this as a system requirement if you adopt any of these APIs. (2) Access Aion 1.0 through the Windows Copilot Runtime API — the unified ONNX inference path routes automatically to NPU first, then GPU, then CPU depending on device capability; you get free on-device inference with no per-call API cost. (3) Aion 1.0 Instruct is in preview — do not use it for production inference on critical paths yet. Aion 1.0 open weights land on Hugging Face in July 2026; from that point, you can also fine-tune or run the base weights directly if Windows API abstractions aren't sufficient. (4) Speech Recognition API public preview: test for new accuracy improvements if you use speech recognition in your Windows app — preview APIs may change before GA. (5) Phi Silica GPU expansion extends the existing Phi Silica on-device text model (already present in 24H2) to GPU inference on more device classes — if you use Phi Silica today, test your workloads after KB5039239 installs to confirm behavior is unchanged.

Affects you ifYou build Windows 11 apps (UWP, Win32, WinUI, .NET MAUI) that integrate AI inference; you use or are evaluating the Windows Copilot Runtime API; you have end-user apps that currently call cloud AI APIs and want an on-device alternativeEffortModerate (requires KB5039239 baseline, Windows Copilot Runtime API integration, testing across NPU/GPU/CPU device variants, and noting Aion Instruct is preview)

Windows Developer Blog | Date: June 9, 2026 | Link: https://blogs.windows.com/windowsdeveloper/2026/06/02/build-2026-furthering-windows-as-the-trusted-platform-for-development/https://blogs.windows.com/windowsdeveloper/2026/06/02/build-2026-furthering-windows-as-the-trusted-platform-for-development/

Notable

Claude Code v2.1.170: `--safe-mode`, `/cd`, and `disableBundledSkills`

What changed

v2.1.170 adds three new developer controls: a --safe-mode CLI flag (also CLAUDE_CODE_SAFE_MODE env var) that starts Claude Code with all customizations disabled; a /cd slash command to move the active session to a new working directory without breaking the prompt cache; and a disableBundledSkills config setting (also CLAUDE_CODE_DISABLE_BUNDLED_SKILLS env var) that hides Anthropic's built-in bundled skills, workflows, and slash commands from the model's context. Previous versions had no CLI-level safe mode and no in-session directory change.

TL;DR

Claude Code v2.1.170 ships with --safe-mode for isolation debugging (disables CLAUDE.md, plugins, skills, hooks, MCP servers), /cd to change session directory without cache loss, and disableBundledSkills to strip built-in skills from model context.

Developer signal

(1) --safe-mode / CLAUDE_CODE_SAFE_MODE=1 — use this when diagnosing unexpected model behavior and you want to rule out custom CLAUDE.md, hooks, or MCP server interference. All customizations (CLAUDE.md at all levels, plugins, skills, hooks, MCP servers) are disabled; model sees a baseline environment. (2) /cd <path> — use when you start a session in one directory but need to work on a different repo or subdirectory mid-session without starting a new session (which would cost you the prompt cache). (3) disableBundledSkills / CLAUDE_CODE_DISABLE_BUNDLED_SKILLS=1 — use when Anthropic's built-in bundled skills, workflows, or slash commands conflict with your custom setup, or when you want to reduce context bloat by preventing built-ins from being included in the model's toolset.

Affects you ifYou use Claude Code CLI; you debug unexpected model behavior in projects with complex CLAUDE.md or hook setups; you work across multiple repos in a single session; you build Claude Code integrations where bundled skills interfere with custom workflowsEffortQuick (update Claude Code via the standard update path; all three features are additive, no config migration needed)

anthropics/claude-code GitHub | Date: June 9, 2026 | Link: https://github.com/anthropics/claude-code/releaseshttps://github.com/anthropics/claude-code/releases/tag/v2.1.170

Benchmarks & Leaderboards

Medium

SWE-bench Verified: Claude Fable 5 at 95.0% — New Benchmark High Watermark

What changed

Anthropic reports Claude Fable 5 at 95.0% on SWE-bench Verified (up from Claude Mythos Preview's 93.9%, the prior high). On SWE-bench Pro (a harder multi-file agentic variant), Fable 5 scores 80.3% vs GPT-5.5's 58.6% and Opus 4.8's 69.2%. Other coding benchmarks: LiveCodeBench 89.78%, IOI 72.25%, CursorBench 72.9%, Terminal-Bench 2.1 80.52%.

TL;DR

Anthropic reports Claude Fable 5 at 95.0% on SWE-bench Verified (highest reported score for any publicly available model) and 80.3% on SWE-bench Pro, vs GPT-5.5 at 82.6% Verified / 58.6% Pro.

Developer signal

Two caveats before treating these numbers as definitive: (1) Benchmark methodology matters. The independent swebench.com public leaderboard currently shows GPT-5.5 leading SWE-bench Verified at 82.60% — Fable 5's 95.0% is Anthropic's self-reported number and may reflect a different agent scaffold or run configuration than third-party submissions. Watch the official swebench.com leaderboard for independent verification over the next 1–2 weeks. (2) Starred benchmarks (including some cybersecurity and biology SWE tasks) show Fable 5 performing closer to Opus 4.8 because safety classifiers trigger fallback to Opus 4.8 on those task categories — Mythos 5 scores 1–3 percentage points higher on those tasks without classifiers. If you evaluate models on security-adjacent coding benchmarks, test Fable 5 on your actual workload rather than relying on aggregate headline scores.

Affects you ifYou choose models based on benchmark performance; you have agentic coding workflows where SWE-bench-class task completion matters; you evaluate code generation quality for production useEffortQuick (informational — no code changes required to act on this information)

Anthropic / swebench.com | Date: June 9, 2026 | Link: https://www.anthropic.com/news/claude-fable-5-mythos-5https://www.anthropic.com/news/claude-fable-5-mythos-5

Trends & Emerging Tech

Classifier-As-Feature: The Fable/Mythos Split as a New Safety Architecture Pattern

What's happening

Fable 5 and Mythos 5 are the same underlying model served through different classifier configurations: Fable 5 runs safety classifiers on every request and automatically falls back to Opus 4.8 on high-risk categories; Mythos 5 skips classifiers entirely and is gated to vetted partners. This creates a three-tier model surface: (1) Mythos 5 for full capability without classifiers (Project Glasswing, restricted), (2) Fable 5 for Mythos-class capability with classifiers (general, $10/$50 per MTok), and (3) Opus 4.8 as the automatic fallback for requests Fable 5 refuses (~$5/$25 per MTok). The architecture also surfaces a new API primitive — stop_reason: "refusal" as a first-class response, plus the fallbacks parameter to handle it automatically.

Why watch this

The fallbacks + stop_reason: "refusal" pattern is a concrete precedent: it makes the classifier a routing decision point rather than a failure state, and the fallback_credit mechanism means you don't pay twice for the prompt-cache cost of switching. If this pattern proliferates — other labs introducing tiered classifier configurations with API-level fallback routing — the implication for builders is that model selection becomes a policy decision at call time, not at design time. A single API call surface can span multiple model tiers transparently. Watch whether the fallbacks parameter exits beta, extends to Bedrock/Vertex/Foundry, and whether Anthropic adds more sophisticated routing rules (e.g., per-category fallback targets).

Anthropic | Date: June 9, 2026 | Link: https://www.anthropic.com/news/claude-fable-5-mythos-5

Technical Discussions

Nothing cleared the quality bar this period. No qualifying Hacker News threads (score >200 with technical depth) found for June 8–9. No qualifying posts from Nathan Lambert (last confirmed post: June 1), Eugene Yan, or Sebastian Raschka in the scan window. Simon Willison's blog returned 403 on direct fetch; no qualifying posts confirmed via search for June 8–9.

Quick Hits

Claude Fable 5 GA in GitHub Copilot (June 9) — Available today on Pro+, Max, Business, and Enterprise tiers; billed at provider list pricing under Usage Based Billing; 1M context window available via the extended context size selector; note 30-day data retention applies. [https://github.blog/changelog/2026-06-09-claude-fable-5-is-generally-available-for-github-copilot/]

Worth Watching (Announced, Not Yet Shipped)

⚠️⚠️⚠️ Claude Sonnet 4 + Opus 4 — Retirement June 15 (6 days)

(Countdown updated)

claude-sonnet-4-20250514 and claude-opus-4-20250514 return errors June 15. Migrate to claude-sonnet-4-6-20260217 and claude-opus-4-8 respectively. Review the Opus 4.8 migration guide before upgrading — adaptive thinking replaces budget_tokens; setting temperature, top_p, or top_k to non-default values returns a 400 error.

Anthropic | Link: https://platform.claude.com/docs/en/about-claude/model-deprecations

⚠️⚠️ Gemini CLI Hard Stop — June 18 (9 days)

(Countdown updated)

gemini CLI and Gemini Code Assist IDE extensions stop serving requests on June 18. Replacement is Antigravity CLI (agy). Audit CLI scripts and CI pipeline steps now — Antigravity CLI does not have 1:1 feature parity.

Google Developers Blog | Link: https://developers.googleblog.com/an-important-update-transitioning-gemini-cli-to-antigravity-cli/

⚠️⚠️ Gemini API Unrestricted Key Deadline — June 19 (10 days)

(Countdown updated)

All unrestricted Gemini API keys blocked June 19. Restrict via AI Studio → API Keys → "Restrict to Gemini API." Takes 2 minutes; no code changes required.

Google AI for Developers | Link: https://ai.google.dev/gemini-api/docs/api-key

⚠️ Gemini Image Models Shutdown — June 25 (16 days)

(Countdown updated)

gemini-3.1-flash-image-preview and gemini-3-pro-image-preview shutting down June 25, 2026. Migrate to stable image model equivalents.

Google AI for Developers | Link: https://ai.google.dev/gemini-api/docs/deprecations

⚠️ GPT-4.5 Retirement from ChatGPT — June 27 (18 days)

(Countdown updated)

GPT-4.5 being retired from the ChatGPT product surface on June 27. Direct API route retirement unconfirmed. Audit gpt-4.5 model identifiers in code.

OpenAI Platform Changelog | Link: https://platform.openai.com/docs/changelog

⚠️ Aion 1.0 Open Weights — July 2026 (~3 weeks)

NEW — Added June 9, 2026

Aion 1.0 Instruct open weights land on Hugging Face in July 2026. If you want to run, fine-tune, or evaluate the model outside the Windows Copilot Runtime API, wait for the weights. No confirmed specific date yet.

Windows Developer Blog | Link: https://blogs.windows.com/windowsdeveloper/2026/06/02/build-2026-furthering-windows-as-the-trusted-platform-for-development/

⚠️⚠️ Claude Opus 4.1 Retirement — August 5 (57 days)

(Countdown updated)

claude-opus-4-1-20250805 retires August 5. Migrate to claude-opus-4-8. See the June 6, 2026 digest for the full migration checklist including breaking changes around adaptive thinking, sampling parameters, and tokenizer differences.

Anthropic | Link: https://platform.claude.com/docs/en/about-claude/model-deprecations

⚠️ OpenAI Reusable Prompts (`v1/prompts`) Shutdown — November 30 (175 days)

Deprecated June 3, shutdown November 30, 2026. Move prompt content to application code.

OpenAI | Link: https://developers.openai.com/api/docs/deprecations

⚠️ OpenAI Evals Platform Shutdown — November 30 (175 days)

Read-only October 31, shutdown November 30, 2026. Export eval configs before October 31.

OpenAI | Link: https://developers.openai.com/api/docs/deprecations

⚠️ OpenAI Agent Builder Shutdown — November 30 (175 days)

Shutdown November 30, 2026. Migrate to Agents SDK (openai.agents) or ChatGPT Workspace Agents.

OpenAI | Link: https://developers.openai.com/api/docs/deprecations

Apple iOS 27 / macOS Golden Gate / Core AI GA — Fall 2026 (September, ~3 months)

(Carried — status unchanged)

iOS 27, iPadOS 27, and macOS Golden Gate ship with iPhone 18 in September 2026. Includes: Siri Extensions API (App Intents-based, third-party AI providers), Core AI (replaces Core ML), expanded Foundation Models multi-provider support. Developer Beta 1 available now. Public beta expected mid-July. Start auditing Core ML usage and planning Extensions integration now.

Apple Developer / WWDC 2026 | Link: https://developer.apple.com/ios/

Gemini 3.5 Pro — Expected June 2026 (No Date Confirmed)

(Carried — no official date)

Still in limited Vertex preview. Sundar Pichai stated "give us until next month" at Google I/O 2026 (May 19). No official model card, API pricing, model ID, or benchmark numbers. Expected: 2M token context window, Deep Think reasoning mode.

Claude Mythos 5 General Availability — No Timeline

(Carried — status unchanged)

Currently only for vetted Project Glasswing participants. Not available on public API. Contact your Anthropic, AWS, or Google Cloud account team for access.

Anthropic | Link: https://www.anthropic.com/news/expanding-project-glasswing

Filtered from 30+ primary sources against a published quality rubric. No press releases, no fluff — only what changes what you build.

Breaking Changes

Claude Fable 5: `thinking.disabled`, `budget_tokens`, and Assistant Prefill Return 400

Model Releases

Claude Fable 5 — First Publicly Available Mythos-Class Model

API & SDK Changes

Claude Fable 5: New `stop_details.category: "reasoning_extraction"` and `fallbacks` Parameter (Beta)

Research

Tooling

Windows KB5039239 Ships — Aion 1.0 and Windows Copilot Runtime API Now Live

Claude Code v2.1.170: `--safe-mode`, `/cd`, and `disableBundledSkills`

Benchmarks & Leaderboards

SWE-bench Verified: Claude Fable 5 at 95.0% — New Benchmark High Watermark

Trends & Emerging Tech

Classifier-As-Feature: The Fable/Mythos Split as a New Safety Architecture Pattern

Technical Discussions

Quick Hits

Worth Watching (Announced, Not Yet Shipped)

⚠️⚠️⚠️ Claude Sonnet 4 + Opus 4 — Retirement **June 15 (6 days)**

⚠️⚠️ Gemini CLI Hard Stop — **June 18 (9 days)**

⚠️⚠️ Gemini API Unrestricted Key Deadline — **June 19 (10 days)**

⚠️ Gemini Image Models Shutdown — **June 25 (16 days)**

⚠️ GPT-4.5 Retirement from ChatGPT — **June 27 (18 days)**

⚠️ Aion 1.0 Open Weights — **July 2026 (~3 weeks)**

⚠️⚠️ Claude Opus 4.1 Retirement — **August 5 (57 days)**

⚠️ OpenAI Reusable Prompts (`v1/prompts`) Shutdown — **November 30 (175 days)**

⚠️ OpenAI Evals Platform Shutdown — **November 30 (175 days)**

⚠️ OpenAI Agent Builder Shutdown — **November 30 (175 days)**

Apple iOS 27 / macOS Golden Gate / Core AI GA — **Fall 2026 (September, ~3 months)**

Gemini 3.5 Pro — Expected June 2026 (No Date Confirmed)

Claude Mythos 5 General Availability — No Timeline

⚠️⚠️⚠️ Claude Sonnet 4 + Opus 4 — Retirement June 15 (6 days)

⚠️⚠️ Gemini CLI Hard Stop — June 18 (9 days)

⚠️⚠️ Gemini API Unrestricted Key Deadline — June 19 (10 days)

⚠️ Gemini Image Models Shutdown — June 25 (16 days)

⚠️ GPT-4.5 Retirement from ChatGPT — June 27 (18 days)

⚠️ Aion 1.0 Open Weights — July 2026 (~3 weeks)

⚠️⚠️ Claude Opus 4.1 Retirement — August 5 (57 days)

⚠️ OpenAI Reusable Prompts (`v1/prompts`) Shutdown — November 30 (175 days)

⚠️ OpenAI Evals Platform Shutdown — November 30 (175 days)

⚠️ OpenAI Agent Builder Shutdown — November 30 (175 days)

Apple iOS 27 / macOS Golden Gate / Core AI GA — Fall 2026 (September, ~3 months)