🌟Vasilij’s note

Three things happened this week that belong in the same sentence. Anthropic launched Claude Fable 5 – a Mythos-class model, now available to anyone with an API key, wrapped in hard safety classifiers for cybersecurity and biology. At the same time, Claude Managed Agents quietly added scheduled deployments and credential vaults, which means agents can now run as cron jobs without a human pressing start. And sitting behind all of it: Anthropic's confidential IPO filing and a revenue run-rate of $47 billion. What looked like an ambitious AI lab two years ago is now infrastructure. The question for consultancies is no longer whether to use these tools. It is whether your governance, your workflow design, and your client data policies are ready for agents that run while you sleep.

In today's edition

This week in agents | What changed

Claude Fable 5 launches publicly

Anthropic released its Mythos-class model to the general API, with a 1 million-token context window, top-ranked agentic score of 80.7, and hard safety classifiers blocking high-risk cybersecurity and biology requests. Blocked prompts fall back to Claude Opus 4.8 at Opus pricing, with at least 95% of sessions running fully on Fable. → The most capable publicly available Claude model ever. For firms building coding agents, long-horizon research workflows, or complex document analysis, this is the new baseline. Pricing at $10/$50 per million tokens is double Opus 4.8 – run the maths before migrating production workloads wholesale.

Claude Managed Agents adds scheduled deployments and credential vaults

Agents can now run on a cron schedule and securely access CLI tools and authenticated services via vault-stored environment variables, both in public beta. Each schedule fires a new session automatically – no scheduler to build or host. → This is the plumbing change that moves agents from demo to production infrastructure. A nightly compliance scan, a weekly data sync, a daily client digest – none of these required a human trigger anymore. The governance question that follows immediately: who reviews what the agent did while you were away?

Anthropic files confidential IPO with SEC at $965 billion valuation

Revenue run-rate hit $47 billion in May 2026, up from $9 billion at the end of 2025. The filing came one week after a $65 billion Series H. Target listing window is October 2026. → For clients and consultancies depending on Claude infrastructure, this changes the firm's strategic position. Public markets require financial transparency, stable SLAs, and accountable governance. If you have been building on Anthropic's API without vendor risk plans, now is the time to document your fallback posture.

Top moves | Signal → impact

  • Claude Fable 5 as Advisor Model

    Fable 5 can now operate as an advisor model mid-task: faster, lower-cost worker models call Fable 5 to check their plan and evaluate their work before continuing. → For consultancies running multi-step workflows, this is the architecture to watch. Rather than running Fable 5 on every token, you use cheaper models for execution and Fable 5 for judgment checkpoints. That pattern reduces cost while maintaining quality at the decisions that matter – proposal sign-off, risk assessment, client-facing output review.

  • OpenAI Codex reaches 5 million weekly users, 20% non-developers

    Codex crossed 5 million weekly active users as of 2 June, with knowledge workers (analysts, marketers, operators, designers) growing three times faster than developers. Role-specific plugins launched for equity research, banking, and sales. → Anthropic and OpenAI are converging on the same enterprise buyer. The battle is no longer model quality alone – it is workflow integration depth. For consultancies advising clients on AI tooling, the practical question is: does your client's workflow sit closer to Codex's plugin ecosystem or Claude's agent harness? Both are valid. Choosing without mapping the workflow first is the expensive mistake.

  • EU AI Act Omnibus defers high-risk deadline but August obligations remain

    The Digital Omnibus on AI (agreed 7 May 2026) postponed Annex III high-risk AI system obligations from August 2026 to December 2027. However, Article 50 transparency requirements and AI literacy obligations still apply from August 2, 2026. → The deadline extension for high-risk AI is real relief, but do not treat it as permission to stop. Transparency and literacy obligations are live. For consultancies serving financial services, healthcare, and public sector clients, client-facing AI tools still need documented oversight, disclosure mechanisms, and staff training records before August. The firms that completed governance foundations in H1 are ahead.

Upskilling spotlight | Learn this week

Anthropic's Claude Fable 5 & Managed Agents Developer Docs

Understand the advisor model pattern, fallback API behaviour, and data retention obligations before touching production workloads. The 30-day retention requirement is a contract consideration, not just a technical one.

EU AI Act Digital Omnibus Summary (Latham & Watkins)

A precise reading of which deadlines shifted, which did not, and what Article 50 transparency obligations mean for client-facing AI tools before August. Saves the time of reading the legislative text directly

Operator’s picks | Tools to try

Claude Managed Agents with Self-Hosted Sandboxes

Use for: running agents inside your own infrastructure perimeter – your network policies, audit logging, and security tooling apply, and files do not leave your environment. Supports Cloudflare, Daytona, Modal, and Vercel as managed compute options.

Standout: the agent loop orchestration stays on Anthropic's infrastructure while execution moves into your boundary – you get both reliability and control without rebuilding the harness. Required for any client engagement where data residency is non-negotiable.

Caveat: self-hosted sandbox is public beta; test failure modes before production.

claude-api skill in Claude Code

Use for: migrating existing workloads to Fable 5 without manual search-and-replace across your codebase. Run /claude-api migrate to update model names, prompts, and settings automatically.

Standout: removes the friction that causes most teams to delay model adoption – what normally takes an afternoon of careful editing takes minutes.

Caveat: review the output before committing – Fable 5 carries a 30-day mandatory data retention requirement that may affect existing client data agreements and DPAs.

OpenCode

Use for: model-agnostic coding agent work across 75+ providers (Claude, GPT, Gemini, DeepSeek, local via Ollama) – particularly useful for teams that want to run Fable 5 on judgment tasks and cheaper models on execution without being locked into a single provider's tooling.

Standout: LSP integration feeds compiler diagnostics directly back to the model – no other coding agent does this – and true air-gapped deployment makes it viable for regulated client environments. 160K+ GitHub stars; 7.5 million monthly active developers as of June 2026.

Caveat: benchmarks show it runs 78% slower than Claude Code on the same model, and generates more thorough output as a result – the trade-off is worth understanding before choosing between them.

Deep dive | Claude Fable 5: What the Architecture Actually Changes

Fable 5 is not simply a more powerful model. It is a different deployment architecture – the first publicly available Mythos-class model, sold with mandatory safety classifiers, a mandatory data retention window, and a built-in fallback chain to Opus 4.8. Understanding these constraints is as important as understanding the capabilities.

On paper
  • Ranked #1 of 377 models for overall intelligence, #1 for coding, #1 for agentic tasks with a score of 80.7 – the highest in the field.

  • 1 million-token context window, supporting text, image, and file inputs with tool use and function calling.

  • Priced at $10 input / $50 output per million tokens – double Opus 4.8. The advisor model pattern lets you use Fable 5 selectively on judgment tasks while cheaper models handle execution.

  • Safety classifiers block a defined set of cybersecurity and biology requests, returning an error rather than a model response. Fallback API option routes blocked requests to Opus 4.8 at Opus pricing.

  • 30-day mandatory data retention period. Retained data used only for misuse detection, not model training. Admins must accept updated terms in Claude Console before use.

  • Available on the Claude API, Amazon Bedrock, AWS, Google Cloud, and Microsoft Foundry.

In practice
  • Anthropic reports at least 95% of Fable sessions run fully on the model with no fallback. Early third-party results: Hex gave Fable 5 a 90% pass rate on complex long-running analytical tasks; Genspark said it outperformed every other model on UI design and game coding.

  • The data retention obligation is the deployment blocker most teams will hit first. If your client contracts specify zero retention or require data processing agreements, you need to review those before putting client data through Fable 5. This is not a technical task – it is a contracts and DPA task.

  • The advisor model pattern is novel but requires deliberate workflow design. You need to decide which decisions in your pipeline are worth Fable 5's cost and which are acceptable at Opus 4.8 rates. Doing this upfront prevents the "let's just run everything on the best model" cost creep that kills ROI.

  • Claude Sonnet 4 and Opus 4.0 retire on 15 June 2026. If you have workflows still using those model strings, migration is not optional.

Issues/backlash
  • The 30-day data retention requirement has drawn scrutiny from enterprise buyers in regulated sectors. Anthropic states clearly the data is not used for training, but data processing agreements still need updating before deployment.

  • The fallback mechanism – where blocked requests route silently to Opus 4.8 unless you opt out – has concerned some security-conscious teams who want deterministic model behaviour. The opt-out is available but requires explicit configuration.

  • Pricing at $10/$50 per million tokens makes Fable 5 the most expensive publicly available Claude model. For high-volume, lower-complexity workflows (document triage, classification, summarisation), the cost uplift over Opus 4.8 will not justify the capability gain.

My take (what to do)
  • Startup (15–40 staff): Do not migrate everything to Fable 5 by default. Run a cost analysis first: identify the 2–3 workflows where quality at the decision point actually matters to clients – proposal review, contract risk assessment, complex research synthesis. Those are Fable 5 candidates. Everything else stays on Opus 4.8 until you have evidence the quality improvement justifies the cost. Before using Fable 5 with any client data, check whether your client MSAs cover a 30-day retention window. If they do not, get legal sign-off or use a data agreement update before deployment.

  • SMB (50–120 staff): You likely have a mix of client-facing and internal workflows. Treat Fable 5 as a quality upgrade for your highest-value client deliverables, not a platform-wide replacement. Assign one delivery lead to run a 4-week pilot on one complex workflow – deep research synthesis, audit analysis, or multi-document comparison. Measure quality improvement against cost increase. Set a clear threshold: if the cost increase per deliverable exceeds what you can absorb or bill, the workflow stays on Opus 4.8. On governance: create a Fable 5 deployment register before any production use. Document which workflows use it, what data they touch, and which client agreements have been reviewed for the retention requirement.

  • Enterprise (150–250 staff): The advisor model pattern is the architecture to prioritise. It gives you Fable 5 quality at checkpoints without paying Fable 5 pricing for every token. Work with your ops team to map your agent pipelines and identify the judgment nodes – the points where the output goes to a client, gets committed to a system, or drives a significant decision. Route those nodes through Fable 5. Route execution between them through cheaper models. This architecture requires upfront workflow documentation, but it is the only way to scale Fable 5 economically. For regulated clients: complete DPA reviews for the 30-day retention requirement before Q3 deployments. Do not wait for a client to raise it.

How to try (15-minute path)
  1. Go to the Claude Console, accept the updated Fable 5 terms, and run one complex, multi-document task you currently do with Opus 4.8 – a proposal review, a competitive analysis, or a contract summary. Note the output quality and the token cost. (7 min)

  2. Run the same task on Opus 4.8. Compare output quality side by side. Write down the specific quality difference, if any, in one sentence. (5 min)

  3. Calculate: does the quality difference justify the price premium for this specific workflow? If yes, it is a Fable 5 candidate. If not, Opus 4.8 is the right call. (3 min)

Success metric: a documented yes/no decision for one workflow, based on actual output comparison – not benchmarks.

"It's a really big upfront cost to train the models and to serve inference on them. My guess is that over time, the sort of core set of companies that are working to advance the frontier are just going to need access to capital, and I think the public market is very well suited to that."

Daniela Amodei, President & Co-Founder, Anthropic – Bloomberg Tech Conference, 4 June 2026

Purpose: Production-grade agent infrastructure for recurring autonomous work – nightly data syncs, weekly compliance scans, daily briefings – without custom scheduler infrastructure.

Edge: Cron schedules, vault-stored credentials, and self-hosted sandbox support in one managed platform. Agents run inside your own perimeter if needed (Cloudflare, Daytona, Modal, Vercel), with Anthropic handling the orchestration layer.

→ Scheduled deployments via cron with no external scheduler to build or host
→ Credential vaults for secure CLI and authenticated service access
→ Self-hosted sandbox option for enterprise data residency requirements
→ Private MCP server connectivity for internal tool access

What did you think of today's issue?

Login or Subscribe to participate

Did you find it useful? Or have questions? Please drop me a note., I respond to all emails. Simply reply to the newsletter or email [email protected].

This issue’s sponsor

n8n

An open‑source automation platform that lets you chain tools like DeepSeek, OpenAI, Gemini and your existing SaaS into real business workflows without paying per step. Ideal as the backbone for your first serious AI automations.

Refer and win

Share this newsletter for a chance to win!

Keep Reading