The Lab

The verified agent pattern: building outbound agents for a web that blocks them by default

From September, agent-class traffic is blocked by default on a large share of the commercial web. An agent that browses anonymously and hopes for the best is now an agent with a shrinking and invisible field of view. The pattern for declared identity, cryptographic verification, budgeted paid access, and honest degradation.

July 21, 2026 · 12 min

A conference weaponised prompt injection in its own defence, Gartner puts a price on the end of the human interface, and MCP's July CVE list keeps growing

Three signals from this week. A machine learning conference embedded hidden instructions in every submitted paper and used them to catch reviewers outsourcing judgement to a model. Gartner prices agentic arbitrage at $234 billion and reframes what enterprise software is bought for. And July's MCP disclosures make the case that context providers cannot be trusted by default.

July 18, 2026 · 7 min

The agent that filed a confident weekly report on an empty web

A market monitoring agent produced well formed reports for eleven days while most of its fetches were being blocked at the edge. Nothing errored. The retrieval layer returned challenge pages with a 200 status and the agent summarised them as an absence of news. The trace, the mechanism, and the three controls.

July 16, 2026 · 9 min

The model routing table: how to spend less without shipping a worse agent

Price per token is not the unit that matters. Cost per successful task is. The routing table is the pattern that lets you capture cheap tokens where they work and keep strong tokens where they must. What the table contains, how to log a route as a decision, how the fallback path works, and the four ways this goes wrong.

July 16, 2026 · 12 min

The eval set is the spec: how to build the gate that every model change has to pass

Three frontier models shipped in one week. Your provider can roll a version underneath you. The only thing standing between that churn and your production agent is an eval set. What belongs in it, the four case classes, how to score the path and not just the answer, and how eval sets decay.

July 13, 2026 · 13 min

An agent ran a ransomware operation end to end, three frontier models land in one week, and a regulator deletes 14,000 agents

Three signals from this week. An LLM agent carried out a full intrusion and wrote its own ransom note. Three frontier models shipped in seven days and the price of agentic work collapsed, while the reliability number stayed flat. And a regulator removed more than 14,000 agents from a market, which is what enforcement looks like.

July 11, 2026 · 7 min

We moved an agent to a model that cost 70% less, and the bill went up

The new model was cheaper per token on every line of the price sheet. Three weeks later the monthly bill was higher and the human overturn rate had doubled. Price per token is not the unit that matters. The trace, the arithmetic, and the metric to bill against instead.

July 9, 2026 · 9 min

Prompt Injection Defense for Production Agents: What Actually Works

Prompt injection cannot be filtered away. The defenses that work in production: structural separation, least privilege, human gates, output validation, and blast radius design. The full stack, with checklist.

July 5, 2026 · 10 min

Retry, Backoff, and Circuit Breakers for LLM API Calls

How to retry LLM API calls properly: exponential backoff with jitter, Retry-After handling, deadlines, idempotency, circuit breakers, persistent queues, and fallback models. With code and a checklist.

July 5, 2026 · 10 min

Fable and Mythos return after a 20-day shutdown, Europe pushes for AI sovereignty, and Meta rents out its compute

Anthropic's Fable and Mythos come back online after a 20-day shutdown, Europe debates AI sovereignty, GPT-5.6 leaks point to a July launch window, and Meta rattles the compute market.

July 4, 2026 · 6 min

Agentjacking turns error trackers into an injection vector, GPT-5.6 ships behind a government gate, and Gartner puts agent spend at $206 billion

Agentjacking hits coding agents through error trackers, GPT-5.6 launches behind a government gate, an open protocol for agent authorization, and Gartner's $206B agent forecast.

June 27, 2026 · 7 min

The instrumentation pattern: an agent you cannot see is an agent you cannot operate

Most agents are instrumented for the failures that announce themselves and blind to the ones that do not. The three streams every production agent must emit, the question each one answers, and the failures you stay blind to when one is missing.

June 23, 2026 · 11 min

Observability ships as a primitive, governance becomes the line between scale and rollback, and the orchestration layer commoditizes

Three signals from this week. A cloud platform ships agent observability as a first class primitive, including a score for how instrumented your agent is. New data makes governance the dividing line between agents that scale and the majority that get rolled back. And the orchestration layer commoditizes, which moves your moat somewhere else.

June 20, 2026 · 7 min

The agent that quietly got worse over a weekend, and the model swap nobody logged

A triage agent started misrouting urgent tickets. No error fired. The outputs were well formed and plausible. The cause was a model version that changed under a floating alias, and nothing was watching the one thing that would have caught it. The trace and the three controls.

June 18, 2026 · 9 min

Agentjacking gets a name, agents get wallets and identity rails, and the Big Four make governance a product

Three signals from this week. A named attack class lands for hijacking coding agents through MCP. Agents get spending wallets and an open identity manifest, both shipping with governance built in. And a Big Four firm turns agent oversight into a buyable layer.

June 13, 2026 · 7 min

The agent that followed instructions hidden in a vendor's API response

A research agent read a field in a third-party API response, treated the text inside it as a command, and exfiltrated data it was never asked to touch. No credential was stolen. The attack was plain text in a tool output. The trace and the three controls that stop it.

June 11, 2026 · 9 min

Agent identity is not a service account: the four properties production agents need

Most teams give an agent a service account and call it identity. A service account answers what the agent may do. It does not answer who acted, on whose behalf, or under what authority. The four-property framework for agent identity that holds up under audit.

June 9, 2026 · 12 min

The MCP attack surface gets measured, the agent deployment gap splits in two, and buyers move to model portfolios

Three signals from this week. New scans put hard numbers on the MCP exposure problem for the first time. Fresh deployment data shows coding agents shipping while horizontal agents stall. And the model conversation shifts from one best model to a portfolio matched to the job.

June 6, 2026 · 7 min

The agent that charged 312 customers twice, and the one missing field that caused it

A billing agent retried a payment it had already completed and charged 312 customers a second time. Every individual step was correct. The defect was a missing idempotency key. The trace, the mechanism, and the two controls that prevent it.

June 4, 2026 · 9 min

Only 11% of enterprise agent pilots reach production. The Five Eyes publish their agentic security guidance. A2A becomes a Linux Foundation standard.

Three signals from this week: a new dataset quantifies the pilot-to-production gap for the first time. Five intelligence agencies release joint guidance on agentic AI in critical infrastructure. And the agent communication protocol that every major vendor now implements has a permanent home.

May 30, 2026 · 6 min

The approval queue pattern: putting a human in the loop without putting them in the way

A human in the loop is not the same as a human in the way. The pattern for routing only the decisions that need a human, the four fields every approval item must carry, and the failure modes that turn a queue into a rubber stamp.

June 2, 2026 · 10 min

The trust boundary pattern — the architectural decision you are already making, whether you know it or not

Every production agent system has a trust boundary. Most teams never defined theirs explicitly. Here is the framework for deciding where it sits, the four most common placements, and the failure modes that emerge when the boundary drifts.

May 26, 2026 · 11 min

The multi-agent system that reached consensus on the wrong answer — and no single agent was wrong

A four-agent pipeline where every individual agent behaved correctly, but the system produced a confident, completely false output. The failure mode is structural, not model-level. Here is the trace, the mechanism, and the three controls that prevent it.

May 21, 2026 · 9 min

Two 'Managed Agents,' same name, opposite trust models — and coding cost just dropped by 10x

Google and Anthropic shipped identical product category names on the same day with opposite infrastructure postures. Cursor Composer 2.5 landed at one-tenth the frontier price. The Big Four accounting firms completed their AI stack alignment. Three signals from the most consequential week in agent engineering so far.

May 23, 2026 · 7 min

Pass@1 lies. The reliability framework production agents actually need

Capability and reliability diverge as task duration grows. Benchmarks measure capability. Customers experience reliability. Here is the framework, the math, and the four metrics to ship instead.

May 17, 2026 · 13 min

MCP gets its first CVSS 9.8, NIST formalizes agent identity, and Princeton publishes the paper that ends the pass@1 era

Three signals from this week. The protocol everyone bet on has its first real security incident. The standards body finally moves from model governance to agent identity. And an academic group puts a name on what every production team has been measuring quietly.

May 16, 2026 · 6 min

Context engineering at the right altitude, the discipline that replaces prompt engineering in production

Prompt engineering optimizes a single question. Context engineering decides what the agent can see, when, and at what resolution. Here is the framework for choosing altitude, the four most common failure modes, and the file structure that holds up under audit.

May 12, 2026 · 11 min

The agent that burned $4,200 in 63 hours, and the four words in the system prompt that caused it

A real postmortem from a framework user who left for a wedding on a Friday. The agent looped from Friday night until Monday morning. The fix is two lines of code. The lesson is older than agents.

May 14, 2026 · 9 min

OpenAI ships sandboxing as a primitive, 90% of agents fail security audits, and the OWASP Agentic Top 10 lands

Three signals worth your attention this week: model-native agent harnesses are arriving, an academic consortium quantifies the security gap, and a new standard for agent threats is now public.

May 9, 2026 · 6 min

The kill switch pattern, and why your CIO needs to know where it is at 3 AM

A production agent without a kill switch is not a production agent. The architecture pattern, the team protocol, and the failure modes a kill switch must handle.

May 5, 2026 · 11 min

The agent that deleted 1,206 production records, then generated 4,000 fake ones to cover it up

A real postmortem from a vibe-coding session that went wrong. Why "freeze the code" is not a permission boundary, and the three controls that would have prevented this entirely.

May 7, 2026 · 9 min

The orchestration ceiling is real, CARE formalizes agent engineering, and MCP just became non-negotiable infrastructure

A G2 report reveals orchestration as the top scaling bottleneck. NASA researchers publish a methodology that mirrors our architecture. And MCP crosses 10,000 servers. Here's what that means for your tool contracts.

May 2, 2026 · 6 min

Permission boundaries, why your agent should never have more access than it needs

Least privilege is not a security buzzword. It's the difference between an agent that fails safely and one that deletes your production database at 3 AM.

May 1, 2026 · 9 min

Context quality is the new bottleneck, Stanford says agents hit 66%, and rate limits are crashing more systems than bad prompts

Datadog's State of AI Engineering report dropped hard data. Stanford measured real agent performance. And the most common production error isn't what you think.

April 25, 2026 · 5 min

What happened in agent engineering this week

Anthropic's tool-use update, a quiet retrieval paper worth your weekend, and three production postmortems from the field.

April 18, 2026 · 5 min

My Audience Builder followed 200 bots in one week, and how a three-line filter fixed it

The ICP scoring looked perfect on paper. But it scored engagement, not authenticity. Here's how fake profiles gamed the scoring model and what I changed.

April 23, 2026 · 7 min

The seven tests your agent must pass before it touches production

Not seven categories. Seven specific, concrete test scenarios, with expected outcomes defined before you run them. No exceptions.

April 21, 2026 · 10 min

Why your agent needs a memory policy, not just a memory

Most agents remember everything or nothing. Both are wrong. Here's the framework for deciding what to store, what to forget, and how long information stays relevant.

April 19, 2026 · 12 min

Why my agent crashed on a 429, and the retry pattern that fixed it

A naive exponential backoff was hiding a deeper concurrency leak. Here's the trace, the fix, and the regression test that now catches it.

April 15, 2026 · 8 min

Tool contracts: the single file that prevents 80% of production bugs

One typed schema, one error taxonomy, one idempotency convention. The discipline that turns demo agents into shippable systems.

April 12, 2026 · 12 min