The Hardest Part of Managing AI Agents Isn't the Tech. It's You.

In January 2026, BNY Mellon announced something worth understanding carefully.

They didn't deploy 20,000 AI agents. They trained 20,000 employees to build AI agents... and those employees now run 130+ autonomous "Digital Employees" across a 52,000-person workforce.

The platform is called Eliza. The results are real: legal contract review dropped from 4 hours to 1 hour. Financial planning prep cut by 60%. Three thousand vendor agreements processed annually without a human hand on each one.

A hybrid team meeting, some seats filled by human colleagues, others by glowing holographic AI presences

So here's the question the press release didn't ask: who leads the agents?

The Tech Isn't Your Problem

Most leaders who read about BNY Mellon's deployment walk away thinking about platforms and procurement. Wrong takeaway.

The challenge isn't getting AI agents into your organisation. It's what happens to your leadership when they're there.

When half your delivery runs autonomously... when agents make decisions faster than you read your inbox... when a junior analyst's work passes through a digital employee before it reaches you... your job changes.

Not incrementally. Fundamentally.

I've watched technology shifts reshape management for three decades. Email changed how teams communicated. The cloud changed how we deployed. Agile changed how we structured work. Every one of those shifts created leaders who adapted and leaders who waited too long. The agentic shift is faster and deeper than any of those, and most organisations are treating it like a procurement decision.

It isn't. It's a leadership problem.

The Jagged Frontier

Deloitte's 2026 research on agentic AI is worth reading. Only 14% of organisations have deployment-ready solutions, and 11% are actively running them in production. Gartner estimates over 40% of agentic AI projects get canceled by 2027.

Sit with those numbers. 40% canceled.

Not because the AI doesn't work. Because organisations aren't ready to integrate it into how they operate. The gap isn't in the tech stack. It's in the leadership model.

Deloitte points to a key competency most managers lack: understanding the "jagged frontier." The ability to know where AI genuinely outperforms humans and where humans remain essential.

The frontier is jagged because it's uneven. An AI agent reviewing 3,000 vendor contracts for standard compliance clauses sits on the right side of it. An agent deciding which employee to put on a performance improvement plan does not. An agent summarising regulatory changes overnight so your team starts the day informed sits on the right side. An agent setting strategy in ambiguous conditions does not.

Your job as a leader is to place work on the right side of this frontier. Not to chase the newest tool. Not to automate everything in sight. To know which judgment calls belong to a person and which don't.

A leadership skill. Not a technical one.

A human hand and a glowing digital hand reaching toward the same document on a desk, showing human-AI collaboration

The Warning From Reddit

Reddit's r/LangChain and r/artificial were buzzing last month with what happens when AI agents hit production with weak guardrails. Hallucinated citations at 0.95 confidence. Cyclic debugging loops in agent graphs. Lawyers correcting AI output by hand.

This is what happens when you prioritise deployment speed over oversight.

BNY Mellon solved this differently. They paired mass democratisation with serious governance. The Eliza platform has 125 live use cases in production. Each passed risk and compliance review. The "Empowered Builders" aren't running free... they're building within a framework.

Someone in leadership created this framework. Someone decided speed mattered AND guardrails mattered. Someone decided what agents were permitted to decide alone and what required a human at the handoff point.

Not a software setting. A leadership decision.

Who's Responsible When the Agent Gets It Wrong?

Here's the question worth sitting with.

An agent reviews a legal contract and misses a liability clause. An agent approves a vendor payment failing checks it wasn't trained to run. An agent summarises performance data incorrectly, so a manager gives feedback based on fiction.

Who owns it?

In the old model, organisations attached accountability to a person: a manager, an analyst, an executive. In an agentic world, the lines blur fast.

The answer BNY Mellon is arriving at... and I think it's right... is that a human always owns the outcome. The agent is a tool, a fast one, an autonomous one. But the leader is responsible for how it's configured, what it's pointed at, and what oversight exists.

Deloitte calls these people "agent supervisors." Not rubber-stampers of agent output. People positioned at strategic handoff points, who understand what the agent does well enough to know when to step in.

You need those people. Most organisations don't have them yet.

A manager reviewing an AI agent status dashboard, several agents active, one flagged for human review

The Skills Becoming More Valuable, Not Less

Here's the counterintuitive part. The rise of AI agents doesn't make leadership skills less important. It makes certain leadership skills far more important than before.

When agents handle the repeatable work, humans are left doing what agents genuinely struggle with: ethical judgment, relationship repair, creative ambiguity, political navigation, and understanding what a number means in context. These aren't soft skills. They're the hardest skills in any organisation.

The leaders who thrive in the next five years will be the ones who invested in developing those capabilities before they needed them. Understanding people well enough to know which humans to place at which handoff points. Building trust deep enough so your team tells you when the agent got it wrong. Having the judgment to know when the cost of speed is too high.

I've written about this dynamic on Step It Up HR: the leaders who get ahead of capability shifts don't do it by learning the technology. They do it by getting better at the human parts the technology cannot replace.

What to Do Before This Lands on Your Doorstep

BNY Mellon has 52,000 people and built a dedicated platform for this transformation. Most organisations reading this are smaller, and don't need to match that scale to start thinking seriously.

Three things worth doing now:

Map your jagged frontier. Look at what your team produces each week. What would an AI agent handle faster and with fewer errors? What requires human judgment no model should replace? Write it down. The discipline of thinking through it is valuable, even before deploying anything.

Build the oversight reflex. Your team needs the habit of asking "is this agent output trustworthy?" rather than treating AI responses like database queries always correct. Train the scepticism before you need it under pressure.

Know who your agent supervisors are. Not the most technical people. The people with enough judgment, enough domain knowledge, and enough professional backbone to say "this doesn't look right" and stop the process. If you cannot name those people now, you have work to do.

The Real Story

BNY Mellon's headline number is 20,000 empowered builders. The real story is the leadership culture making it possible.

Someone trusted 20,000 non-engineers to do serious technical work. Someone built a governance model allowing autonomous agents to make real decisions without going off the rails. Someone decided "AI for everyone, everywhere" was a leadership imperative, not an IT project.

The companies getting through the next five years well won't be the ones with the most sophisticated agents. They'll be the ones whose leaders understood the difference between deploying AI and leading through it.

Are you one of them?