Agents don't respect cloud boundaries. Your architecture should.

Before you deploy agents across AWS, Azure, and GCP, there's a reckoning your infrastructure isn't ready for.

By Neal Ungerleider

Apr 2, 2026 • 5 Minute Read

Please set an alt value for this image...

Subscribe to the newsletter

Agentic AI is the next big thing. You've heard it—probably more times than you'd like.

Imagine that you’re the CTO of a company working with AWS, Google Cloud, and OCI cloud infrastructure.

Your organization sees agentic potential everywhere. There are meetings, pilot projects, and research initiatives.

Then the first big hiccup happens.

Your organization, ironically enough, rolls out a cloud cost optimization agent. The agent automatically scans your AWS and Azure accounts and recommends or applies cost optimizations.

Things go wrong. Quickly.

The agent makes hundreds of unnecessary API calls and multiple follow-up queries. Working autonomously, it sends transaction spending into the stratosphere. A single request unintentionally triggers dozens or hundreds of follow-up transactions across multiple clouds.

If your organization is embracing agentic, you need to change your cloud strategy.

Table of Contents

Agentic AI and cloud infrastructure challenges
Redefining multicloud for the agentic age
Master the stack that defines the future

Agentic AI and cloud infrastructure challenges

Enterprise cloud strategies were built for applications and data pipelines—not autonomous agents. Agents operate differently, and your infrastructure needs to reflect that.

Here's what agent-ready multicloud infrastructure actually looks like.

Agents have specific cloud infrastructure needs

Human employees understand cloud boundaries—AWS for this, Azure for that, Google Cloud for the rest. Set a red line, and they generally stay behind it.

Agents are less predictable than human workers.

They need clear guardrails and risk controls in order to operate as expected.

Agents call models, retrieve data, and execute decisions across multiple environments in real time. They deal with multiple applications and data silos simultaneously, which creates a whole bunch of new, thorny cloud architecture requirements.

Most importantly, agentic inference and training have different infrastructure requirements—and most organizations are optimized for only one of them.

The assumption that one cloud handles everything is now a liability.

Agents don’t care about your cloud spend

Cloud spend gets expensive quickly. AWS, Google Cloud, Azure, and OCI bills have a nasty habit of growing and growing.

When your agents operate autonomously across assets and services on different cloud providers, the surprises come fast.

Because agentic applications behave differently from a human at the wheel, tasks can result in unexpected API calls and multiple follow-up queries. When the agent acts up, you may end up paying an extra US$10,000 in cloud transaction fees before you can resolve the issue.

Build spend guardrails and war-game failure scenarios with your team before deployment.

Agents have a sprawl problem

“Agent sprawl” is a new-ish term that describes agent growth across teams. As organizations begin building their agentic workflows, they can quickly deploy 50-200 agents, each with its own infrastructure, monitoring, security, and compute requirements.

If agents are misconfigured, that can cause runaway reasoning loops that translate into US$10,000+ cost spikes.

Agent sprawl also translates into larger cloud and on-prem storage costs. Agents writing to conversation logs, embeds, vector databases, intermediate outputs, cached prompts, tool responses, and more can result in much larger storage bills.

Redefining multicloud for the agentic age

Build agentic, multicloud strategies that support one another. If your organization is investing additional resources into agentic, make sure your multicloud architecture doesn’t introduce new complications.

Emphasize a three-tier agent architecture emphasizing task-level utility agents, workflow-level super agents, and business-level orchestrator agents.

That covers agent architecture. Now the cloud side: look at ways to split your infrastructure across different services. Lean on public cloud for training workloads and steer inference workloads closer to operational data.

Find ways to connect your organization’s private data sources with multiple cloud AI models using high-speed interconnection for flexible inference. Set up your agentic workflows so your teams have easier workloads, not harder ones.

This requires a hybrid multicloud AI architecture as a strategy.

Leverage existing hyperscaler resources. AWS offers prescriptive guidance for agentic AI, Microsoft has extensive literature for the Foundry Agent Service, and Google Cloud has detailed Agentic AI architecture guides.

Finally, work with your team to fine-tune an orchestration layer that governs which of your agents call which model on which cloud. Edit your orchestration layer with cost and efficiencies in mind—different clouds have different strengths and pricing structures. You want results, not endless QA and fine-tuning.

Multicloud governance and agentic AI

There’s another ingredient in the mix for agentic multicloud strategy: Governance.

A combination of push and pull factors means that agentic introduction causes extra cloud concerns.

For organizations operating in the European Union, the EU AI Act has many, many provisions that will keep your company lawyers busy and make agent implementation across multiple cloud platforms more difficult.

There are also state laws in the United States (California, proposed laws in Ohio) with line-item provisions around privacy and data protection, which impact multicloud strategy when agents are involved.

Governance means tracking agent behavior, enforcing data-sharing rules, and maintaining detailed audit logs.

Compliance should be built into the architecture and not added after deployment.

Building better multicloud frameworks

The organizations that get this right won't just run agents—they'll run them predictably, at scale, with full visibility.

Autonomy requires guardrails, and it’s up to leadership to build those guardrails.

Agents can move faster than humans, call more services, and generate more infrastructure activity than traditional applications—but they can also create more risks.

Organizations that treat agents like just another application feature will quickly run into problems. Instead, anchor architectures around monitoring, cost controls, and governance from day one. Multicloud infrastructure can be an advantage in this environment—especially if it’s designed intentionally.

Done properly, modifying multicloud architecture for agentic systems will keep them safe, efficient, and accountable.

Agents are powerful but not universal. Some problems are ideal for automation; others are not. Choose your use cases deliberately.

That's the actual competitive advantage.

Master the stack that defines the future

In a world of automated deployments, the goal has shifted from "managing" cloud costs to "governing" intelligent systems. This evolution is central to cloud infrastructure and critical to AI and cybersecurity.

Stay ahead of the curve with the intel you need to lead. Subscribe to Pluralsight’s weekly newsletters—Scale, Prompt, and Exploit—for weekly deep dives into cloud, AI, and cybersecurity, delivered by experts and built for the enterprise.

Neal U.

Neal Ungerleider is Pluralsight's Senior Cloud Computing Newsletter Writer. He previously wrote for Fast Company, Wired, and many other publications. Neal is based in suburban Chicago, where he drinks lots of coffee and still roots for the Knicks.

More about this author