The explosive growth of generative AI has led to a boom in demand for AI wares from both on-premises and public cloud providers, with many expecting this trend to continue for at least the next five years. However, beneath the surface, a different story is unfolding. Chief Information Officers (CIOs) and Chief Financial Officers (CFOs) are increasingly vocal about unexpectedly high cloud expenses, which are approximately 2.5 times more expensive than anticipated. As cloud AI adoption continues to rise, concerns about larger, unpredictable cloud bills in the future are growing.
The high costs of AI systems are a major contributing factor to these concerns. Training a single advanced AI model can cost tens of millions of dollars, with ongoing costs for fine-tuning, retraining, and inferencing. While public cloud providers have the infrastructure to handle these tasks, the costs are becoming unsustainable for many enterprises. Alastair Edwards, chief analyst at Canalys, highlights the dilemma organizations are facing: as they move past the experimental and training phases of AI adoption into production-scale inferencing, the financial costs start to outweigh the benefits.
Cloud computing typically offers predictable economic benefits, including pay-as-you-go pricing and on-demand elasticity. However, as AI use cases grow and scale across an organization, these predictable economics quickly lose their luster when companies face around-the-clock usage of hundreds or thousands of GPUs or other resources needed for AI. The growing disparity between costs and benefits is becoming increasingly apparent.
Further compounding the issue, energy costs are rising worldwide at the same time AI systems demand ever-increasing power for training, cooling, and deployment. A report from IDC suggests that corporate spending on compute and storage hardware for AI deployments grew by 37% in the first half of 2024. While public clouds are still capturing the lion's share of early-stage AI investments, a growing portion of that spending is being redirected outside of public cloud providers as enterprises transition to deploying AI at scale.
In response to these challenges, a new ecosystem of AI infrastructure providers has emerged to fill the growing gaps in cost competitiveness that public clouds are leaving behind. Colocation services, GPU-as-a-service specialists, and hybrid cloud providers offer enterprises an attractive middle ground, allowing them to maintain better control over their AI workloads while sidestepping the runaway expenses of running these systems exclusively on public clouds.
Companies like CoreWeave and Foundry are leading the charge in the GPU-as-a-service market, offering heavy investments in GPU capacity and pay-as-you-go models that rival those of hyperscalers. Even legacy players like Rackspace are getting in on the action by launching their own GPU-as-a-service offerings, while colocation providers are also seeing renewed interest. These alternatives are often built from the ground up to handle the unique demands of modern AI infrastructure, featuring high-density GPU configurations, liquid cooling systems, and energy-efficient designs.
Public cloud providers, who have positioned themselves as the natural home for building and deploying AI workloads, are at risk of pricing themselves out of their market. By relying heavily on consumption-based pricing models, they are failing to address the concerns of their customers. As companies shift from experimentation to production, long-term, GPU-heavy AI workloads don't translate into cost efficiencies. If public cloud vendors don't adjust their business models to address these concerns, they risk being sidelined by players more attuned to AI's unique demands and economics at scale.
The irony is that cloud providers, who helped create today's AI gold rush, are in danger of losing the market they helped create. The very users they worked so hard to attract are finding that colocation services, GPU-as-a-service providers, and other hybrid infrastructure models offer a more sustainable balance between cost, control, and flexibility. It remains to be seen whether public cloud vendors will adapt to these changing market dynamics and customer needs, or continue down a path that may ultimately lead to their downfall.