The State of AI Infrastructure At scale 2024

AIIA / ClearML Report – March 2024

In this report, we surveyed a 1000 more enterprises on to see how they’re adapting to the growing demands of AI in their infrastructure. 96% of companies plan to expand their AI compute capacity and investment to embrace the new possibilities of AI. We discovered:

  • Open Source AI solutions and model customization are top priorities, with 96% of companies focused on customizing primarily Open Source models.
  • Optimizing GPU utilization is a major concern for 2024-2025, with the majority of GPUs underutilized during peak times.
  • A staggering 74% of companies are dissatisfied with their current job scheduling tools and face resource allocation constraints regularly, while limited on-demand and self-serve access to GPU compute inhibits productivity.

We focused on C-suite and team leaders with job titles like CIO, CTO, Head of AI, VP of Data or VP of AI, across a range of verticals.

Get it now. FREE.

Please enable JavaScript in your browser to complete this form.

Key Charts

Biggest Impact

Estimate your current allocation of existing GPU resources (i.e.non-idle GPUs) during peak periods.

When asked about peak periods for GPU usage, 15% of respondents report that fewer than 50% of available GPUs are in use. 53% believe 51-70% of GPU resources are utilized, and 25% believe their GPU utilization reaches 85%. Only 7% of companies believe their GPU infrastructure achieves more than 85% utilization during peak periods

Resources

What is your organization’s greatest concern about deploying Generative AI?

The biggest concern for deploying Generative AI was moving too fast and missing important considerations (e.g. prioritizing the wrong use cases), whereas the second most-important concern was moving too slow due to lack of ability to execute, exposing ambiguity amidst leadership. It appears that executives are caught between the desire to move quickly and the danger of costly mistakes.

Governance also weighed in the back of respondents’ minds, with upcoming regulations and lack of control over usage and scaling as the next two most-important concerns.

Level of satisfaction

Rank your organization’s compute concerns for 2024

When asked about their organization’s compute concerns, latency was the top-ranked answer for 28% of respondents, followed by power consumption which was 21% of respondents’ top-ranked issue. Time delays in getting access to compute is also weighing on respondents’ minds; although it was top-ranked for only 14% of respondents, it received 30% of the votes as the second-ranked concern.

 

The real question is can we keep up with how fast AI is moving and can we govern it well enough to deliver a fantastic experience for users?

DevOps Lead at Fortune 500 Company

Get it Now!

And start planning for the future, today.

Download the Report

Just plug in your email and we'll immediately redirect you to the report to download now!

Report - AI Ecosystem

Your report is on the way. Check your email. Be sure to CHECK YOUR PROMOTIONS OR SPAM Folders!

Download the Report

Just plug in your email and we'll immediately redirect you to the report to download now!

Your report is on the way. Check your email. Be sure to CHECK YOUR PROMOTIONS OR SPAM Folders!