AI must get smaller to go big in enterprise

AI energy efficiency

From virtual assistants and copilots to predictive analytics and intelligent automation, AI systems have become embedded into how organisations operate and compete. However, as adoption accelerates, a significant obstacle is forming: energy consumption.

Data centres already rank amongst the largest electricity users, and AI is intensifying that demand. According to the International Energy Agency, energy use tied to AI workloads is forecast to rise by around 15% annually, outstripping growth in most other industries. By 2030, electricity demand from data centers worldwide is expected to double.

Training and deploying large language models (LLMs) requires vast computational power, and every increase in model complexity brings a corresponding rise in electricity usage. This trend prompts a fundamental question about AI’s trajectory: can innovation keep scaling if it depends on continuously expanding power requirements?

Power constraints can restrict AI’s growth

For years, AI’s progress has been fuelled by scale. Expanding model sizes, increased parameters and ever-larger datasets have produced notable leaps in capability. However, the financial and resource costs of achieving those improvements have escalated too.

Energy pricing, grid capacity and data centre availability have shifted from operational details to strategic considerations. In some regions, limited access to reliable power is already influencing where AI infrastructure can be constructed and which organisations can sustain large-scale deployments.

This creates a complex balancing act. Businesses see AI as a driver of productivity and differentiation, yet the ongoing cost of running advanced models can be significant. Policymakers face a broader dilemma – how to encourage AI-led economic expansion while safeguarding energy security and environmental commitments. If AI systems continue to grow without regard for efficiency, rising energy demand could become a bottleneck at a critical moment of growth.

Cost-efficiency will democratise AI

Efforts to democratise AI often emphasise open access to tools and models, yet cost remains a decisive but overlooked factor. If only the largest and most resource-rich organisations can afford to operate advanced AI, its advantages will remain unevenly distributed.

In reality, most organisations do not require cutting-edge, ultra-large models. They need solutions that are dependable, scalable and economically sustainable. This applies across all sectors, from government departments and manufacturers to mid-sized enterprises and emerging companies.

AI holds immense potential to transform industries, enhance productivity and address complex challenges. Ensuring that its growth remains both inclusive and sustainable will shape how broadly those gains are realised.

Efforts to democratise AI often emphasise open access to tools and models, yet cost remains a decisive but overlooked factor. If only the largest and most resource-rich organisations can afford to operate advanced AI, its advantages will remain unevenly distributed.

In reality, most organisations do not require cutting-edge, ultra-large models. They need solutions that are dependable, scalable and economically sustainable. This applies across all sectors, from government departments and manufacturers to mid-sized enterprises and emerging companies.

Delivering strong results from smaller models

There is a persistent belief that downsizing AI models inevitably weakens their capabilities, but recent developments in optimisation techniques are now challenging that assumption. Approaches such as model compression, pruning and optimisation make it possible to significantly reduce the size of LLMs while preserving their effectiveness in practical applications.

This enables organisations to implement AI in environments where large-scale systems would be cost-prohibitive or technically impractical, without compromising performance. The scale of improvement can be striking. In some cases, compressed models are up to 95% smaller, requiring dramatically less memory and compute power.

These efficiencies translate into reduced energy use and faster response times, while maintaining expected accuracy levels. This evolution signals a shift in priorities, away from sheer expansion and toward thoughtful optimisation. Performance is no longer measured purely by size, but by how efficiently intelligence can be delivered in real-world contexts.

Securing sustainability while remaining competitive

As AI becomes integral to digital infrastructure, scrutiny of its environmental impact will intensify. Organisations are under increasing pressure to meet ESG targets, and stakeholders are paying closer attention to the sustainability of digital operations. At the same time, governments must integrate AI growth into long-term energy and climate planning.

Energy-conscious AI strategies align with these objectives, as reduced electricity consumption decreases carbon emissions, eases stress on power grids and drops the cost of deployment. This approach also enhances resilience by reducing dependence on resources. Focusing on efficiency does not hinder innovation, but instead removes one of the major obstacles to sustained expansion, creating a more stable foundation for advancement.

Ensuring the future of AI

The future of AI will be defined less by model size and more by practical, scalable implementation. The next wave of progress depends on solutions that combine capability with efficiency and sustainability.

Achieving this will require a coordinated effort across the ecosystem – from researchers designing leaner architectures to organisations reassessing deployment strategies and infrastructure choices. It also demands a broader view of innovation, one that recognises efficiency as a key performance metric alongside accuracy and scale.

AI holds immense potential to transform industries, enhance productivity and address complex challenges. Ensuring that its growth remains both inclusive and sustainable will shape how broadly those gains are realised.

Confronting the energy demand of AI is central to that mission. Managed effectively, it can usher in a future where intelligent systems are not constrained by escalating power consumption, but strengthened by smarter, more efficient engineering.

Enrique Lizaso, Co-Founder & CEO of Multiverse Computing

Enrique Lizaso

Enrique Lizaso, Co-Founder & CEO of Multiverse Computing, where he leads the development of advanced AI and quantum-inspired optimisation technologies. His work focuses on making machine learning more efficient, scalable and energy conscious, while applying quantum and quantum-inspired solutions to financial institutions, banks, investment funds and tax agencies.

Author

Scroll to Top

SUBSCRIBE

SUBSCRIBE