Telefónica and Multiverse Computing Develop an AI-Based Model to Support Customer Service Agents With 75% Energy Savings
Madrid, November 12, 2025 -- Telefónica and Multiverse Computing have reached an important milestone in the application of artificial intelligence in the telecommunications sector by successfully compressing and fine-tuning two large language models (LLMs) for internal use in customer service.
These compressed models are expected to be used in the near future within the chat system that supports Customer Service agents as part of the “Movistar por ti” initiative, Movistar’s new, more agile, proactive and customer-focused approach to care. The goal is to speed up responses to queries while reducing the energy consumption of the systems.
The solution, based on AI model compression, delivers major improvements in speed, efficiency, energy usage and costs, all while maintaining the accuracy of the information that helps agents manage customer service more effectively.
Specifically, Multiverse Computing has applied cutting-edge quantum-inspired techniques to Meta’s Llama 3.1 8B and Llama 3.3 70B, pre-trained models that can be applied to a wide variety of intelligent assistant use cases.
The result is an 80% reduction in model size, which considerably decreases storage needs while maintaining the quality of the generated responses.
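To put that figure in context, the rough storage arithmetic can be sketched as follows. This is an illustrative estimate only: the parameter counts and the 80% reduction come from the announcement, while the 16-bit (2-byte) uncompressed baseline is an assumption; the actual weight formats and compression pipeline used by Multiverse Computing are not public.

```python
# Illustrative storage estimate for the two Llama models named above,
# before and after an 80% size reduction.

BYTES_PER_PARAM = 2        # assumption: fp16/bf16 uncompressed baseline
REDUCTION = 0.80           # 80% size reduction, per the announcement

models = {
    "Llama 3.1 8B": 8e9,   # approximate parameter counts
    "Llama 3.3 70B": 70e9,
}

for name, params in models.items():
    original_gb = params * BYTES_PER_PARAM / 1e9
    compressed_gb = original_gb * (1 - REDUCTION)
    print(f"{name}: ~{original_gb:.0f} GB -> ~{compressed_gb:.0f} GB")
```

Under these assumptions, the 70B model would shrink from roughly 140 GB to under 30 GB, which is what makes deployment on more modest on-premise hardware plausible.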
Another important aspect is the environmental dimension: in addition to running in the cloud, the compressed models can be deployed directly on Telefónica’s network, including local (on-premise) facilities. This makes it possible to reduce energy consumption by up to 75% compared to the uncompressed models.
This improvement also reinforces Telefónica and Multiverse Computing’s joint commitment to reducing the environmental impact of technology.
Furthermore, thanks to local deployment in Telefónica’s central offices, where 100% of the electricity comes from renewable sources and efficiency is continuously being improved, the operator has also managed to reduce the CO₂ emissions associated with this use of artificial intelligence.
The original Llama models that were compressed are open source, in line with Telefónica’s goal of promoting openness, security and technological neutrality to foster standards and accelerate AI adoption.
Ultimately, Telefónica’s future application of the compressed models will deliver major operational efficiencies in the use of artificial intelligence: they maintain the original quality of large language models (LLMs) while running on much more modest hardware, lowering query costs both in the cloud and on-premise. On top of this, energy consumption at Telefónica’s facilities will be kept to a minimum.
A pioneering collaboration for scalable AI
This collaboration highlights the strategic potential of combining Telefónica’s scale with Multiverse Computing’s deep technical innovation. By deploying a compressed, high-performance and environmentally efficient AI solution, both companies reaffirm their leadership in developing more accessible and scalable AI for enterprise use.