
The compute rethink: Scaling AI where data lives, at the edge


Presented by Arm


Artificial intelligence is rapidly transitioning from centralized cloud environments to the very edge where data originates: embedded within devices, sensors, and network nodes. This migration toward edge AI is propelled by critical factors such as reducing latency, enhancing privacy, and lowering operational costs, all of which are top priorities for organizations intensifying their AI investments.

Chris Bergey, Senior Vice President and General Manager of Arm’s Client Business, emphasizes the strategic advantage for business leaders: prioritize AI-centric platforms that complement cloud infrastructure, enable instantaneous decision-making, and safeguard sensitive information.

“With the surge in connected devices and the expansion of the Internet of Things (IoT), edge AI unlocks unprecedented opportunities for companies to outpace competitors through accelerated and efficient AI processing,” Bergey notes. “Early adopters don’t just gain operational efficiency; they transform customer expectations. AI is evolving into a key differentiator in trust, agility, and innovation. Embedding AI deeply into business processes accelerates the compounding of these benefits.”

Transforming Operations by Processing Data Locally

Organizations are realizing that deploying AI at the edge is more than a performance enhancement; it represents a fundamental shift in operational strategy. By analyzing data on-site, businesses reduce reliance on cloud connectivity, enabling faster, more secure, and context-aware decisions.

Consider a manufacturing plant that instantly monitors machinery health to preempt failures, or a healthcare facility that runs diagnostic AI models directly on-premises to protect patient confidentiality. Similarly, retail outlets utilize in-store AI-powered video analytics to optimize customer experiences, while logistics firms leverage edge AI to streamline fleet management in real time.

By minimizing the need to transmit large datasets to centralized servers, companies can act on insights immediately where data is generated. This approach fosters a responsive, privacy-conscious, and cost-efficient AI ecosystem.
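The manufacturing example above, where a plant monitors machinery health locally to preempt failures, can be sketched as a simple on-device anomaly detector. This is an illustrative toy, not any vendor's product: it flags a sensor reading whose rolling z-score exceeds a threshold, all without sending raw data off the device. The class and parameter names are hypothetical.

```python
from collections import deque
from math import sqrt


class EdgeAnomalyDetector:
    """Toy on-device monitor: flags readings that deviate sharply from a
    rolling baseline, with no cloud round trip and no raw-data upload."""

    def __init__(self, window: int = 50, threshold: float = 3.0):
        self.readings = deque(maxlen=window)  # recent sensor history
        self.threshold = threshold            # z-score cutoff for an alert

    def observe(self, value: float) -> bool:
        """Record a reading; return True if it looks anomalous."""
        is_anomaly = False
        if len(self.readings) >= 10:  # wait for a minimal baseline
            mean = sum(self.readings) / len(self.readings)
            var = sum((x - mean) ** 2 for x in self.readings) / len(self.readings)
            std = sqrt(var)
            if std > 0 and abs(value - mean) / std > self.threshold:
                is_anomaly = True
        self.readings.append(value)
        return is_anomaly
```

In a real deployment only the alert (a single boolean or event) would cross the network, which is exactly the bandwidth and privacy advantage the local-processing approach describes.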

Meeting Consumer Demands for Speed and Privacy

Arm’s collaboration with Alibaba’s Taobao platform, the largest e-commerce marketplace in China, demonstrates the power of on-device AI. By enabling product recommendations to update instantly without cloud dependency, shoppers enjoy faster, more personalized experiences while their browsing data remains private.

In consumer electronics, Meta’s Ray-Ban smart glasses exemplify a hybrid AI model: quick user commands are processed locally for immediate feedback, whereas complex functions like language translation and image recognition are handled in the cloud.
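The hybrid pattern described above, with quick commands handled locally and heavy functions sent to the cloud, amounts to a routing decision per request. The sketch below is a minimal, hypothetical policy (the task names, thresholds, and `route` function are all assumptions for illustration, not the logic of any actual product): keep a request on-device when the local model can handle it, when it carries personal data, when the network is down, or when the latency budget is too tight for a round trip.

```python
from dataclasses import dataclass
from enum import Enum, auto


class Target(Enum):
    ON_DEVICE = auto()
    CLOUD = auto()


@dataclass
class Request:
    task: str                   # e.g. "wake_word", "translate", "image_caption"
    latency_budget_ms: int      # how long the user will wait
    contains_personal_data: bool


# Hypothetical set of tasks the small local model handles well.
LOCAL_CAPABLE = {"wake_word", "voice_command", "keyword_spotting"}


def route(req: Request, network_up: bool = True) -> Target:
    """Pick an execution target for a hybrid edge/cloud assistant."""
    if req.task in LOCAL_CAPABLE:
        return Target.ON_DEVICE   # fast path: immediate local feedback
    if req.contains_personal_data or not network_up:
        return Target.ON_DEVICE   # privacy requirement or offline fallback
    if req.latency_budget_ms < 50:
        return Target.ON_DEVICE   # a cloud round trip would blow the budget
    return Target.CLOUD           # heavy models (translation, vision) run remotely
```

The design choice worth noting is that privacy and connectivity act as hard overrides, while latency is a tunable threshold; real systems layer battery state and model quality onto the same decision.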

“Every technological revolution reshapes how businesses engage customers and generate revenue,” Bergey explains. “As AI capabilities and user expectations evolve, shifting intelligence closer to the edge becomes essential to deliver the immediacy and trust consumers now demand.”

This paradigm is also reflected in everyday tools such as Microsoft Copilot and Google Gemini, which blend cloud and edge AI to provide generative AI experiences that are faster, more secure, and contextually aware. Across sectors, the principle remains: the more intelligence that can be safely and efficiently deployed at the edge, the more responsive, private, and valuable the operation.

Scaling Intelligence with Sustainable Infrastructure

The proliferation of AI at the edge necessitates not only advanced processors but also intelligent infrastructure design. Matching computational resources to specific workload requirements allows enterprises to optimize energy use without sacrificing performance, a critical factor as sustainability becomes a competitive edge.

“Demand for compute power, whether cloud-based or on-premises, is skyrocketing,” Bergey states. “The challenge lies in extracting maximum value from that compute. This is achievable only by investing in scalable platforms and software aligned with your AI goals. Ultimately, success is measured by the value created for the enterprise, not just efficiency metrics.”

Foundations for Next-Generation AI Performance

The fast-paced development of AI models, especially those supporting edge inference, multimodal applications, and ultra-low-latency use cases, requires a robust hardware foundation that balances high performance with energy efficiency. Traditional architectures designed for legacy workloads no longer suffice in this diverse and distributed landscape.

Modern CPUs are evolving into the core of heterogeneous systems that deliver sophisticated on-device AI capabilities. Their versatility, power efficiency, and extensive software ecosystem enable them to handle everything from conventional machine learning to advanced generative AI tasks. When combined with specialized accelerators like Neural Processing Units (NPUs) or Graphics Processing Units (GPUs), CPUs orchestrate workload distribution intelligently, ensuring optimal performance and energy use.
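The orchestration role described above, where the CPU decides which unit runs each workload, can be illustrated with a toy dispatcher. This is a deliberately simplified sketch of the general heterogeneous-scheduling idea, not Arm's actual scheduling logic; the workload categories, size cutoff, and function names are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    name: str
    kind: str     # e.g. "matmul", "conv", "graphics", "control"
    size: int     # rough operation count


def dispatch(w: Workload, has_npu: bool, has_gpu: bool) -> str:
    """Toy CPU-side scheduler: route each workload to the unit likely to
    run it most efficiently, falling back to the CPU when nothing fits."""
    # Large tensor operations get the best performance-per-watt on an NPU.
    if w.kind in ("matmul", "conv") and has_npu and w.size > 1_000_000:
        return "NPU"
    # Rendering and other massively parallel graphics work suits the GPU.
    if w.kind == "graphics" and has_gpu:
        return "GPU"
    # Small, branchy, or unsupported work stays on the versatile CPU.
    return "CPU"
```

Even this toy version shows why the CPU remains the core of the system: it is the universal fallback, and the dispatch decision itself is exactly the kind of control-flow-heavy work CPUs excel at.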

Arm’s innovations, such as the Scalable Matrix Extension 2 (SME2) for Armv9 CPUs, provide cutting-edge matrix computation acceleration. Additionally, Arm KleidiAI, an intelligent software layer integrated with leading AI frameworks, automatically enhances performance across a broad spectrum of AI applications-from natural language processing to speech and vision-on Arm-based edge devices, all without requiring developers to modify their codebases.

“These advancements empower AI frameworks to fully leverage Arm-based hardware capabilities effortlessly,” Bergey explains. “This approach makes AI scalable and sustainable by embedding intelligence directly into the compute foundation, enabling innovation to progress at the pace of software development rather than hardware cycles.”

Democratizing access to powerful compute resources will drive the next generation of intelligent, real-time experiences enterprise-wide-not just in flagship products but across entire device ecosystems.

The Future Trajectory of Edge AI

As AI transitions from experimental pilots to widespread adoption, success will favor organizations that seamlessly integrate intelligence throughout their infrastructure layers. Autonomous AI systems will rely on this interconnectedness to perform reasoning, coordination, and instant value delivery.

“History shows that during disruptive waves, slow-moving incumbents risk being outpaced by agile newcomers,” Bergey warns. “The leaders of tomorrow are those who embrace an AI-first mindset daily. Just as the internet and cloud computing reshaped industries, those who fully commit to AI will define the coming decade.”

