Thought Leadership

Video: Azure, AMD, and Neural Magic Raise the Bar for High-Performance Computing

Mar 22, 2022

Author(s)

Sasa Zelenovic

Head of Developer Marketing, Neural Magic

Microsoft, AMD, and Neural Magic are raising the bar for high-performance computing. By combining Azure HBv3 virtual machines with our sparsity-aware inference engine, we can run deep learning workloads on CPUs at speeds previously reserved for GPUs.

For example, together we deliver a 5x inference speedup for BERT NLP models over conventional approaches. More details here.

Hear what Azure Chief Technology Officer Mark Russinovich has to say about the powerful combination of Microsoft, AMD, and Neural Magic (starting at minute 1:50).

