Video: Azure, AMD, and Neural Magic Raise the Bar for High-Performance Computing

Mar 22, 2022

Author(s)

Sasa Zelenovic

Head of Developer Marketing, Neural Magic

Microsoft, AMD, and Neural Magic are raising the bar for high-performance computing. By combining Azure HBv3 virtual machines with Neural Magic's sparsity-aware inference engine, we can run deep learning workloads on CPUs at speeds previously reserved for GPUs.

For example, together we deliver a 5x inference speedup for BERT NLP models over conventional approaches. More details here.

Hear what Azure Chief Technology Officer Mark Russinovich has to say about the powerful combination of Microsoft, AMD, and Neural Magic (starting at minute 1:50).
