How to Get Faster MobileNetV2 Performance on CPUs

07/23/20
TL;DR: Learn about use cases for lightweight MobileNetV2 models and how Neural Magic’s Inference Engine exploits their architecture to run them even faster on commodity CPUs. Read time: 4 minutes, 32 seconds. Ever wonder which machine learning model powers “Portrait Mode” on your iPhone or the ability to swap out backgrounds on…

What deep learning models are in the Neural Magic Model Repo?

06/18/20
Last updated: November 17, 2020. The Neural Magic Model Repo includes pre-trained, performance-optimized models ready to use in your machine learning projects. The Model Repo features models sparsified with the latest pruning techniques to deliver exceptional performance on CPUs, and it accelerates the process of deploying those models in production. Currently, teams can choose from a…

Neural Magic Launches High-Performance Inference Engine and Tool Suite for CPUs

06/18/20
Run computer vision models at lower cost with a suite of new tools that simplify model performance. Today, Neural Magic is announcing the release of its Inference Engine software, the NM Model Repo, and its ML Tooling. Now, data science teams can run computer vision models in production on commodity CPUs – at a fraction…

How to Run ResNet at a Fraction of the Cost

06/04/20
With greater speeds and accuracy. Does your data science team use ResNet? Neural Magic found a novel way to run ResNet models on commodity CPUs with GPU-class performance, at a fraction of the cost. By making ResNet models achieve best-in-class performance on everyday CPUs, teams can experience drastic cost savings. In this blog post, we’ll…

Neural Magic, Cisco, and Intel Collaborate to Accelerate Deep Learning Performance

05/29/20
Read time: 2 min. We are excited to announce a collaboration between Neural Magic, Cisco, and Intel to accelerate deep learning performance. Today, enterprises struggle to get trained machine learning models into production in support of their mission-critical business applications and subsequent inference needs. Too often, sacrifices and trade-offs are…

Who is Neural Magic? How does it work?

05/22/20
Neural Magic is expanding what’s possible with machine learning. It levels the machine learning playing field by turning everyday CPUs into high-performance machine learning compute resources. With Neural Magic, you can now achieve machine learning performance breakthroughs, at scale, with all the flexibility and cost efficiency of software. How is this possible? Let us…

The combination of the right software and commodity hardware will prove capable of handling most machine learning tasks

05/14/20
Earlier this year, Nir Shavit, professor of EECS at MIT and CEO of Neural Magic, joined Ben Lorica for an open discussion on the Data Exchange Podcast. The conversation spanned multicore software, neurobiology and deep learning. The full episode can be downloaded from iTunes, Android, Spotify, Stitcher, Google, and RSS. The full transcript follows, lightly…

Companies Lack Resources to Get Deep Learning Models into Production [Survey]

04/30/20
How many deep learning models do companies typically have in production? A lot fewer than you’d think. 84% of companies had five or fewer models in production. For many teams, this process is simply too hard or too costly. We recently surveyed more than 290 machine learning engineers and data scientists to find out how…

The Challenges of EfficientNets And the Way Forward

04/27/20
If you work in the world of deep learning, odds are you know all about EfficientNets, a family of models developed by Google researchers that achieve better accuracy with much smaller models than previous convolutional neural networks (CNNs). They are, more specifically, an image classification architecture that has set new records for accuracy on the…

The Software GPU, Pruning for Success & More at ODSC East

04/01/20
Neural Magic to Speak at and Sponsor ODSC EAST 2020. Neural Magic is excited to be participating in the Open Data Science Conference East, also known as ODSC EAST, this April. The conference, to be held virtually (previously in Boston) April 14th-17th, will feature people and companies who are working on the cutting edge of…