Neural Magic January 2021 Product Release

Jan 08, 2021

Author(s)

Jeannie Finks

Head of Customer Success, Neural Magic

We are excited to announce the Neural Magic January 2021 product release. This milestone contains new product features, an improved user experience, and stability enhancements that make it easier for our clients to achieve GPU-class performance on commodity CPUs.

NEW - Introducing Sparsify BETA

Sparsify is experience-driven tooling that simplifies the process of analyzing and optimizing deep learning models for performance, without sacrificing the accuracy required for business outcomes, through an interactive, GUI-based design. Users can apply industry-leading techniques in model compression, pruning, and transfer learning, codified in simple, easy-to-use recipes that work with SparseZoo models or with the user's own private models.

SparseZoo

SparseZoo shortens time to value and reduces the skill needed to build performant deep learning models by providing a collection of pre-trained, performance-optimized models to prototype from. The repository consists of popular image classification and object detection models and is constantly growing.

Performant model additions:

YOLOv3 (COCO)
ResNet-50-SSD-300 (VOC, COCO)
MobileNetv2-SSDLite (VOC, COCO)

NM Inference Engine

Enables clients to run mission-critical deep learning models on commodity CPUs, reducing cost per inference and producing price-performant deployments. This feature set includes the inference engine, ONNX conversion tooling, and a model server if needed; it focuses on model deployment and scaling machine learning pipelines.

Quantized (int8) AVX-512 VNNI convolution support for ResNet-50
Quantized (int8) support for depthwise convolutions
Benchmarking API enhancements for ease of use
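
For readers new to the term, "quantized (int8)" means that floating-point values are represented as 8-bit integers plus a scale factor. The following is a minimal, illustrative sketch of symmetric per-tensor quantization in plain Python. It is not the NM Inference Engine's implementation, and the function names `quantize_int8` and `dequantize_int8` are hypothetical names chosen for this example.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127].

    Illustrative only; this is an assumed, simplified scheme and not the
    engine's actual implementation.
    """
    max_abs = max(abs(v) for v in values)
    # One scale for the whole tensor; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float values from the int8 representation."""
    return [q * scale for q in quantized]


weights = [0.42, -1.0, 0.13, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

for original, approx in zip(weights, restored):
    print(f"{original:+.4f} -> {approx:+.4f}")
```

Each restored value differs from the original by at most half a quantization step (`scale / 2`), which is the accuracy cost traded for smaller, faster integer arithmetic.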

SparseML

Enables data scientists to optimize their models for performance without sacrificing the accuracy required for business outcomes. This feature set includes model pruning APIs and CLIs as well as transfer learning APIs and CLIs, simplifying the process of achieving performance on deep learning models with Neural Magic.

Support for PyTorch 1.7
Quantization-Aware Training and ONNX model export in PyTorch
Keras exporter for ONNX
Object Detection end-to-end install and benchmark notebooks added
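
As a rough illustration of what model pruning does, the sketch below zeroes out the smallest-magnitude weights in a layer until a target sparsity is reached. Magnitude-based pruning is one common technique; this announcement does not state which algorithm the SparseML pruning APIs use, so treat this as an assumption. The function name `magnitude_prune` is hypothetical and is not part of any Neural Magic API.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the smallest-magnitude fraction of weights (unstructured pruning).

    Illustrative only; an assumed, simplified technique rather than the
    SparseML implementation.
    """
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be in [0, 1]")
    num_to_prune = int(len(weights) * sparsity)
    # Indices ordered from smallest to largest absolute value.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned_indices = set(order[:num_to_prune])
    return [0.0 if i in pruned_indices else w for i, w in enumerate(weights)]


layer = [0.9, -0.05, 0.3, 0.01, -0.7, 0.02]
sparse_layer = magnitude_prune(layer, sparsity=0.5)
print(sparse_layer)
```

In practice, pruning is usually applied gradually during training so the remaining weights can adapt, which is how accuracy is preserved while sparsity increases.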

As of February 2021, our products have been renamed and their versions renumbered; most have been open-sourced, and their release notes can be found on GitHub!