Jan 08, 2021
We are excited to announce the Neural Magic January 2021 product release. This milestone contains new product features, an improved user experience, and stability enhancements that make it easier for our clients to achieve GPU-class performance on commodity CPUs.
NEW - Introducing Sparsify BETA
Experience-driven tooling that simplifies analyzing and optimizing deep learning models for performance, without sacrificing the accuracy required for business outcomes, through an interactive, GUI-based design. Users can leverage industry-leading techniques in model compression, pruning, and transfer learning, codified in simple, easy-to-use recipes that can be applied to SparseZoo models or to users' private models.
SparseZoo
Shortens time to value and reduces the skill needed to build performant deep learning models by providing a collection of pre-trained, performance-optimized models to prototype from. The repository consists of popular image classification and object detection models and is constantly growing.
Performant model additions:
- YOLOv3 (COCO)
- ResNet-50-SSD-300 (VOC, COCO)
- MobileNetv2-SSDLite (VOC, COCO)
NM Inference Engine
Enables clients to run mission-critical deep learning models on commodity CPUs, reducing cost per inference and enabling price-performant deployments. This feature set includes the inference engine, ONNX conversion tooling, and an optional model server, and is focused on model deployment and scaling machine learning pipelines.
- Quantized (int8) AVX-512 VNNI convolution support for ResNet-50
- Quantized (int8) support for depthwise convolutions
- Benchmarking API enhancements for ease of use
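For readers unfamiliar with the term, int8 quantization stores values as 8-bit integers plus a floating-point scale. The snippet below is a minimal sketch of that idea, assuming symmetric per-tensor scaling; it is not the inference engine's implementation, and the function names `quantize_int8` and `dequantize_int8` are hypothetical and used for illustration only.

```python
def quantize_int8(values):
    """Symmetric per-tensor int8 quantization (illustrative assumption).

    Returns the integer codes and the single scale shared by all values.
    """
    max_abs = max((abs(v) for v in values), default=0.0)
    # One scale for the whole tensor; fall back to 1.0 for all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer and clamp to the symmetric int8 range.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int8(codes, scale):
    """Map integer codes back to approximate floating-point values."""
    return [c * scale for c in codes]


weights = [0.5, -1.27, 0.003, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)
print(codes, scale, restored)
```

Each restored value differs from the original by at most half a quantization step (`scale / 2`), which is the precision traded for smaller, faster integer arithmetic.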
SparseML
Enables data scientists to optimize their models for performance without sacrificing the accuracy required for business outcomes. This feature set includes model pruning APIs and CLIs as well as transfer learning APIs and CLIs, simplifying the process of achieving performance on deep learning models with Neural Magic.
- Support for PyTorch 1.7
- Quantization-Aware Training and ONNX model export in PyTorch
- Keras exporter for ONNX
- Object Detection end-to-end install and benchmark notebooks added
As of February 2021, our products have been renamed and their versions renumbered. Most have been open-sourced, and their release notes can be found on GitHub:
- Sparsify
- SparseML (formerly Neural Magic ML Tooling)
- SparseZoo (formerly Neural Magic Model Repo)
- DeepSparse Engine (formerly Neural Magic Inference Engine)