YOLOv8 Detection 10x Faster With DeepSparse—Over 500 FPS on a CPU

01/18/23
Introducing YOLOv8—the latest object detection, segmentation, and classification architecture to hit the computer vision scene! Developed by Ultralytics, the authors behind the wildly popular YOLOv3 and YOLOv5 models, YOLOv8 takes object detection to the next level with its anchor-free design. But it's not just about cutting-edge accuracy. YOLOv8 is designed for real-world deployment, with a… Read More YOLOv8 Detection 10x Faster With DeepSparse—Over 500 FPS on a CPU

Revolutionizing Biology Research With Lightning-Fast NLP: Introducing Sparse BioBERT

01/09/23
The field of biology is constantly advancing as researchers around the world work to uncover new insights into the mechanisms of life. With the vast amount of information being published on a daily basis, it can be a daunting task for biologists to stay up-to-date and extract relevant data for their research. This is where… Read More Revolutionizing Biology Research With Lightning-Fast NLP: Introducing Sparse BioBERT

Deploy YOLOv5 With Neural Magic’s DeepSparse for GPU-Class Performance on CPUs 

01/06/23
We’ve partnered with Ultralytics to optimize and simplify your YOLOv5 deployment. Want to accelerate the deployment of your YOLOv5 models? We’ve got you covered! Introducing our newest partner, Ultralytics, who makes artificial intelligence easy. While Ultralytics YOLOv5 object detection architectures and pre-trained models offer popular vision AI, Neural Magic provides software tools that emphasize peak… Read More Deploy YOLOv5 With Neural Magic’s DeepSparse for GPU-Class Performance on CPUs 

Image Segmentation: Your Ultimate Guide to Easy Deployment and Fast Inferencing

01/04/23
In the Sparse Real-time Instance Segmentation post, you saw how to perform real-time segmentation on a laptop using YOLACT (You Only Look At CoefficienTs). You learned that image segmentation is applied in areas such as detecting a fruit, picking it up, and placing it in a bin. Image segmentation models can also: Image segmentation—also referred… Read More Image Segmentation: Your Ultimate Guide to Easy Deployment and Fast Inferencing

2022 Year in Review at Neural Magic

12/30/22
Neural Magic is excited to wrap up a year of innovation that included new versions of our machine learning libraries and tools—DeepSparse, SparseML, and SparseZoo, which are designed to accelerate inference using the power of software. Here are some of Neural Magic’s greatest hits for 2022:   Innovation: We proudly launched commercial support for our… Read More 2022 Year in Review at Neural Magic

Real-time Instance Segmentation With Sparse YOLACT on a Laptop

12/28/22
Accurately detecting and segmenting objects during sorting and packaging helps improve the process quality and significantly lower inspection costs of outbound packages. Fruit detection is an example of such a task. The objective is to identify the presence of fruit, classify it, and accurately locate it.  Detailed information about the object's location can be obtained… Read More Real-time Instance Segmentation With Sparse YOLACT on a Laptop

Neural Magic 1.3 Product Release

12/23/22
As the year comes to a close and we look forward to celebrating the holidays together with our friends and families, all of us at Neural Magic would like to thank you for your continued community support. Here are highlights of the 1.3 product release of our DeepSparse, SparseML, and SparseZoo libraries. The full technical… Read More Neural Magic 1.3 Product Release

Accelerate Hugging Face Inference Endpoints with DeepSparse

12/19/22
A well known burden with today’s cloud providers is the ever-growing necessity for technical experts to handle their infrastructure. The unfortunate consequence of this challenge can ultimately lead to long deployment times impacting the iteration rate on your model’s journey to production. Hugging Face 🤗 Inference Endpoints is a new service for automating the deployment… Read More Accelerate Hugging Face Inference Endpoints with DeepSparse

Search Documents Quickly With Extractive Question Answering and Sparse Transformers

12/13/22
Companies have numerous documents such as wikis and internal documentation. These documents could be in the hundreds or thousands. Searching for information from these documents is a painful, long, and tedious process. For instance, you have to manually go through numerous documents to get an answer to a specific question. What if there was a… Read More Search Documents Quickly With Extractive Question Answering and Sparse Transformers

Deploy Serverless Machine Learning Inference on AWS Lambda with DeepSparse

12/01/22
AWS Lambda is a serverless, event-driven environment for making quick auto-scalable deployments for various applications including machine learning. The most convenient feature of serverless environments is that server management is delegated to the AWS infrastructure, allowing the developer to focus on the deployment with minimum management. In addition, Lambda only incurs a cost in the… Read More Deploy Serverless Machine Learning Inference on AWS Lambda with DeepSparse