Join Us Every Other Week For
vLLM Office Hours
As a leading contributor to vLLM, Neural Magic partners with vLLM project committers and the vLLM team at UC Berkeley to host bi-weekly office hours. Join us to give feedback, ask questions, and hear about cutting-edge developments to accelerate your inference. Typical office hours agenda:
- 20-minute vLLM update
- 20-minute special guest topic; see below for details 👇
- 20-minute open discussion, feedback loop, and Q&A
[vLLM Office Hours #25] Structured Outputs in vLLM - May 8, 2025
Structured outputs enable you to define specific constraints on the format of the output generated by an LLM. Join us to explore the current capabilities in vLLM, how it works, and what exciting enhancements are on the horizon. Plus hear what's new in vLLM v0.8.5!
Session slides: https://docs.google.com/presentation/d/1a5dHf3iRXSgbeOCa_TBaWujxq9B5EwEw/
Join our bi-weekly vLLM Office Hours to learn about the latest features and updates: https://hubs.li/Q02Y5Pbh0 ...