Events
Connect with the community, attend workshops, and join our weekly office hours. Check out our events on Luma
Upcoming Events
AMD Dev Day 2026
Building Next-Gen AI Infrastructure: Scaling Enterprise LLM Serving and Training with RadixArk
AMD Dev Day Meetup with dstack and Crusoe
dstack and Crusoe are hosting an AI infra & open-source models meetup after AMD Dev Day, bringing together GPU experts, AI researchers, and AI infra experts
Core-Dev Meeting
This weekly meeting brings together core developers to review major features, urgent issues, and the project roadmap. All contributors are welcome to participate and submit proposals.
Dynamo After Hours
Join us for a high-signal night with the NVIDIA developer community—built for developers pushing the limits of AI systems, inference, and scale. We will be speaking about Agentic inference with SGLang
SGLang Workshop @ GOSIM Paris 2026
We're running live labs so developers, builders, and researchers in Paris can get hands-on with LLM and diffusion inference on SGLang, plus a deep dive into end-to-end RL training with the Miles project
SGLang Office Hours
We invite you to join the core team for our bi-weekly office hours.
MLSys 2026 Happy Hour
CVPR 2026 Happy Hour
SGLang & LMSYS Summit 2026
With SGLang now powering inference across major labs and hyperscalers, we're bringing together 1,500+ top-tier practitioners to shape the future of open-source AI infra. We’re opening with speakers like Lip-Bu Tan (Intel CEO), Logan Kilpatrick (Google DeepMind), Bill Jia (Google), and more!
Past Events
SGLang Office Hours on Gemma 4
Office HoursLet's Talk Gemma 4: Architecture & How to Run It on SGLang. Khoa Pham, SGLang developer, broke down Gemma 4's architecture and how to run it with SGLang. We walked through how the model works, cover what it takes to support it in SGLang, share deployment tips across GB200, GB300, and AMD.
HumanX 2026 After Hours with SGLang
MeetupWith SGLang, Qwen, MuleRun, Novita AI, Modal, and Doubleword covering everything from frontier models to inference, agents, and production infra.
SGLang Training Lab
WorkshopThis training lab demonstrated how to optimize and scale LLM workflows with SGLang. Walked through practical performance tuning using the SGL-Cookbook, talked about profiling and bottleneck analysis, and demonstrated deep integration with Miles RL framework.
SGLang x Alibaba Cloud x NVIDIA x Qwen
MeetupFeaturing core members from SGLang, Qwen, Alibaba Cloud's Tair KVCache team, and partners like NVIDIA and Mooncake, covering topics from SGLang's roadmap to low-latency LLM inference optimization and KVCache storage breakthroughs.
SGLang Office Hours
Office HoursOH on SGLang-Diffusion.CloudRipple from OpenMOSS shared how they brought MoVA model support to SGLang-Diffusion. Xingyu from Skywork AI walked through the inference optimizations powering production-ready diffusion serving.
SGLang x OpenAnolis Meetup
MeetupSGLang x OpenAnolis Meetup in Beijing. Brought together frontline engineers to break down real, production-ready system designs.
SGLang x Modal x Qwen Meetup
MeetupMeetup with Modal, Qwen & SGLang. The evening explored how open tools deliver competitive performance and cost for LLM inference.
Dynamo Day
ConferenceOn NVIDIA AI’s Dynamo Day, our core developer, Baizhou, shared firsthand insights on large-scale AI inference with the Dynamo community.
SGLang x Ant OSS Meetup
MeetupMeetup with Ant Open Source. SGLang core member shared the latest progress and roadmap, and engaged in deep discussions on inference performance, system design, and real-world deployment.
SGLang Office Hours
Office HoursA technical deep dive into SGLang’s vision-language model capabilities, covering architecture and implementation details.
SGLang x OpenCloudOS x AMD Meetup
MeetupWith OpenCloudOS and AMD. Featured end-to-end system optimization, low-precision performance breakthroughs, and hands-on Triton kernel development.
SGLang x AtomGit Meetup
MeetupWith AtomGit. Talked about KV cache management, inference stalls during RL/agent weight updates, new demands of GLM/Mamba/MoE models and real-world performance on Ascend hardware
SGLang x Baidu Meetup
MeetupWith Baidu. Featured large-scale inference system optimization, distributed architectures, the latest SGLang ecosystem advances, plus an in-depth look at Baidu Baige’s inference acceleration for the DeepSeek V3 series.
SGLang Diffusion Workshop
WorkshopAt Tsinghua PACMAN Lab, we presented the SGLang Diffusion roadmap. We reused proven LLM infrastructure with a redesigned diffusion pipeline to cut inference latency by up to ~57% on H100/H200.
SGLang x AMD x Moreh AI Meetup
MeetupWith AMD and Moreh AI. Showcased SGLang’s core features, roadmap, and GPU optimizations, along with hands-on exploration of open-source LLM acceleration on AMD MI300X GPUs.
SGLang Diffusion Tutorial
WorkshopNew tutorial introducing SGLang Diffusion, covering high-performance image and video generation workflows with multi-GPU acceleration, CFG parallelism, and hands-on demos using CLI and Python APIs.
SGLang NeurIPS 2025 Meetup
ConferenceSGLang x Atlas Cloud NeurIPS 2025 happy hour
SGLang Workshop
WorkshopOn NV Developer Day, our core contributor introduced SGLang, covering our core features, key performance optimizations, real-world large-scale deployment lessons, and the future roadmap.
SGLang x Volcengine x NVIDIA Meetup
MeetupSGLang x Volcengine x NVIDIA meetup, discussed framework updates, stability optimization, distributed KVCache acceleration, VLM and MoE inference, and efficient deployment on DGX Spark.
EMNLP Happy Hour
ConferenceSGLang x Abaka AI co-hosted the EMNLP after-party, bringing together ML engineers, tech leads, and enthusiasts for conversations on AI infra, LLM agents, multimodality, and interpretability.
SGLang x Meituan x AWS Meetup
MeetupSGLang meetup with Meituan and AWS, brought together developers and researchers to discuss LLM inference, speculative decoding, quantization, and real-world SGLang deployments
SGLang x GOSIM Workshop
WorkshopSGLang x GOSIM Workshop, featuring deep-dive talks from core SGLang members on topics like efficient inference, PD disaggregation, and Ascend, alongside industry speakers from Qwen, NVIDIA, and Volcengine.
SGLang x AMD Meetup
MeetupSGLang & AMD Meetup in San Francisco. We had a hands-on GPU workshop and talks from AMD, xAI, and the SGLang team on roadmap, MoE scaling, and inference optimization.
