Events
Connect with the community, attend workshops, and join our weekly office hours. Check out our events on Luma
Upcoming Events
SGLang Office Hours
We invite you to join the core team for our bi-weekly office hours.
Core-Dev Meeting
This weekly meeting brings together core developers to review major features, urgent issues, and the project roadmap. All contributors are welcome to participate and submit proposals.
Past Events
Ant Tech Day
Invited TalksAnt tech day with Ant Open Source and Inclusion AI discussed the infrastructure behind Agentic AI. Invited to talk about Miles framework and it's role in Agentic AI.
MLSys 2026 Happy Hour Session 2
MeetupCelebrated MLSys with dinner + drinks at a restaurant near the MLSys conference venue in Bellevue!Co-hosted by RadixArk, SGLang, Essence, and Delta Institute
MLSys 2026 Happy Hour Session 1
ConferenceWe co-hosted with Ai2 a happy hour for the open AI community during MLSys 2026 in Bellevue, sponsored by Crusoe and Doubleword, bringing together folks working on open models, infra, hardware, and applications.
Dynamo After Hours
MeetupA high-signal night with the NVIDIA developer community—built for developers pushing the limits of AI systems, inference, and scale. We will be speaking about Agentic inference with SGLang
SGLang Workshop @ GOSIM Paris 2026
WorkshopRan live labs for developers, builders, and researchers in Paris and to get hands-on with LLM and diffusion inference on SGLang, plus a deep dive into end-to-end RL training with the Miles project
AMD Dev Day 2026
Invited TalksBuilding Next-Gen AI Infrastructure: Scaling Enterprise LLM Serving and Training with RadixArk
SGLang Office Hours on Gemma 4
Office HoursLet's Talk Gemma 4: Architecture & How to Run It on SGLang. Khoa Pham, SGLang developer, broke down Gemma 4's architecture and how to run it with SGLang. We walked through how the model works, cover what it takes to support it in SGLang, share deployment tips across GB200, GB300, and AMD.
HumanX 2026 After Hours with SGLang
MeetupWith SGLang, Qwen, MuleRun, Novita AI, Modal, and Doubleword covering everything from frontier models to inference, agents, and production infra.
SGLang Training Lab
WorkshopThis training lab demonstrated how to optimize and scale LLM workflows with SGLang. Walked through practical performance tuning using the SGL-Cookbook, talked about profiling and bottleneck analysis, and demonstrated deep integration with Miles RL framework.
SGLang x Alibaba Cloud x NVIDIA x Qwen
MeetupFeaturing core members from SGLang, Qwen, Alibaba Cloud's Tair KVCache team, and partners like NVIDIA and Mooncake, covering topics from SGLang's roadmap to low-latency LLM inference optimization and KVCache storage breakthroughs.
SGLang Office Hours
Office HoursOH on SGLang-Diffusion.CloudRipple from OpenMOSS shared how they brought MoVA model support to SGLang-Diffusion. Xingyu from Skywork AI walked through the inference optimizations powering production-ready diffusion serving.
SGLang x OpenAnolis Meetup
MeetupSGLang x OpenAnolis Meetup in Beijing. Brought together frontline engineers to break down real, production-ready system designs.
SGLang x Modal x Qwen Meetup
MeetupMeetup with Modal, Qwen & SGLang. The evening explored how open tools deliver competitive performance and cost for LLM inference.
Dynamo Day
ConferenceOn NVIDIA AI’s Dynamo Day, our core developer, Baizhou, shared firsthand insights on large-scale AI inference with the Dynamo community.
SGLang x Ant OSS Meetup
MeetupMeetup with Ant Open Source. SGLang core member shared the latest progress and roadmap, and engaged in deep discussions on inference performance, system design, and real-world deployment.
SGLang Office Hours
Office HoursA technical deep dive into SGLang’s vision-language model capabilities, covering architecture and implementation details.
SGLang x OpenCloudOS x AMD Meetup
MeetupWith OpenCloudOS and AMD. Featured end-to-end system optimization, low-precision performance breakthroughs, and hands-on Triton kernel development.
SGLang x AtomGit Meetup
MeetupWith AtomGit. Talked about KV cache management, inference stalls during RL/agent weight updates, new demands of GLM/Mamba/MoE models and real-world performance on Ascend hardware
SGLang x Baidu Meetup
MeetupWith Baidu. Featured large-scale inference system optimization, distributed architectures, the latest SGLang ecosystem advances, plus an in-depth look at Baidu Baige’s inference acceleration for the DeepSeek V3 series.
SGLang Diffusion Workshop
WorkshopAt Tsinghua PACMAN Lab, we presented the SGLang Diffusion roadmap. We reused proven LLM infrastructure with a redesigned diffusion pipeline to cut inference latency by up to ~57% on H100/H200.
SGLang x AMD x Moreh AI Meetup
MeetupWith AMD and Moreh AI. Showcased SGLang’s core features, roadmap, and GPU optimizations, along with hands-on exploration of open-source LLM acceleration on AMD MI300X GPUs.
SGLang Diffusion Tutorial
WorkshopNew tutorial introducing SGLang Diffusion, covering high-performance image and video generation workflows with multi-GPU acceleration, CFG parallelism, and hands-on demos using CLI and Python APIs.
SGLang NeurIPS 2025 Meetup
ConferenceSGLang x Atlas Cloud NeurIPS 2025 happy hour
SGLang Workshop
WorkshopOn NV Developer Day, our core contributor introduced SGLang, covering our core features, key performance optimizations, real-world large-scale deployment lessons, and the future roadmap.
SGLang x Volcengine x NVIDIA Meetup
MeetupSGLang x Volcengine x NVIDIA meetup, discussed framework updates, stability optimization, distributed KVCache acceleration, VLM and MoE inference, and efficient deployment on DGX Spark.
EMNLP Happy Hour
ConferenceSGLang x Abaka AI co-hosted the EMNLP after-party, bringing together ML engineers, tech leads, and enthusiasts for conversations on AI infra, LLM agents, multimodality, and interpretability.
SGLang x Meituan x AWS Meetup
MeetupSGLang meetup with Meituan and AWS, brought together developers and researchers to discuss LLM inference, speculative decoding, quantization, and real-world SGLang deployments
SGLang x GOSIM Workshop
WorkshopSGLang x GOSIM Workshop, featuring deep-dive talks from core SGLang members on topics like efficient inference, PD disaggregation, and Ascend, alongside industry speakers from Qwen, NVIDIA, and Volcengine.
SGLang x AMD Meetup
MeetupSGLang & AMD Meetup in San Francisco. We had a hands-on GPU workshop and talks from AMD, xAI, and the SGLang team on roadmap, MoE scaling, and inference optimization.
