Events

Connect with the community, attend workshops, and join our weekly office hours. Check out our events on Luma

Upcoming Events

Invited Talk
April 30, 2026
12:00 AM PST

AMD Dev Day 2026

Building Next-Gen AI Infrastructure: Scaling Enterprise LLM Serving and Training with RadixArk

San Francisco
Invited Talk
April 30, 2026
5:00 PM PST

AMD Dev Day Meetup with dstack and Crusoe

dstack and Crusoe are hosting an AI infra & open-source models meetup after AMD Dev Day, bringing together GPU experts, AI researchers, and AI infra experts

San Francisco
Recurring
April 30, 2026
7:00 PM PST

Core-Dev Meeting

This weekly meeting brings together core developers to review major features, urgent issues, and the project roadmap. All contributors are welcome to participate and submit proposals.

Google Meet
Meetup
May 6, 2026
12:00 AM PST

Dynamo After Hours

Join us for a high-signal night with the NVIDIA developer community—built for developers pushing the limits of AI systems, inference, and scale. We will be speaking about Agentic inference with SGLang

San Francisco
Workshop
May 6, 2026
12:00 AM PST

SGLang Workshop @ GOSIM Paris 2026

We're running live labs so developers, builders, and researchers in Paris can get hands-on with LLM and diffusion inference on SGLang, plus a deep dive into end-to-end RL training with the Miles project

Paris, France
Recurring
May 6, 2026
6:00 PM PST

SGLang Office Hours

We invite you to join the core team for our bi-weekly office hours.

StreamYard
Conference
May 18, 2026
12:00 AM PST

MLSys 2026 Happy Hour

Seattle, WA
Conference
June 3, 2026
12:00 AM PST

CVPR 2026 Happy Hour

Denver, CO
Conference
June 23, 2026
9:00 AM PST

SGLang & LMSYS Summit 2026

With SGLang now powering inference across major labs and hyperscalers, we're bringing together 1,500+ top-tier practitioners to shape the future of open-source AI infra. We’re opening with speakers like Lip-Bu Tan (Intel CEO), Logan Kilpatrick (Google DeepMind), Bill Jia (Google), and more!

San Francisco

Past Events

SGLang Office Hours with Ray

Office Hours
April 22, 2026

Scaling LLM Serving with Ray and SGLang. We co-hosted with Anyscale to dig into how Ray powers large-scale LLM serving with SGLang.

Ollama x SGLang Gemma Day

Invited Talk
April 15, 2026

Ollama hosted a meetup with Google DeepMind's Gemma team in Palo Alto, and we were invited to do a live demo of deploying Gemma 4 using SGLang.

SGLang Office Hours on Gemma 4

Office Hours
April 8, 2026

Let's Talk Gemma 4: Architecture & How to Run It on SGLang. Khoa Pham, SGLang developer, broke down Gemma 4's architecture and how to run it with SGLang. We walked through how the model works, cover what it takes to support it in SGLang, share deployment tips across GB200, GB300, and AMD.

HumanX 2026 After Hours with SGLang

Meetup
April 7, 2026

With SGLang, Qwen, MuleRun, Novita AI, Modal, and Doubleword covering everything from frontier models to inference, agents, and production infra.

San FranciscoRecap

SGLang Training Lab

Workshop
March 19, 2026

This training lab demonstrated how to optimize and scale LLM workflows with SGLang. Walked through practical performance tuning using the SGL-Cookbook, talked about profiling and bottleneck analysis, and demonstrated deep integration with Miles RL framework.

LinkedIn x SGLang Meetup

Meetup
March 18, 2026

LinkedIn x SGLang Meetup: Discover the Future of LLMs for Search and Recommendation,with deep discussion into how large-scale LLM systems are built, optimized, and deployed in real-world production environments.

Mountain View, CaliforniaRecapSlides

GTC 2026 Happy Hours

Conference
March 17, 2026

SGLang x RadixArk. With great food, unlimited drinks, and a relaxed evening with some of the brightest minds in model, inference, and RL infrastructure.

San JoseRecapSlides

SGLang x Alibaba Cloud x NVIDIA x Qwen

Meetup
March 7, 2026

Featuring core members from SGLang, Qwen, Alibaba Cloud's Tair KVCache team, and partners like NVIDIA and Mooncake, covering topics from SGLang's roadmap to low-latency LLM inference optimization and KVCache storage breakthroughs.

ShanghaiSlides

SGLang Office Hours

Office Hours
February 25, 2026

OH on SGLang-Diffusion.CloudRipple from OpenMOSS shared how they brought MoVA model support to SGLang-Diffusion. Xingyu from Skywork AI walked through the inference optimizations powering production-ready diffusion serving.

NVIDIA Developer Community Meetup

Meetup
February 19, 2026

Our core dev speaker Qiaolin Yu shared SGLang’s key priorities, upcoming initiatives, and milestones for the quarter.

San FranciscoRecapSlides

SGLang x Modal Office Hours

Office Hours
February 11, 2026

SGLang x Modal Office Hours: Deploying Big MoE Models, From Zero to Serving

Google MeetRecordingRecap

SGLang x OpenAnolis Meetup

Meetup
January 31, 2026

SGLang x OpenAnolis Meetup in Beijing. Brought together frontline engineers to break down real, production-ready system designs.

BeijingRecap

SGLang x Modal x Qwen Meetup

Meetup
January 29, 2026

Meetup with Modal, Qwen & SGLang. The evening explored how open tools deliver competitive performance and cost for LLM inference.

San FranciscoRecap

Dynamo Day

Conference
January 22, 2026

On NVIDIA AI’s Dynamo Day, our core developer, Baizhou, shared firsthand insights on large-scale AI inference with the Dynamo community.

Virtual EventRecording

SGLang x Ant OSS Meetup

Meetup
January 17, 2026

Meetup with Ant Open Source. SGLang core member shared the latest progress and roadmap, and engaged in deep discussions on inference performance, system design, and real-world deployment.

HangzhouSlides

SGLang Office Hours

Office Hours
December 29, 2025

A technical deep dive into SGLang’s vision-language model capabilities, covering architecture and implementation details.

Virtual EventSlides

SGLang x OpenCloudOS x AMD Meetup

Meetup
December 20, 2025

With OpenCloudOS and AMD. Featured end-to-end system optimization, low-precision performance breakthroughs, and hands-on Triton kernel development.

ShenzhenRecap

SGLang x AtomGit Meetup

Meetup
December 20, 2025

With AtomGit. Talked about KV cache management, inference stalls during RL/agent weight updates, new demands of GLM/Mamba/MoE models and real-world performance on Ascend hardware

HangzhouRecap

SGLang x Baidu Meetup

Meetup
December 14, 2025

With Baidu. Featured large-scale inference system optimization, distributed architectures, the latest SGLang ecosystem advances, plus an in-depth look at Baidu Baige’s inference acceleration for the DeepSeek V3 series.

BeijingRecap

SGLang Diffusion Workshop

Workshop
December 14, 2025

At Tsinghua PACMAN Lab, we presented the SGLang Diffusion roadmap. We reused proven LLM infrastructure with a redesigned diffusion pipeline to cut inference latency by up to ~57% on H100/H200.

BeijingSlides

SGLang x AMD x Moreh AI Meetup

Meetup
December 13, 2025

With AMD and Moreh AI. Showcased SGLang’s core features, roadmap, and GPU optimizations, along with hands-on exploration of open-source LLM acceleration on AMD MI300X GPUs.

SeoulRecap

SGLang Diffusion Tutorial

Workshop
December 12, 2025

New tutorial introducing SGLang Diffusion, covering high-performance image and video generation workflows with multi-GPU acceleration, CFG parallelism, and hands-on demos using CLI and Python APIs.

Virtual EventRecording

SGLang NeurIPS 2025 Meetup

Conference
December 4, 2025

SGLang x Atlas Cloud NeurIPS 2025 happy hour

San DiegoRecap

SGLang Workshop

Workshop
November 14, 2025

On NV Developer Day, our core contributor introduced SGLang, covering our core features, key performance optimizations, real-world large-scale deployment lessons, and the future roadmap.

SuzhouSlides

SGLang x Volcengine x NVIDIA Meetup

Meetup
November 7, 2025

SGLang x Volcengine x NVIDIA meetup, discussed framework updates, stability optimization, distributed KVCache acceleration, VLM and MoE inference, and efficient deployment on DGX Spark.

ShanghaiRecap

EMNLP Happy Hour

Conference
November 5, 2025

SGLang x Abaka AI co-hosted the EMNLP after-party, bringing together ML engineers, tech leads, and enthusiasts for conversations on AI infra, LLM agents, multimodality, and interpretability.

SuzhouRecap

SGLang x Meituan x AWS Meetup

Meetup
October 25, 2025

SGLang meetup with Meituan and AWS, brought together developers and researchers to discuss LLM inference, speculative decoding, quantization, and real-world SGLang deployments

BeijingRecap

SGLang x NVIDIA Meetup

Meetup
October 2, 2025

SGLang x NVIDIA SF Meetup had incredible talks and discussions on LLM inference acceleration, distributed compute, and open infra, featuring amazing speakers from across the AI infra community.

San FranciscoRecordingRecap

SGLang x GOSIM Workshop

Workshop
September 14, 2025

SGLang x GOSIM Workshop, featuring deep-dive talks from core SGLang members on topics like efficient inference, PD disaggregation, and Ascend, alongside industry speakers from Qwen, NVIDIA, and Volcengine.

HangzhouRecap

SGLang x AMD Meetup

Meetup
August 22, 2025

SGLang & AMD Meetup in San Francisco. We had a hands-on GPU workshop and talks from AMD, xAI, and the SGLang team on roadmap, MoE scaling, and inference optimization.

San FranciscoRecap