Events

Connect with the community, attend workshops, and join our weekly office hours. Check out our events on Luma

Upcoming Events

Recurring

July 1, 2026

6:00 PM PST

SGLang Office Hours

We invite you to join the core team for our bi-weekly office hours.

StreamYard

Recurring

July 2, 2026

7:00 PM PST

Core-Dev Meeting

This weekly meeting brings together core developers to review major features, urgent issues, and the project roadmap. All contributors are welcome to participate and submit proposals.

Google Meet

Past Events

CVPR 2026 Happy Hour

Conference

June 4, 2026

The unofficial CVPR 2026 afterparty, co-hosted with Philo Labs, Reactor, and Crusoe! Themed around multimodal RL: reinforcement learning meeting VLMs, world models, and any-to-any (text/image/video/audio) generation.

Denver, CORecap

NY Tech Week Happy Hour

Meetup

June 3, 2026

SGLang, HOF Capital, Crusoe, Cloudflare, and Arklex AI had co-hosted for an evening of lightning talks and conversation with the people building inference systems for finance

New YorkRecap

Ant Tech Day

Invited Talks

May 23, 2026

Ant tech day with Ant Open Source and Inclusion AI discussed the infrastructure behind Agentic AI. Invited to talk about Miles framework and it's role in Agentic AI.

HangzhouRecap

MLSys 2026 Happy Hour Session 2

Meetup

May 19, 2026

Celebrated MLSys with dinner + drinks at a restaurant near the MLSys conference venue in Bellevue!Co-hosted by RadixArk, SGLang, Essence, and Delta Institute

Seattle, WA

MLSys 2026 Happy Hour Session 1

Conference

May 18, 2026

We co-hosted with Ai2 a happy hour for the open AI community during MLSys 2026 in Bellevue, sponsored by Crusoe and Doubleword, bringing together folks working on open models, infra, hardware, and applications.

Seattle, WARecap

Dynamo After Hours

Meetup

May 6, 2026

A high-signal night with the NVIDIA developer community—built for developers pushing the limits of AI systems, inference, and scale. We will be speaking about Agentic inference with SGLang

San FranciscoRecap

SGLang Workshop @ GOSIM Paris 2026

Workshop

May 6, 2026

Ran live labs for developers, builders, and researchers in Paris and to get hands-on with LLM and diffusion inference on SGLang, plus a deep dive into end-to-end RL training with the Miles project

Paris, FranceRecap

AMD Dev Day Meetup with dstack and Crusoe

Invited Talks

April 30, 2026

dstack and Crusoe are hosting an AI infra & open-source models meetup after AMD Dev Day, bringing together GPU experts, AI researchers, and AI infra experts

San FranciscoRecap Slides

AMD Dev Day 2026

Invited Talks

April 30, 2026

Building Next-Gen AI Infrastructure: Scaling Enterprise LLM Serving and Training with RadixArk

San FranciscoRecap

SGLang Office Hours with Ray

Office Hours

April 22, 2026

Scaling LLM Serving with Ray and SGLang. We co-hosted with Anyscale to dig into how Ray powers large-scale LLM serving with SGLang.

StreamYardRecording Recap Slides

Ollama x SGLang Gemma Day

Invited Talk

April 15, 2026

Ollama hosted a meetup with Google DeepMind's Gemma team in Palo Alto, and we were invited to do a live demo of deploying Gemma 4 using SGLang.

Palo AltoRecording Recap

SGLang Office Hours on Gemma 4

Office Hours

April 8, 2026

Let's Talk Gemma 4: Architecture & How to Run It on SGLang. Khoa Pham, SGLang developer, broke down Gemma 4's architecture and how to run it with SGLang. We walked through how the model works, cover what it takes to support it in SGLang, share deployment tips across GB200, GB300, and AMD.

StreamYardRecording Recap Slides

HumanX 2026 After Hours with SGLang

Meetup

April 7, 2026

With SGLang, Qwen, MuleRun, Novita AI, Modal, and Doubleword covering everything from frontier models to inference, agents, and production infra.

San FranciscoRecap

SGLang Training Lab

Workshop

March 19, 2026

This training lab demonstrated how to optimize and scale LLM workflows with SGLang. Walked through practical performance tuning using the SGL-Cookbook, talked about profiling and bottleneck analysis, and demonstrated deep integration with Miles RL framework.

San JoseRecording Recap Slides

LinkedIn x SGLang Meetup

Meetup

March 18, 2026

LinkedIn x SGLang Meetup: Discover the Future of LLMs for Search and Recommendation,with deep discussion into how large-scale LLM systems are built, optimized, and deployed in real-world production environments.

Mountain View, CaliforniaRecap Slides

GTC 2026 Happy Hours

Conference

March 17, 2026

SGLang x RadixArk. With great food, unlimited drinks, and a relaxed evening with some of the brightest minds in model, inference, and RL infrastructure.

San JoseRecap Slides

SGLang x Alibaba Cloud x NVIDIA x Qwen

Meetup

March 7, 2026

Featuring core members from SGLang, Qwen, Alibaba Cloud's Tair KVCache team, and partners like NVIDIA and Mooncake, covering topics from SGLang's roadmap to low-latency LLM inference optimization and KVCache storage breakthroughs.

ShanghaiSlides

SGLang Office Hours

Office Hours

February 25, 2026

OH on SGLang-Diffusion.CloudRipple from OpenMOSS shared how they brought MoVA model support to SGLang-Diffusion. Xingyu from Skywork AI walked through the inference optimizations powering production-ready diffusion serving.

StreamYardRecording Recap Slides

NVIDIA Developer Community Meetup

Meetup

February 19, 2026

Our core dev speaker Qiaolin Yu shared SGLang’s key priorities, upcoming initiatives, and milestones for the quarter.

San FranciscoRecap Slides

SGLang x Modal Office Hours

Office Hours

February 11, 2026

SGLang x Modal Office Hours: Deploying Big MoE Models, From Zero to Serving

Google MeetRecording Recap

SGLang x OpenAnolis Meetup

Meetup

January 31, 2026

SGLang x OpenAnolis Meetup in Beijing. Brought together frontline engineers to break down real, production-ready system designs.

BeijingRecap

SGLang x Modal x Qwen Meetup

Meetup

January 29, 2026

Meetup with Modal, Qwen & SGLang. The evening explored how open tools deliver competitive performance and cost for LLM inference.

San FranciscoRecap

Dynamo Day

Conference

January 22, 2026

On NVIDIA AI’s Dynamo Day, our core developer, Baizhou, shared firsthand insights on large-scale AI inference with the Dynamo community.

Virtual EventRecording

SGLang x Ant OSS Meetup

Meetup

January 17, 2026

Meetup with Ant Open Source. SGLang core member shared the latest progress and roadmap, and engaged in deep discussions on inference performance, system design, and real-world deployment.

HangzhouSlides

SGLang Office Hours

Office Hours

December 29, 2025

A technical deep dive into SGLang’s vision-language model capabilities, covering architecture and implementation details.

Virtual EventSlides

SGLang x OpenCloudOS x AMD Meetup

Meetup

December 20, 2025

With OpenCloudOS and AMD. Featured end-to-end system optimization, low-precision performance breakthroughs, and hands-on Triton kernel development.

ShenzhenRecap

SGLang x AtomGit Meetup

Meetup

December 20, 2025

With AtomGit. Talked about KV cache management, inference stalls during RL/agent weight updates, new demands of GLM/Mamba/MoE models and real-world performance on Ascend hardware

HangzhouRecap

SGLang x Baidu Meetup

Meetup

December 14, 2025

With Baidu. Featured large-scale inference system optimization, distributed architectures, the latest SGLang ecosystem advances, plus an in-depth look at Baidu Baige’s inference acceleration for the DeepSeek V3 series.

BeijingRecap

SGLang Diffusion Workshop

Workshop

December 14, 2025

At Tsinghua PACMAN Lab, we presented the SGLang Diffusion roadmap. We reused proven LLM infrastructure with a redesigned diffusion pipeline to cut inference latency by up to ~57% on H100/H200.

BeijingSlides

SGLang x AMD x Moreh AI Meetup

Meetup

December 13, 2025

With AMD and Moreh AI. Showcased SGLang’s core features, roadmap, and GPU optimizations, along with hands-on exploration of open-source LLM acceleration on AMD MI300X GPUs.

SeoulRecap

SGLang Diffusion Tutorial

Workshop

December 12, 2025

New tutorial introducing SGLang Diffusion, covering high-performance image and video generation workflows with multi-GPU acceleration, CFG parallelism, and hands-on demos using CLI and Python APIs.

Virtual EventRecording

SGLang NeurIPS 2025 Meetup

Conference

December 4, 2025

SGLang x Atlas Cloud NeurIPS 2025 happy hour

San DiegoRecap

SGLang Workshop

Workshop

November 14, 2025

On NV Developer Day, our core contributor introduced SGLang, covering our core features, key performance optimizations, real-world large-scale deployment lessons, and the future roadmap.

SuzhouSlides

SGLang x Volcengine x NVIDIA Meetup

Meetup

November 7, 2025

SGLang x Volcengine x NVIDIA meetup, discussed framework updates, stability optimization, distributed KVCache acceleration, VLM and MoE inference, and efficient deployment on DGX Spark.

ShanghaiRecap

EMNLP Happy Hour

Conference

November 5, 2025

SGLang x Abaka AI co-hosted the EMNLP after-party, bringing together ML engineers, tech leads, and enthusiasts for conversations on AI infra, LLM agents, multimodality, and interpretability.

SuzhouRecap

SGLang x Meituan x AWS Meetup

Meetup

October 25, 2025

SGLang meetup with Meituan and AWS, brought together developers and researchers to discuss LLM inference, speculative decoding, quantization, and real-world SGLang deployments

BeijingRecap

SGLang x NVIDIA Meetup

Meetup

October 2, 2025

SGLang x NVIDIA SF Meetup had incredible talks and discussions on LLM inference acceleration, distributed compute, and open infra, featuring amazing speakers from across the AI infra community.

San FranciscoRecording Recap

SGLang x GOSIM Workshop

Workshop

September 14, 2025

SGLang x GOSIM Workshop, featuring deep-dive talks from core SGLang members on topics like efficient inference, PD disaggregation, and Ascend, alongside industry speakers from Qwen, NVIDIA, and Volcengine.

HangzhouRecap

SGLang x AMD Meetup

Meetup

August 22, 2025

SGLang & AMD Meetup in San Francisco. We had a hands-on GPU workshop and talks from AMD, xAI, and the SGLang team on roadmap, MoE scaling, and inference optimization.

San FranciscoRecap