Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[VLM] Support apply qk norm in multi cuda streams Multi-modal multi-modal language model
#15720 opened Dec 24, 2025 by yuan-luo Loading…
6 tasks
[diffusion] refactor: unify the profiling api for all executors diffusion SGLang Diffusion run-ci
#15718 opened Dec 24, 2025 by mickqian Loading…
6 tasks
[Draft][DSV32] Dont merge now. Support PPxTP8CP8 deepseek documentation Improvements or additions to documentation npu
#15713 opened Dec 24, 2025 by whybeyoung Loading…
Add SwapAB Optimization for triton fused_moe_kernel on SM90.
#15712 opened Dec 24, 2025 by Insideyyy Loading…
2 of 6 tasks
[JIT sgl-kernel] Jit support per tensor quant quant LLM Quantization sgl-kernel
#15709 opened Dec 24, 2025 by BBuf Loading…
6 tasks
[CPU] optimize flash_attn_varlen_func cpu cpu backend performance optimization intel run-ci sgl-kernel
#15708 opened Dec 24, 2025 by mingfeima Loading…
6 tasks done
Tiny env cleanup in deepgemm deepseek run-ci
#15706 opened Dec 24, 2025 by vincentzed Loading…
6 tasks
Simplify server args run-ci
#15704 opened Dec 24, 2025 by merrymercy Loading…
ci: add continue-on-error for scheduled PR tests diffusion SGLang Diffusion run-ci
#15701 opened Dec 23, 2025 by alisonshao Loading…
3 tasks
feat(metrics): add prefix cache monitoring statistics
#15699 opened Dec 23, 2025 by maggie26375 Loading…
6 tasks
[AMD] Update wave-lang to 3.9.1 dependencies Pull requests that update a dependency file
#15697 opened Dec 23, 2025 by xintin Loading…
4 of 6 tasks
update benchmark README to use --fp8-gemm-backend instead of env var deepseek documentation Improvements or additions to documentation nvidia
#15689 opened Dec 23, 2025 by leejnau Loading…
6 tasks
[sgl-kernel] feat: simplify tree_speculative_sampling_target_only documentation Improvements or additions to documentation sgl-kernel speculative-decoding
#15687 opened Dec 23, 2025 by alphabetc1 Loading…
6 tasks
[diffusion] Add kernel for svdquant quant LLM Quantization sgl-kernel
#15681 opened Dec 23, 2025 by jianyingzhu Draft
2 of 6 tasks
[Model] Add Ernie4.5 VL model support
#15679 opened Dec 23, 2025 by CSWYF3634076 Loading…
6 tasks
Add auto bind numa node
#15678 opened Dec 23, 2025 by QiuMike Loading…
6 tasks
[Disaggregation] Validate TP size compatibility for non-MLA models
#15675 opened Dec 23, 2025 by chi2liu Loading…
6 tasks done
[AMD] Fix Indexer fp8_index_kernel with ROCm tilelang backend
#15673 opened Dec 23, 2025 by wufann Loading…
6 tasks
Improve tp*pp error message
#15669 opened Dec 23, 2025 by Monokaix Loading…
6 tasks done
[diffusion] Use sage attn as default backend for RTX5090 diffusion SGLang Diffusion
#15668 opened Dec 23, 2025 by ryang-max Draft
6 tasks
ProTip! Exclude everything labeled bug with -label:bug.