Pull requests: sgl-project/sglang
Pull requests list
[VLM] Support apply qk norm in multi cuda streams
Multi-modal
multi-modal language model
#15720
opened Dec 24, 2025 by
yuan-luo
6 tasks
[diffusion] refactor: unify the profiling api for all executors
diffusion
SGLang Diffusion
run-ci
#15718
opened Dec 24, 2025 by
mickqian
6 tasks
[Draft][DSV32] Dont merge now. Support PPxTP8CP8
deepseek
documentation
Improvements or additions to documentation
npu
#15713
opened Dec 24, 2025 by
whybeyoung
Add SwapAB Optimization for triton fused_moe_kernel on SM90.
#15712
opened Dec 24, 2025 by
Insideyyy
2 of 6 tasks
[JIT sgl-kernel] Jit support per tensor quant
quant
LLM Quantization
sgl-kernel
#15709
opened Dec 24, 2025 by
BBuf
6 tasks
[CPU] optimize flash_attn_varlen_func
cpu
cpu backend performance optimization
intel
run-ci
sgl-kernel
#15708
opened Dec 24, 2025 by
mingfeima
6 tasks done
Tiny env cleanup in deepgemm
deepseek
run-ci
#15706
opened Dec 24, 2025 by
vincentzed
6 tasks
ci: add continue-on-error for scheduled PR tests
diffusion
SGLang Diffusion
run-ci
#15701
opened Dec 23, 2025 by
alisonshao
3 tasks
[Auto Sync] Update server_args.py (20251223)
run-ci
#15700
opened Dec 23, 2025 by
merrymercy
feat(metrics): add prefix cache monitoring statistics
#15699
opened Dec 23, 2025 by
maggie26375
6 tasks
[AMD] Update wave-lang to 3.9.1
dependencies
Pull requests that update a dependency file
#15697
opened Dec 23, 2025 by
xintin
4 of 6 tasks
SGLang Tracing: fix attribute errors (header extraction & bootstrap span closing)
#15693
opened Dec 23, 2025 by
vladnosiv
update benchmark README to use --fp8-gemm-backend instead of env var
deepseek
documentation
Improvements or additions to documentation
nvidia
#15689
opened Dec 23, 2025 by
leejnau
6 tasks
[sgl-kernel] feat: simplify tree_speculative_sampling_target_only
documentation
Improvements or additions to documentation
sgl-kernel
speculative-decoding
#15687
opened Dec 23, 2025 by
alphabetc1
6 tasks
[sgl-model-gateway] Add EPD (Encode-Prefill-Decode) Disaggregated Routing Support
high priority
model-gateway
run-ci
#15683
opened Dec 23, 2025 by
chenzongyao200127
2 of 6 tasks
Fix torch.__version__ for PEP440
piecewise-cuda-graph
#15682
opened Dec 23, 2025 by
EduardDurech
[diffusion] Add kernel for svdquant
quant
LLM Quantization
sgl-kernel
#15681
opened Dec 23, 2025 by
jianyingzhu
Draft
2 of 6 tasks
Fix the problem where Qwen3VL raises an "object has no attribute 'mod…
run-ci
#15677
opened Dec 23, 2025 by
ShirakSyouya
[Disaggregation] Validate TP size compatibility for non-MLA models
#15675
opened Dec 23, 2025 by
chi2liu
6 tasks done
[AMD] Fix Indexer fp8_index_kernel with ROCm tilelang backend
#15673
opened Dec 23, 2025 by
wufann
6 tasks