Popular repositories Loading
-
nanochat-rs-ternary
nanochat-rs-ternary PublicProduction-ready ternary quantized (1.58-bit) Rust code generation model with mHC-lite, MaxRL training, and comprehensive benchmarking
Rust 3
-
sglang_attentio
sglang_attentio PublicWe added top-k attention token visualization, exposing which tokens the model attends to most during decode, useful for interpretability research for moe hybrid mamba arch
-
sglang
sglang PublicForked from sgl-project/sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
Python
-
30b-moe-benchmarks
30b-moe-benchmarks PublicComprehensive benchmarks for 30B MoE models on RTX 4090. Key finding: HumanEval ≠ SWE-bench - the 98% HumanEval model scores 0% on agentic tasks. Includes Nemotron, GLM-4.7-Flash, Qwen3-Coder compa…
Python
-
If the problem persists, check the GitHub status page or contact support.


