Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

vulkan: optimized coopmat matmul perf for IntelGPU
#19320 opened Feb 4, 2026 by fish-jiang Loading…
gguf-py: Bump sentencepiece version
#19319 opened Feb 4, 2026 by Ahajha Loading…
cleanup llama-quantize --help output examples
#19317 opened Feb 4, 2026 by ddh0 Loading…
[WebGPU] Plug memory leaks and free resources on shutdown ggml changes relating to the ggml tensor library for machine learning
#19315 opened Feb 4, 2026 by nikhilJain17 Draft
chore: update cpp-httplib version python python script changes script Script related
#19313 opened Feb 4, 2026 by taronaeo Loading…
ggml-webgpu: JIT compile binary operators and handle binding overlaps ggml changes relating to the ggml tensor library for machine learning
#19310 opened Feb 4, 2026 by abhijitramesh Loading…
vulkan: make FA mask/softcap enables spec constants ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#19309 opened Feb 3, 2026 by jeffbolznv Loading…
sycl: add F16 support for GGML_OP_CEIL documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#19306 opened Feb 3, 2026 by NechamaKrashinski Loading…
vulkan: Set k_load_shmem to false when K is too large ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#19301 opened Feb 3, 2026 by jeffbolznv Loading…
vulkan: fix non-contig rope ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#19299 opened Feb 3, 2026 by jeffbolznv Loading…
tests : add non-cont, inplace rope tests testing Everything test related
#19296 opened Feb 3, 2026 by ggerganov Loading…
ci : add metal server workflows devops improvements to build systems and github actions
#19293 opened Feb 3, 2026 by ggerganov Draft
1 task
CANN: Multi-stream support Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#19284 opened Feb 3, 2026 by hipudding Draft
Support Step3.5-Flash model Model specific python python script changes
#19283 opened Feb 3, 2026 by forforever73 Loading…
[WIP] ggml-hexagon: convert f32 to f16 - fa opt part3 ggml changes relating to the ggml tensor library for machine learning
#19282 opened Feb 3, 2026 by chraac Draft
vulkan: Preprocess FA mask to detect all-neg-inf and all-zero. ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#19281 opened Feb 3, 2026 by jeffbolznv Loading…
Add test for vk_buffer from host memory ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#19254 opened Feb 1, 2026 by sredman Loading…
[SYCL] fix segmentation fault on consumer CPUs without bfloat16 hardware ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#19247 opened Feb 1, 2026 by sajonoso Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.