-
Notifications
You must be signed in to change notification settings - Fork 13.6k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
cmake: add option to build and link BoringSSL
build
Compilation issues
#17062
opened Nov 6, 2025 by
angt
Loading…
[WIP] s390x ci: debug build issue
devops
improvements to build systems and github actions
#17053
opened Nov 6, 2025 by
AlekseiNikiforovIBM
Loading…
cuda: extended MMF_ROWS_PER_BLOCK
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17051
opened Nov 6, 2025 by
zhang-hui-yulo
Loading…
fix : Dangling pointer for non-empty trigger words in lazy grammar construction
#17048
opened Nov 6, 2025 by
marek-hradil
Loading…
kv-cache : pad the size of the small SWA cache for performance
#17046
opened Nov 6, 2025 by
ggerganov
Loading…
Add MoE dynamic routing with expert caching
build
Compilation issues
documentation
Improvements or additions to documentation
examples
#17044
opened Nov 6, 2025 by
jmangold23
•
Draft
ggml-hexagon: fix changes relating to the ggml tensor library for machine learning
test-backend-ops failures on specific binary ops
ggml
CUDA: only use moe_expert_reduce when n_tokens=1
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17032
opened Nov 5, 2025 by
am17an
Loading…
ggml webgpu: faster matrix multiplication/matrix-vector multiplication
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#17031
opened Nov 5, 2025 by
reeselevine
Loading…
ggml-cpu: handle 3d tensors in repack mat_mul
ggml
changes relating to the ggml tensor library for machine learning
#17030
opened Nov 5, 2025 by
Alcpz
Loading…
tests(test-backend-ops): Test backend ops verbosity
testing
Everything test related
#17029
opened Nov 5, 2025 by
gabe-l-hart
Loading…
examples(eval-callback): Eval callback verbosity
examples
#17028
opened Nov 5, 2025 by
gabe-l-hart
Loading…
vulkan: Fix test-thread-safety crashes
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17024
opened Nov 5, 2025 by
jeffbolznv
Loading…
cuda/vulkan : bicubic interpolation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
OpenCL
Issues specific to the OpenCL backend
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#17022
opened Nov 5, 2025 by
Acly
Loading…
webui: fix keyboard shortcuts for new chat & edit chat title
examples
server
#17007
opened Nov 4, 2025 by
chansikpark
Loading…
sampling : add support for GPU sampling (wip)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
Q4/Q8 Tiled Gemm Optimization.
ggml
changes relating to the ggml tensor library for machine learning
#16999
opened Nov 4, 2025 by
shalinib-ibm
Loading…
kleidiai: add optimized per-channel kernels for Q8_0
ggml
changes relating to the ggml tensor library for machine learning
#16993
opened Nov 4, 2025 by
chaxu01
Loading…
CUDA: add stream-based concurrency
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.