Pull requests: ggml-org/llama.cpp
Pull requests list
kv-cache: Fix state restore fragmented cache (labels: testing)
#17982 opened Dec 13, 2025 by ssweens
webui: fix chat header width when sidebar is closed (labels: examples, server)
#17981 opened Dec 13, 2025 by polydecay
ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (labels: ggml)
#17977 opened Dec 12, 2025 by ngdxzy
Build CUDA architectures 120 and 121 by default (RTX5000 and GB10) (labels: ggml, Nvidia GPU)
#17970 opened Dec 12, 2025 by DaAwesomeP
mtmd, llama: add GLM4V vision-language model support (labels: examples, ggml, model, Nvidia GPU, python)
#17967 opened Dec 12, 2025 by eelbaz
server: add encoder-decoder model support (T5, BART, MADLAD) (labels: examples, server)
#17956 opened Dec 12, 2025 by Turee
vulkan: Add perf logger mode with concurrency (labels: ggml, Vulkan)
#17944 opened Dec 11, 2025 by jeffbolznv
vulkan: support get_rows for i32 (labels: ggml, Vulkan)
#17941 opened Dec 11, 2025 by jeffbolznv
CANN: CONV_TRANSPOSE_1D operator: supporting the cases where (op->src[0]->ne[0] - 1) > 255 (labels: Ascend NPU, ggml)
#17934 opened Dec 11, 2025 by Intellouis
ggml-hexagon: gelu operation (labels: ggml)
#17921 opened Dec 10, 2025 by joeldushouyu • Draft
Restore clip's cb() to its rightful glory - extract common debugging elements in llama (labels: examples)
#17914 opened Dec 10, 2025 by pwilkin
Make LlamaData utility functions static in llama-run (labels: examples)
#17913 opened Dec 10, 2025 by rauletorresc
server: fix crash when batch > ubatch with embeddings (#12836) (labels: examples, server)
#17912 opened Dec 10, 2025 by yifant-code
CUDA: experimental native mxfp4 support for blackwell [WIP] (labels: ggml, Nvidia GPU)
ggml: correct inaccurate comments for GGML_OP_MUL_MAT backward pass [no ci] (labels: ggml)
#17899 opened Dec 10, 2025 by csmyx
ggml-hexagon: mm for mtmd (labels: ggml, script)
#17894 opened Dec 9, 2025 by joeldushouyu
vulkan: support GGML_OP_DIAG (labels: ggml, Vulkan)
#17893 opened Dec 9, 2025 by jeffbolznv
vulkan: Multi-pass softmax for large number of cols (labels: ggml, testing, Vulkan)
#17892 opened Dec 9, 2025 by jeffbolznv
vulkan: Fix data race/hang in scalar/cm1 flash attention (labels: ggml, Vulkan)
#17887 opened Dec 9, 2025 by jeffbolznv