Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobodyLoading
Sort

Pull requests list

sync : ggml
#17988 opened Dec 13, 2025 by ggerganovLoading…
kv-cache: Fix state restore fragmented cache testing Everything test related
#17982 opened Dec 13, 2025 by ssweensLoading…
ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations ggml changes relating to the ggml tensor library for machine learning
#17977 opened Dec 12, 2025 by ngdxzyLoading…
Build CUDA architectures 120 and 121 by default (RTX5000 and GB10) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17970 opened Dec 12, 2025 by DaAwesomePLoading…
mtmd, llama: add GLM4V vision-language model support examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17967 opened Dec 12, 2025 by eelbazLoading…
mtmd: (WIP) gemma3n vision support examples python python script changes
#17961 opened Dec 12, 2025 by ngxson Draft
vulkan: Add perf logger mode with concurrency ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17944 opened Dec 11, 2025 by jeffbolznvLoading…
vulkan: support get_rows for i32 ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17941 opened Dec 11, 2025 by jeffbolznvLoading…
CANN: CONV_TRANSPOSE_1D operator: supporting the cases where (op->src[0]->ne[0] - 1) > 255 Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#17934 opened Dec 11, 2025 by IntellouisLoading…
ggml-hexagon: gelu operation ggml changes relating to the ggml tensor library for machine learning
#17921 opened Dec 10, 2025 by joeldushouyu Draft
[WIP]gml-hexagon: Q4_0 mm opt ggml changes relating to the ggml tensor library for machine learning
#17907 opened Dec 10, 2025 by chraac Draft
CUDA: experimental native mxfp4 support for blackwell [WIP] ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17906 opened Dec 10, 2025 by am17an Draft
1 of 2 tasks
ggml: correct inaccurate comments for GGML_OP_MUL_MAT backward pass [no ci] ggml changes relating to the ggml tensor library for machine learning
#17899 opened Dec 10, 2025 by csmyxLoading…
ggml-hexagon: mm for mtmd ggml changes relating to the ggml tensor library for machine learning script Script related
#17894 opened Dec 9, 2025 by joeldushouyuLoading…
vulkan: support GGML_OP_DIAG ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17893 opened Dec 9, 2025 by jeffbolznvLoading…
vulkan: Multi-pass softmax for large number of cols ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#17892 opened Dec 9, 2025 by jeffbolznvLoading…
vulkan: Fix data race/hang in scalar/cm1 flash attention ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17887 opened Dec 9, 2025 by jeffbolznvLoading…
ProTip! Type gi on any issue or pull request to go back to the issue listing page.