ggml-org /llama.cppPublic

Notifications You must be signed in to change notification settings
Fork 14.1k
Star 91.2k

Code
Issues336
Pull requests616
Discussions
Actions
Projects10
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: ggml-org/llama.cpp

Labels 86 Milestones 0

New pull requestNew

Clear current search query, filters, and sorts

555 Open 2,114 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

sync : ggml

#17988 opened Dec 13, 2025 by ggerganov

Loading…

kv-cache: Fix state restore fragmented cache testing

Everything test related

#17982 opened Dec 13, 2025 by ssweens

Loading…

webui: fix chat header width when sidebar is closed examples server

#17981 opened Dec 13, 2025 by polydecay

Loading…

mtmd: refactor audio preprocessing examples

#17978 opened Dec 12, 2025 by ngxson • Draft

ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations ggml

changes relating to the ggml tensor library for machine learning

#17977 opened Dec 12, 2025 by ngdxzy

Loading…

Build CUDA architectures 120 and 121 by default (RTX5000 and GB10) ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#17970 opened Dec 12, 2025 by DaAwesomeP

Loading…

mtmd, llama: add GLM4V vision-language model support examples ggml

changes relating to the ggml tensor library for machine learning

model

Model specific

Nvidia GPU

Issues specific to Nvidia GPUs

python

python script changes

#17967 opened Dec 12, 2025 by eelbaz

Loading…

mtmd: (WIP) gemma3n vision support examples python

python script changes

#17961 opened Dec 12, 2025 by ngxson • Draft

server: add encoder-decoder model support (T5, BART, MADLAD) examples server

#17956 opened Dec 12, 2025 by Turee

Loading…

vulkan: Add perf logger mode with concurrency ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#17944 opened Dec 11, 2025 by jeffbolznv

Loading…

vulkan: support get_rows for i32 ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#17941 opened Dec 11, 2025 by jeffbolznv

Loading…

server: support echo + logprobs + stream examples server

#17935 opened Dec 11, 2025 by ngxson • Draft

CANN: CONV_TRANSPOSE_1D operator: supporting the cases where (op->src[0]->ne[0] - 1) > 255 Ascend NPU

issues specific to Ascend NPUs

ggml

changes relating to the ggml tensor library for machine learning

#17934 opened Dec 11, 2025 by Intellouis

Loading…

implement Power Law sampling examples server

#17927 opened Dec 11, 2025 by ddh0 • Draft

ggml-hexagon: gelu operation ggml

changes relating to the ggml tensor library for machine learning

#17921 opened Dec 10, 2025 by joeldushouyu • Draft

Restore clip's cb() to its rightful glory - extract common debugging elements in llama examples

#17914 opened Dec 10, 2025 by pwilkin

Loading…

Make LlamaData utility functions static in llama-run examples

#17913 opened Dec 10, 2025 by rauletorresc

Loading…

server: fix crash when batch > ubatch with embeddings (#12836) examples server

#17912 opened Dec 10, 2025 by yifant-code

Loading…

[WIP]gml-hexagon: Q4_0 mm opt ggml

changes relating to the ggml tensor library for machine learning

#17907 opened Dec 10, 2025 by chraac • Draft

CUDA: experimental native mxfp4 support for blackwell [WIP] ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#17906 opened Dec 10, 2025 by am17an • Draft

1 of 2 tasks

ggml: correct inaccurate comments for GGML_OP_MUL_MAT backward pass [no ci] ggml

changes relating to the ggml tensor library for machine learning

#17899 opened Dec 10, 2025 by csmyx

Loading…

ggml-hexagon: mm for mtmd ggml

changes relating to the ggml tensor library for machine learning

script

Script related

#17894 opened Dec 9, 2025 by joeldushouyu

Loading…

vulkan: support GGML_OP_DIAG ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#17893 opened Dec 9, 2025 by jeffbolznv

Loading…

vulkan: Multi-pass softmax for large number of cols ggml

changes relating to the ggml tensor library for machine learning

testing

Everything test related

Vulkan

Issues specific to the Vulkan backend

#17892 opened Dec 9, 2025 by jeffbolznv

Loading…

vulkan: Fix data race/hang in scalar/cm1 flash attention ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#17887 opened Dec 9, 2025 by jeffbolznv

Loading…

Previous12 3 4 5…22 23 Next

PreviousNext

ProTip! Type gi on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!