Pull requests: ggml-org/llama.cpp
ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations
#17977 opened Dec 12, 2025 by ngdxzy
common : add llama-completion to completion-bash executables
#17976 opened Dec 12, 2025 by CISC
common : skip model validation when --completion-bash is requested
#17975 opened Dec 12, 2025 by CISC
llama_context: synchronize before reallocating output buffer
#17974 opened Dec 12, 2025 by jeffbolznv
cmake: correct scope - link ws2_32 for MinGW/w64devkit builds in cpp-httplib
#17972 opened Dec 12, 2025 by gustrd
Build CUDA architectures 120 and 121 by default (RTX5000 and GB10) [ggml, Nvidia GPU]
#17970 opened Dec 12, 2025 by DaAwesomeP
mtmd: fix GLM4V vision encoder 2D RoPE implementation [examples, ggml, model, Nvidia GPU, python]
#17967 opened Dec 12, 2025 by eelbaz
server: support global section of presets [examples, server]
#17959 opened Dec 12, 2025 by ngxson
server: add encoder-decoder model support (T5, BART, MADLAD) [examples, server]
#17956 opened Dec 12, 2025 by Turee
scripts: add script to compare logprobs of llama.cpp against other frameworks [python, script]
#17947 opened Dec 11, 2025 by ngxson
vulkan: Add perf logger mode with concurrency [ggml, Vulkan]
#17944 opened Dec 11, 2025 by jeffbolznv
vulkan: support get_rows for i32 [ggml, Vulkan]
#17941 opened Dec 11, 2025 by jeffbolznv
CANN: CONV_TRANSPOSE_1D operator: supporting the cases where (op->src[0]->ne[0] - 1) > 255 [Ascend NPU, ggml]
#17934 opened Dec 11, 2025 by Intellouis
Webui: Disable attachment button and model selector button when prompt textbox is disabled. [examples, server]
#17925 opened Dec 11, 2025 by dariusjlukas
Gigachat 3 tool parser and tests [testing]
#17924 opened Dec 11, 2025 by Mishusha
ggml-hexagon: gelu operation [ggml]
#17921 opened Dec 10, 2025 by joeldushouyu (Draft)
Restore clip's cb() to its rightful glory - extract common debugging elements in llama [examples]
#17914 opened Dec 10, 2025 by pwilkin
Make LlamaData utility functions static in llama-run [examples]
#17913 opened Dec 10, 2025 by rauletorresc
server: fix crash when batch > ubatch with embeddings (#12836) [examples, server]
#17912 opened Dec 10, 2025 by yifant-code