
Pull requests: ggml-org/llama.cpp


16 Open 69 Closed


Pull requests list

mtmd, llama: add GLM4V vision-language model support
Labels: examples, ggml, model, Nvidia GPU, python
#17967 opened Dec 12, 2025 by eelbaz

Feature/kimi linear support
Labels: ggml, model, Nvidia GPU, python
#17592 opened Nov 29, 2025 by cacaview

Improve Qwen3-Next Speed
Labels: model
#17585 opened Nov 29, 2025 by lovedheart • Draft

Add PagedAttention support (experimental, CUDA only)
Labels: examples, ggml, model, Nvidia GPU, server
#17579 opened Nov 28, 2025 by ericcurtin • Draft

model : add LLADA 2.0 diffusion support
Labels: examples, model, python
#17454 opened Nov 23, 2025 by wsbagnsv1 • Draft

mtmd: Add DeepSeekOCR Support
Labels: examples, ggml, model, Nvidia GPU, python
#17400 opened Nov 20, 2025 by sfallah

models : add Nougat OCR support with mBART and Swin Transformer
Labels: examples, model, python
#17398 opened Nov 20, 2025 by h9-tec
6 of 10 tasks

[model] Add support for Plamo3
Labels: model, python
#17304 opened Nov 16, 2025 by mmnga

llama: add attn temperature tuning for llama arch (non-iswa)
Labels: model, python
#17239 opened Nov 13, 2025 by ngxson • Draft

Add complete Megrez-MoE support: GGUF conversion + inference.
Labels: model, python
#17141 opened Nov 10, 2025 by tamarPal

Mamba2 SSD
Labels: Apple Metal, examples, ggml, model, Nvidia GPU, testing
#16982 opened Nov 3, 2025 by gabe-l-hart • Draft

support GLM-4.5V and GLM-4.1V vision models
Labels: examples, help wanted, model, python
#16600 opened Oct 15, 2025 by ddh0 • Draft

Modern Bert Support
Labels: model, python
#15641 opened Aug 28, 2025 by ryan-mangeno

llama: Attempt to add ModernBert
Labels: model, python
#14014 opened Jun 4, 2025 by huydt84

Clamp out of range values in K quantizer
Labels: bugfix, model, Review Complexity : Medium
#6888 opened Apr 25, 2024 by jart • Draft

Adding Support for Custom Qwen2moe Architectures with mergekit-qwen2
Labels: model, Review Complexity : High
#6453 opened Apr 3, 2024 by DisOOM • Draft
