- Notifications
You must be signed in to change notification settings - Fork 14.1k
Pull requests: ggml-org/llama.cpp
Author
Uh oh!
There was an error while loading. Please reload this page.
Label
Uh oh!
There was an error while loading. Please reload this page.
Projects
Uh oh!
There was an error while loading. Please reload this page.
Milestones
Uh oh!
There was an error while loading. Please reload this page.
Reviews
Assignee
Assigned to nobodyLoading
Uh oh!
There was an error while loading. Please reload this page.
Sort
Pull requests list
mtmd, llama: add GLM4V vision-language model support examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17967 opened Dec 12, 2025 by eelbazLoading…
Feature/kimi linear support ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17592 opened Nov 29, 2025 by cacaviewLoading…
Add PagedAttention support (experimental, CUDA only) examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs server
#17579 opened Nov 28, 2025 by ericcurtin • Draft
mtmd: Add DeepSeekOCR Support examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17400 opened Nov 20, 2025 by sfallahLoading…
models : add Nougat OCR support with mBART and Swin Transformer examples model Model specific python python script changes
#17398 opened Nov 20, 2025 by h9-tecLoading…
6 of 10 tasks
[model] Add support for Plamo3 model Model specific python python script changes
#17304 opened Nov 16, 2025 by mmngaLoading…
Add complete Megrez-MoE support: GGUF conversion + inference. model Model specific python python script changes
#17141 opened Nov 10, 2025 by tamarPalLoading…
Mamba2 SSD Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#16982 opened Nov 3, 2025 by gabe-l-hart • Draft
support GLM-4.5V and GLM-4.1V vision models examples help wanted Needs help from the community model Model specific python python script changes
Modern Bert Support model Model specific python python script changes
#15641 opened Aug 28, 2025 by ryan-mangenoLoading…
llama: Attempt to add ModernBert model Model specific python python script changes
#14014 opened Jun 4, 2025 by huydt84Loading…
Clamp out of range values in K quantizer bugfix fixes an issue or bug model Model specific Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
Adding Support for Custom Qwen2moe Architectures with mergekit-qwen2 model Model specific Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.