Pull requests: ggml-org/llama.cpp

Pull requests list

mtmd, llama: add GLM4V vision-language model support (labels: examples, ggml, model, Nvidia GPU, python)
#17967 opened Dec 12, 2025 by eelbaz
Feature/kimi linear support (labels: ggml, model, Nvidia GPU, python)
#17592 opened Nov 29, 2025 by cacaview
Improve Qwen3-Next Speed (labels: model)
#17585 opened Nov 29, 2025 by lovedheart Draft
Add PagedAttention support (experimental, CUDA only) (labels: examples, ggml, model, Nvidia GPU, server)
#17579 opened Nov 28, 2025 by ericcurtin Draft
model : add LLADA 2.0 diffusion support (labels: examples, model, python)
#17454 opened Nov 23, 2025 by wsbagnsv1 Draft
mtmd: Add DeepSeekOCR Support (labels: examples, ggml, model, Nvidia GPU, python)
#17400 opened Nov 20, 2025 by sfallah
models : add Nougat OCR support with mBART and Swin Transformer (labels: examples, model, python)
#17398 opened Nov 20, 2025 by h9-tec
6 of 10 tasks
[model] Add support for Plamo3 (labels: model, python)
#17304 opened Nov 16, 2025 by mmnga
llama: add attn temperature tuning for llama arch (non-iswa) (labels: model, python)
#17239 opened Nov 13, 2025 by ngxson Draft
Add complete Megrez-MoE support: GGUF conversion + inference. (labels: model, python)
#17141 opened Nov 10, 2025 by tamarPal
Mamba2 SSD (labels: Apple Metal, examples, ggml, model, Nvidia GPU, testing)
#16982 opened Nov 3, 2025 by gabe-l-hart Draft
support GLM-4.5V and GLM-4.1V vision models (labels: examples, help wanted, model, python)
#16600 opened Oct 15, 2025 by ddh0 Draft
Modern Bert Support (labels: model, python)
#15641 opened Aug 28, 2025 by ryan-mangeno
llama: Attempt to add ModernBert (labels: model, python)
#14014 opened Jun 4, 2025 by huydt84
Clamp out of range values in K quantizer (labels: bugfix, model, Review Complexity : Medium)
#6888 opened Apr 25, 2024 by jart Draft
Adding Support for Custom Qwen2moe Architectures with mergekit-qwen2 (labels: model, Review Complexity : High)
#6453 opened Apr 3, 2024 by DisOOM Draft