Pull requests: ggml-org/llama.cpp

Pull requests list

Modern Bert Support (labels: model, python)
#15641 opened Aug 28, 2025 by ryan-mangeno
llama : add llama_batch_ext (labels: android, examples, python, server)
#11875 opened Feb 14, 2025 by ngxson
sampling : add support for backend sampling (labels: Apple Metal, build, examples, ggml, Nvidia GPU, python, server, testing)
#17004 opened Nov 4, 2025 by danbev
17 of 25 tasks
llama: Attempt to add ModernBert (labels: model, python)
#14014 opened Jun 4, 2025 by huydt84
add FP8 support to gguf/llama: (labels: build, examples, ggml, script, Tensor Encoding Scheme, testing)
#10055 opened Oct 26, 2024 by Djip007 Draft
1 of 3 tasks
mtmd: Add DeepSeekOCR Support (labels: examples, ggml, model, Nvidia GPU, python)
#17400 opened Nov 20, 2025 by sfallah
Implement SparseK Attention mechanism — new GGML operator with CPU backend (GPU planned next) (labels: ggml, python, testing)
#16817 opened Oct 28, 2025 by yael-works
tool: add convertation of text/parquet to custom format (labels: build, examples)
#14622 opened Jul 10, 2025 by lexasub
model : add LLADA 2.0 diffusion support (labels: examples, model, python)
#17454 opened Nov 23, 2025 by wsbagnsv1 Draft
Feature/kimi linear support (labels: ggml, model, Nvidia GPU, python)
#17592 opened Nov 29, 2025 by cacaview
Implementation of a sequence repetition penalty sampler (labels: enhancement, generation quality, need feedback)
#2593 opened Aug 12, 2023 by KerfuffleV2 Draft
llama : second attempt to refactor vision API (labels: examples, python, server)
#11292 opened Jan 18, 2025 by ngxson Draft
1 of 5 tasks
llama-cli: add support for reasoning (labels: examples)
#16603 opened Oct 16, 2025 by bandoti
WIP: Add model merge example (labels: demo, help wanted)
#5741 opened Feb 26, 2024 by ngxson Draft
cuda : Add conv2d Implicit GEMM (labels: ggml, Nvidia GPU, testing)
#15805 opened Sep 4, 2025 by bssrdf
[MPI] Add support for per-node options, thread counts, and layer allocations (labels: build, examples, ggml, server)
#3334 opened Sep 26, 2023 by AutonomicPerfectionist Draft
2 of 5 tasks
Implement llama-pull tool (labels: examples)
#16423 opened Oct 4, 2025 by ericcurtin
support MiniCPM-V-2 (labels: demo, enhancement, examples, python, Review Complexity : High)
#6919 opened Apr 26, 2024 by Achazwl
Layer skipping/self-speculation demo (labels: demo, research 🔬)
#3565 opened Oct 10, 2023 by KerfuffleV2 Draft
Server: enable lookup decoding (labels: enhancement, examples, Review Complexity : Medium)
#6828 opened Apr 22, 2024 by JohannesGaessler