- Notifications
You must be signed in to change notification settings - Fork 14.1k
Pull requests: ggml-org/llama.cpp
Author
Uh oh!
There was an error while loading. Please reload this page.
Label
Uh oh!
There was an error while loading. Please reload this page.
Projects
Uh oh!
There was an error while loading. Please reload this page.
Milestones
Uh oh!
There was an error while loading. Please reload this page.
Reviews
Assignee
Assigned to nobodyLoading
Uh oh!
There was an error while loading. Please reload this page.
Sort
Pull requests list
Modern Bert Support model Model specific python python script changes
#15641 opened Aug 28, 2025 by ryan-mangenoLoading…
sampling : add support for backend sampling Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server testing Everything test related
#17004 opened Nov 4, 2025 by danbevLoading…
17 of 25 tasks
llama: Attempt to add ModernBert model Model specific python python script changes
#14014 opened Jun 4, 2025 by huydt84Loading…
add FP8 support to gguf/llama: build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning script Script related Tensor Encoding Scheme https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes testing Everything test related
mtmd: Add DeepSeekOCR Support examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17400 opened Nov 20, 2025 by sfallahLoading…
Implement SparseK Attention mechanism — new GGML operator with CPU backend (GPU planned next) ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#16817 opened Oct 28, 2025 by yael-worksLoading…
tool: add convertation of text/parquet to custom format build Compilation issues examples
#14622 opened Jul 10, 2025 by lexasubLoading…
Feature/kimi linear support ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#17592 opened Nov 29, 2025 by cacaviewLoading…
imatrix: calculate activation-based statistics for new format (GGUF) imatrices examples
#14891 opened Jul 26, 2025 by EAddarioLoading…
Implementation of a sequence repetition penalty sampler enhancement New feature or request generation quality Quality of model output need feedback Testing and feedback with results are needed
#2593 opened Aug 12, 2023 by KerfuffleV2 • Draft
WIP: Add model Demonstrate some concept or idea, not intended to be merged help wanted Needs help from the community
merge example demo cuda : Add conv2d Implicit GEMM ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#15805 opened Sep 4, 2025 by bssrdfLoading…
[MPI] Add support for per-node options, thread counts, and layer allocations build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning server
#3334 opened Sep 26, 2023 by AutonomicPerfectionist • Draft
2 of 5 tasks
Update gpt2 preprocess and add deepseek coder preprocess
#4070 opened Nov 14, 2023 by DOGEwbxLoading…
Generic Chat templating code with text/json file based config; main chat updated to drive its in-prefix, in-suffix and reverse-prompt from same; chat-apply-template equivalent c-api to allow use by other codes also enhancement New feature or request Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
support MiniCPM-V-2 demo Demonstrate some concept or idea, not intended to be merged enhancement New feature or request examples python python script changes Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
#6919 opened Apr 26, 2024 by AchazwlLoading…
Layer skipping/self-speculation demo demo Demonstrate some concept or idea, not intended to be merged research 🔬
#3565 opened Oct 10, 2023 by KerfuffleV2 • Draft
Server: enable lookup decoding enhancement New feature or request examples Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#6828 opened Apr 22, 2024 by JohannesGaesslerLoading…
PreviousNext
ProTip! What’s not been updated in a month: updated:<2025-11-13.