Server · Workflow runs · ggml-org/llama.cpp · GitHub

Actions

Server
Actions
Loading...
Loading

14,591 workflow runs

14,591 workflow runs

kv-cache: Fix state restore fragmented cache Server #23899: Pull request #17982 opened by ssweens

Queuedssweens:fix-state-restore-fragmented-cache

ssweens:fix-state-restore-fragmented-cache

Queued

webui: fix chat header width when sidebar is closed Server #23898: Pull request #17981 synchronize by polydecay

Action requiredpolydecay:251213-fix-collapsed-sidebar-header-wdith

polydecay:251213-fix-collapsed-sidebar-header-wdith

Action required

webui: fix chat header width when sidebar is closed Server #23897: Pull request #17981 synchronize by polydecay

Action requiredpolydecay:251213-fix-collapsed-sidebar-header-wdith

polydecay:251213-fix-collapsed-sidebar-header-wdith

Action required

webui: fix chat header width when sidebar is closed Server #23896: Pull request #17981 opened by polydecay

Action requiredpolydecay:251213-fix-collapsed-sidebar-header-wdith

polydecay:251213-fix-collapsed-sidebar-header-wdith

Action required

common: support negated args (#17919) Server #23895: Commit 380b4c9 pushed by ngxson

Queuedmaster

Queued

mtmd: refactor audio preprocessing Server #23894: Pull request #17978 opened by ngxson

Queuedngxson:xsn/mtmd_refactor_audio_preproc

ngxson:xsn/mtmd_refactor_audio_preproc

Queued

ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations Server #23893: Pull request #17977 opened by ngdxzy

Action requiredngdxzy:real_q8_0

ngdxzy:real_q8_0

Action required

ggml-hexagon: gelu operation Server #23892: Pull request #17921 synchronize by joeldushouyu

Action requiredjoeldushouyu:hexagon-gelu-hvx

joeldushouyu:hexagon-gelu-hvx

Action required

Webui: Disable attachment button and model selector button when prompt textbox is disabled. Server #23891: Pull request #17925 synchronize by dariusjlukas

Action requireddariusjlukas:webui-chatformaction-disable-flag

dariusjlukas:webui-chatformaction-disable-flag

Action required

Webui: Disable attachment button and model selector button when prompt textbox is disabled. Server #23890: Pull request #17925 synchronize by dariusjlukas

Action requireddariusjlukas:webui-chatformaction-disable-flag

dariusjlukas:webui-chatformaction-disable-flag

Action required

common : add llama-completion to completion-bash executables Server #23889: Pull request #17976 opened by CISC

Queuedcisc/llama-completion-completion-bash

cisc/llama-completion-completion-bash

Queued

common : skip model validation when --completion-bash is requested Server #23888: Pull request #17975 opened by CISC

Queuedcisc/skip-model-validation-completion-bash

cisc/skip-model-validation-completion-bash

Queued

llama_context: synchronize before reallocating output buffer Server #23887: Pull request #17974 opened by jeffbolznv

Queuedjeffbolznv:issue_17957

jeffbolznv:issue_17957

Queued

clip: move model cgraphs into their own files (#17965) Server #23886: Commit e39a2ce pushed by ngxson

Queuedmaster

Queued

clip: move model cgraphs into their own files Server #23885: Pull request #17965 synchronize by ngxson

Queuedngxson:xsn/clip_refactor_smaller_files

ngxson:xsn/clip_refactor_smaller_files

Queued

Webui: Disable attachment button and model selector button when prompt textbox is disabled. Server #23884: Pull request #17925 synchronize by dariusjlukas

Action requireddariusjlukas:webui-chatformaction-disable-flag

dariusjlukas:webui-chatformaction-disable-flag

Action required

cmake: correct scope - link ws2_32 for MinGW/w64devkit builds in cpp-httplib Server #23883: Pull request #17972 opened by gustrd

Queuedgustrd:master

Queued

Webui: Disable attachment button and model selector button when prompt textbox is disabled. Server #23882: Pull request #17925 synchronize by dariusjlukas

Action requireddariusjlukas:webui-chatformaction-disable-flag

dariusjlukas:webui-chatformaction-disable-flag

Action required

Build CUDA architectures 120 and 121 by default (RTX5000 and GB10) Server #23881: Pull request #17970 opened by DaAwesomeP

Action requiredmitmedialab:cuda_arch_120_and_121

mitmedialab:cuda_arch_120_and_121

Action required

llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization Server #23880: Pull request #16653 synchronize by JohannesGaessler

QueuedJohannesGaessler:llama-memory-fit-9

JohannesGaessler:llama-memory-fit-9

Queued

common: support negated args Server #23879: Pull request #17919 synchronize by ngxson

Queuedngxson:xsn/arg_neg

ngxson:xsn/arg_neg

Queued

webui: Improve copy to clipboard with text attachments Server #23878: Pull request #17969 synchronize by allozaur

Queuedallozaur:17834-copy-user-message-content-with-pasted-content-attachments

allozaur:17834-copy-user-message-content-with-pasted-content-attachments

Queued

webui: Improve copy to clipboard with text attachments Server #23877: Pull request #17969 opened by allozaur

3m 5s allozaur:17834-copy-user-message-content-with-pasted-content-attachments

allozaur:17834-copy-user-message-content-with-pasted-content-attachments

3m 5s

ggml-hexagon: gelu operation Server #23876: Pull request #17921 synchronize by joeldushouyu

Action requiredjoeldushouyu:hexagon-gelu-hvx

joeldushouyu:hexagon-gelu-hvx

Action required

common: support negated args Server #23875: Pull request #17919 synchronize by ngxson

12m 15s ngxson:xsn/arg_neg

ngxson:xsn/arg_neg

12m 15s