Skip to content

Tags: ggml-org/llama.cpp

Tags

b7376

Toggle b7376's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
common: support negated args (#17919) * args: support negated args * update docs * fix typo * add more neg options * Apply suggestions from code review Co-authored-by: Sigbjørn Skjæret <[email protected]> * rm duplicated arg * fix LLAMA_ARG_NO_HOST * add test --------- Co-authored-by: Sigbjørn Skjæret <[email protected]>

b7375

Toggle b7375's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
clip: move model cgraphs into their own files (#17965) * clip: move model cgraphs into their own files * more explicit enums * fix linux build * fix naming * missing headers * nits: add comments for contributors

b7374

Toggle b7374's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ci : change the cann version and the container pull method (#17953) fix error format Update build.yml Remove unnecessary zip files fix update

b7372

Toggle b7372's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
CUDA: fix overflow in MMA kernel without stream-k (#17939) 

b7371

Toggle b7371's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
models : fix the attn_factor for mistral3 graphs + improve consistency ( #17945) * models : fix the attn_factor for mistral3 graphs * cont : rework attn_factor correction logic * cont : make deepseek2 consistent * cont : add TODO * cont : special-case DSv2 * cont : revert Mistral 3 Large changes * cont : fix DS2 to use the original attn_factor * cont : minor comments

b7370

Toggle b7370's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
cann : fix ops broken by circular padding guard (#17825) 

b7369

Toggle b7369's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml-cpu : fix RISC-V Q4_0 repack select and RVV feature reporting (#……17951) * ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting Signed-off-by: Wang Yang <[email protected]> * using the name VLEN instead of CNT * Update ggml/include/ggml-cpu.h --------- Signed-off-by: Wang Yang <[email protected]> Co-authored-by: Georgi Gerganov <[email protected]>

b7368

Toggle b7368's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
mtmd: explicitly forbidden inclusion of private header and libcommon (#……17946)

b7366

Toggle b7366's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
arg: add -mm and -mmu as short form of --mmproj and --mmproj-url (#17958) * arg: add -mm and -mmu as short form of --mmproj and --mmproj-url * correct order * update docs

b7364

Toggle b7364's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
common : add minimalist multi-thread progress bar (#17602) Signed-off-by: Adrien Gallouët <[email protected]>