Tags: ggml-org/llama.cpp
Tags
common: support negated args (#17919) * args: support negated args * update docs * fix typo * add more neg options * Apply suggestions from code review Co-authored-by: Sigbjørn Skjæret <[email protected]> * rm duplicated arg * fix LLAMA_ARG_NO_HOST * add test --------- Co-authored-by: Sigbjørn Skjæret <[email protected]>
models : fix the attn_factor for mistral3 graphs + improve consistency ( #17945) * models : fix the attn_factor for mistral3 graphs * cont : rework attn_factor correction logic * cont : make deepseek2 consistent * cont : add TODO * cont : special-case DSv2 * cont : revert Mistral 3 Large changes * cont : fix DS2 to use the original attn_factor * cont : minor comments
ggml-cpu : fix RISC-V Q4_0 repack select and RVV feature reporting (#……17951) * ggml-cpu:fix RISC-V Q4_0 repack select and RVV feature reporting Signed-off-by: Wang Yang <[email protected]> * using the name VLEN instead of CNT * Update ggml/include/ggml-cpu.h --------- Signed-off-by: Wang Yang <[email protected]> Co-authored-by: Georgi Gerganov <[email protected]>
PreviousNext