@jeffbolznv
Collaborator

The dequantize functions are copied from mul_mm_funcs.comp with very few changes: they add a_offset and divide iqs by 2. It's probably possible to call the functions in mul_mm_funcs.comp directly and avoid the duplication, but I didn't go that far in this change.

See leejet/stable-diffusion.cpp#851 (comment).
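
For readers unfamiliar with the operation, here is a minimal sketch of what a GET_ROWS over block-quantized data has to do: locate the blocks of each requested row, dequantize them, and write the result. It is not the shader code from this change. The block layout (one float scale plus four int8 values), the `quantize`, `dequantize_block`, and `get_rows` helpers, and the reading of `a_offset` as a base block offset into the source buffer are all assumptions made for illustration. The `iqs` halving mentioned above is not modeled.

```python
import numpy as np

BLOCK = 4  # values per block; a toy size chosen for this sketch only


def quantize(x):
    """Toy block quantizer (hypothetical format): one scale + int8 values per block."""
    blocks = x.reshape(-1, BLOCK).astype(np.float32)
    scales = np.abs(blocks).max(axis=1) / 127.0
    scales[scales == 0.0] = 1.0  # avoid dividing by zero for all-zero blocks
    q = np.round(blocks / scales[:, None]).astype(np.int8)
    return scales, q


def dequantize_block(scales, q, ib, a_offset):
    """Dequantize block `ib`, counted from base block offset `a_offset` (assumed meaning)."""
    b = a_offset + ib
    return scales[b] * q[b].astype(np.float32)


def get_rows(scales, q, row_ids, n_cols, a_offset=0):
    """Gather the rows listed in `row_ids` and return them as float32."""
    blocks_per_row = n_cols // BLOCK
    out = np.empty((len(row_ids), n_cols), dtype=np.float32)
    for r, row in enumerate(row_ids):
        first = row * blocks_per_row  # first block of the requested row
        for ib in range(blocks_per_row):
            out[r, ib * BLOCK:(ib + 1) * BLOCK] = dequantize_block(
                scales, q, first + ib, a_offset
            )
    return out


src = np.arange(24, dtype=np.float32).reshape(3, 8) - 10.0
scales, q = quantize(src)
rows = get_rows(scales, q, [2, 0], n_cols=8)  # rows 2 and 0, in that order
```

Passing a nonzero `a_offset` shifts every block lookup by the same amount, which is how the sketch models a source tensor that starts partway into a larger buffer.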

@github-actions bot added the Vulkan (issues specific to the Vulkan backend) and ggml (changes relating to the ggml tensor library for machine learning) labels on Sep 24, 2025
@0cc4m merged commit 3f81b4e into ggml-org:master on Sep 27, 2025
63 of 65 checks passed
pwilkin pushed a commit to pwilkin/llama.cpp that referenced this pull request on Oct 23, 2025
@ggerganov
Member

@jeffbolznv GET_ROWS support for I32 would be useful for #17004
