Add complete Megrez-MoE support: GGUF conversion + inference. #17141
Open
tamarPal wants to merge 16 commits into ggml-org:master from tamarPal:feature/megrez-moe
16 commits
6d95df3 feat: Add Megrez-MoE architecture support
92750c4 fix: increase graph nodes for Megrez-MoE warmup
969c4f7 feat: adapt Megrez-MoE to new models/*.cpp architecture
9b433f4 refactor: use standard build_moe_ffn instead of custom build_mergez_m…
755418d fix: remove trailing whitespace
007ef13 fix: resolve additional merge issues from rebase
bad2132 Add Megrez-MoE GGUF conversion and inference support
4b67f5c fix: restore HunYuanMoE code, keep only MegrezMoE Pyright fix
cd46a28 fix: remove unintended HunYuanMoE changes
125a1d2 megrez-moe : fix conversion
019d8f6 megrez-moe : fix pyright type error
f1f7aa9 refactor: improve code clarity and address PR review comments
0c70f2b fix: apply graph_max_nodes factor fix to feature/megrez-moe branch
28854c4 refactor: clean up Megrez-MoE code structure
9f90dbe fix: add missing MODEL_TENSOR.FFN_EXP_PROBS_B to MEGREZ_MOE constants
5e8998a Refactor tensor creation for ffn_exps in llama-model