Skip to content

[Do not Merge] WA for Qwen3 qkv/up_gate weight load error#59

Open
feiwan1 wants to merge 3 commits intofmiao2372:develop_hpufrom
feiwan1:hpu_qwen_upstream_wa
Open

[Do not Merge] WA for Qwen3 qkv/up_gate weight load error#59
feiwan1 wants to merge 3 commits intofmiao2372:develop_hpufrom
feiwan1:hpu_qwen_upstream_wa

Conversation

@feiwan1
Copy link
Collaborator

@feiwan1 feiwan1 commented Nov 25, 2025

Motivation

Modifications

Usage or Command

Accuracy Tests

Checklist

  • Add at least a tag in the PR title.
    • Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
    • You can add new tags based on the PR content, but the semantics must be clear.
  • Format your code, run pre-commit before commit.
  • Add unit tests. Please write the reason in this PR if no unit tests.
  • Provide accuracy results.
  • If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants