Conversation
MLCommons CLA bot:
mlperf_logging/compliance_checker/training_6.0.0/closed_common.yaml
mlperf_logging/compliance_checker/training_6.0.0/closed_deepseekv3_671b.yaml
039c2fa to f3a3dc7
- Add rcps_deepseek_v3_671b.json stub with BS 16384/18432/20480, learning rates, warmup steps, and gradient accumulation steps
- Register deepseek_v3_671b in benchmark_meta.py (result file counts and allowed benchmarks for 6.0)
- Add deepseek_v3_671b to submission_runs and eval_accuracy parsing in rcp_checker.py
- Add deepseek_v3_671b entry to result_summarizer config.yaml
f3a3dc7 to c528ecb
recheck
@denys-fridman - can you please complete the CLA? Also, can you create a PR to training_rules that adds GB300 to the list of acceptable reference hardware (https://github.com/mlcommons/training_policies/blob/master/CONTRIBUTING.md#general)?
    REQ: EXACTLY_ONE
    CHECK: " v['value'] == 'adamw' "

- KEY:
LR and warmup need to be fixed, right? The value should be checked to make sure it follows the fixed formula.
    NAME: opt_learning_rate_warmup_steps
    REQ: EXACTLY_ONE

- KEY:
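The reviewer's request could be expressed as a pinned CHECK on the logged value, in the same style as the adamw check above. This is only a sketch: the 2048 below is a hypothetical placeholder, not the value mandated by the reference.

```yaml
- KEY:
    NAME: opt_learning_rate_warmup_steps
    REQ: EXACTLY_ONE
    CHECK: " v['value'] == 2048 "  # hypothetical pinned value, for illustration only
```

A similarly pinned CHECK would apply to the decay-steps key discussed in the next comment, with the expected value derived from the reference implementation.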
Decay steps should be fixed as well, and the checker should verify that the value matches what the reference expects.
    'flux1': 10,
    'llama31_405b': 3,
    'llama31_8b': 10,
    'deepseek_v3_671b': 10,
Do we indeed expect 10 submission runs?
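For context, the mapping under review drives a simple count check: a submission must contain exactly the expected number of result files per benchmark. The sketch below illustrates that check; the function name and structure are assumptions for illustration, not the actual rcp_checker code.

```python
# Hypothetical sketch of the per-benchmark run-count check.
# The counts mirror the mapping in the diff above; whether 10 is
# correct for deepseek_v3_671b is exactly the open question.
EXPECTED_RUNS = {
    'flux1': 10,
    'llama31_405b': 3,
    'llama31_8b': 10,
    'deepseek_v3_671b': 10,
}

def check_run_count(benchmark, result_files):
    """Return True if the submission has the expected number of result files."""
    expected = EXPECTED_RUNS.get(benchmark)
    if expected is None:
        raise ValueError(f"unknown benchmark: {benchmark}")
    return len(result_files) == expected
```

For example, `check_run_count('deepseek_v3_671b', files)` passes only when `files` contains exactly 10 entries.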
recheck