
Do not shard unused parameters #1773

Open

kctezcan wants to merge 1 commit into ecmwf:develop from MeteoSwiss:ktezcan/dev/iss1750_load_sharding

Conversation

@kctezcan
Contributor

@kctezcan kctezcan commented Feb 2, 2026

Description

See #1750

Issue Number

Closes #1750

Is this PR a draft? Mark it as draft.

Checklist before asking for review

  • I have performed a self-review of my code
  • My changes comply with basic sanity checks:
    • I have fixed formatting issues with ./scripts/actions.sh lint
    • I have run unit tests with ./scripts/actions.sh unit-test
    • I have documented my code and I have updated the docstrings.
    • I have added unit tests, if relevant
  • I have tried my changes with data and code:
    • I have run the integration tests with ./scripts/actions.sh integration-test
    • (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
    • (bigger changes and experiments) I have shared a HedgeDoc in the GitHub issue with all the configurations and runs for these experiments
  • I have informed and aligned with people impacted by my change:
    • for config changes: the MatterMost channels and/or a design doc
    • for changes of dependencies: the MatterMost software development channel

Contributor

@shmh40 shmh40 left a comment


Tested and doesn't cause problems for me.

    # maybe_sharded_sd[param_name.replace("module.", "")] = nn.Parameter(sharded_tensor)
    maybe_sharded_sd[param_name] = torch.nn.Parameter(sharded_tensor)
    if sharded_meta_param is None:
        logger.info(f"Sharding meta parameters is None for: {param_name}")
Collaborator


Is it correct that sharded_meta_param is None means that this is a parameter in the checkpoint that is not present in the current model?

        sharded_meta_param.placements,
    )
    # maybe_sharded_sd[param_name.replace("module.", "")] = nn.Parameter(sharded_tensor)
    maybe_sharded_sd[param_name] = torch.nn.Parameter(sharded_tensor)
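The logic under discussion can be sketched as follows. This is a minimal, torch-free reconstruction, not the actual PR diff: the function name `shard_state_dict`, the `shard_fn` callback, and the choice to skip (rather than keep unsharded) a parameter missing from the model are all assumptions made for illustration; in the real code the sharding is done with DTensor placements.

```python
import logging

logger = logging.getLogger(__name__)

def shard_state_dict(checkpoint_sd, meta_sd, shard_fn):
    """Shard checkpoint tensors onto the model's layout, skipping
    parameters present in the checkpoint but absent from the current
    model (hypothetical sketch of the PR's intent).

    checkpoint_sd: mapping of param name -> loaded tensor
    meta_sd: mapping of param name -> meta parameter (carries placements)
    shard_fn: callable(tensor, meta_param) -> sharded tensor
    """
    maybe_sharded_sd = {}
    for param_name, tensor in checkpoint_sd.items():
        sharded_meta_param = meta_sd.get(param_name)
        if sharded_meta_param is None:
            # The parameter exists in the checkpoint but not in the (used
            # part of the) current model, so there is no placement to shard
            # against; log and skip instead of failing.
            logger.info("Sharded meta parameter is None for: %s", param_name)
            continue
        maybe_sharded_sd[param_name] = shard_fn(tensor, sharded_meta_param)
    return maybe_sharded_sd
```

Under this reading, `sharded_meta_param is None` does mark exactly the case the reviewer asks about: a checkpoint parameter with no counterpart in the current model.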
Collaborator


Can we please remove the line below.

@github-project-automation github-project-automation bot moved this to In Progress in WeatherGen-dev Feb 6, 2026
@clessig
Collaborator

clessig commented Feb 11, 2026

@kctezcan : can you address the comments so that we can merge this


Labels

None yet

Projects

Status: In Progress

Development

Successfully merging this pull request may close these issues.

Part of network cannot be sharded during loading if not used

3 participants