Skip to content

Comments

Fix/megatron/megatron fsdp patches#553

Open
jasainio wants to merge 1 commit intomainfrom
fix/megatron/megatron-fsdp-patches
Open

Fix/megatron/megatron fsdp patches#553
jasainio wants to merge 1 commit intomainfrom
fix/megatron/megatron-fsdp-patches

Conversation

@jasainio
Copy link

Adds a patch to fix Megatron FSDP compatibility with PyTorch 2.10+. The patch updates get_mesh_names to use the new DeviceMesh API (_get_root_mesh() and _flatten_mapping) instead of the deprecated _mesh_resources.child_to_root_mapping removed in PyTorch 2.10. The patch is automatically applied when use_megatron_fsdp is enabled.

@jasainio jasainio force-pushed the fix/megatron/megatron-fsdp-patches branch 3 times, most recently from 00239a9 to 8002c01 Compare February 10, 2026 13:20
@jasainio jasainio force-pushed the fix/megatron/megatron-fsdp-patches branch from 8002c01 to 1a4685c Compare February 10, 2026 13:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants