Add automatic mixed precision training for ATME #2
Conversation
Codex Review: Here are some suggestions.
Reply with "@codex fix comments" to fix any unresolved comments.
self.use_amp = bool(getattr(opt, 'use_amp', False)) and self.isTrain and torch.cuda.is_available()
self.scaler = GradScaler(enabled=self.use_amp)
[P1] Enable AMP only when model runs on CUDA device
self.use_amp can end up True whenever torch.cuda.is_available() is true, regardless of whether the model is actually placed on a CUDA device. When the user selects CPU execution (e.g. --gpu_ids -1) on a machine that still has CUDA available, optimize_parameters enters a CUDA autocast context and uses GradScaler while all tensors stay on CPU. torch.cuda.amp raises at runtime in that configuration, so CPU training now crashes unless AMP is manually disabled. The previous implementation worked on CPU; this change regresses that path. The flag should also check the selected device (e.g. non-empty self.gpu_ids or self.device.type == 'cuda') before enabling AMP.
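A minimal sketch of the suggested guard, assuming pix2pix-style model attributes (self.gpu_ids, self.device, self.isTrain) as referenced above; exact attribute names in ATME may differ:

```python
import torch
from torch.cuda.amp import GradScaler

# Enable AMP only when the user asked for it, we are training, CUDA is
# available, AND the model actually runs on a CUDA device.
# self.gpu_ids / self.device are assumed model attributes (illustrative).
self.use_amp = (
    bool(getattr(opt, 'use_amp', False))
    and self.isTrain
    and torch.cuda.is_available()
    and len(self.gpu_ids) > 0          # or: self.device.type == 'cuda'
)

# GradScaler(enabled=False) is a no-op (scale() passes values through and
# step() just calls optimizer.step()), so the same optimize_parameters
# code path keeps working for plain FP32 training.
self.scaler = GradScaler(enabled=self.use_amp)
```

With a guard like this, --gpu_ids -1 falls back to FP32 training on CPU instead of entering a CUDA autocast context and crashing.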
Summary
Testing
https://chatgpt.com/codex/tasks/task_e_68d35ae1eeec83319cf61e547d8fb9ae