Improve performance of RealSHT / InverseRealSHT by mcgibbon · Pull Request #835 · ai2cm/ace

mcgibbon · 2026-02-13T19:12:04Z

This PR updates RealSHT and InverseRealSHT to improve performance. I see speed ups of ~50% (in the SHT itself, not total) on benchmarks replicating the size of our data during n=512 production runs.

Changes:

Added benchmark classes for RealSHT and InverseRealSHT
Added CPU timing to benchmarking, to corroborate GPU total times
Tests added

mcgibbon · 2026-02-13T19:53:42Z

I ran some local tests at different resolutions. At 1/4 degree with a batch size of 32 I got 1485s -> 947s, and at 2 degrees with a batch size I can't recall (larger than 512 though, maybe 1024) I got 491s -> 275s. So the improvement appears to be independent of grid resolution, so long as the GPU is occupied (I got a 0% speed-up on some experiments with very low batch size). These are also lower bounds for speed-up, since I have not guaranteed full GPU occupancy in these benchmarks.

mcgibbon · 2026-02-13T20:03:48Z

fme/core/benchmark/results/inverserealsht_tesla_t4_44b11334.json

@@ -0,0 +1,12 @@
+{


TODO: these benchmark files are added to help with the review, but need to be deleted before merging.

mcgibbon · 2026-02-17T16:43:28Z

A more modest speedup unfortunately (5%) in the benchmark I ran on Jupiter on H100s, but still a speedup: https://wandb.ai/ai2cm/fme-core-benchmarks baseline run is 13f5b2, run for this branch is f51f0e.

Update: looks like on Titan the result is the opposite, about a 5% slow-down.

For both of these benchmarks, it's not clear to me if the GPU is occupied - the run time is a few ms compared to ~50 on the T4.

mcgibbon added 4 commits February 13, 2026 18:50

add benchmarks for sht and isht

b11c918

add log of total cpu time during benchmark

da4ba94

speed up sht and isht

44b1133

add before and after json benchmarks

dc17cad

add regression target from b11c918

ff33599

mcgibbon changed the title ~~Feature/benchmark sht~~ Improve performance of RealSHT / InverseRealSHT Feb 13, 2026

Merge branch 'main' into feature/benchmark_sht

86a2e36

mcgibbon commented Feb 13, 2026

View reviewed changes

mcgibbon marked this pull request as ready for review February 13, 2026 20:05

mcgibbon mentioned this pull request Feb 13, 2026

Speed up SHT NVIDIA/torch-harmonics#150

Open

mcgibbon added 6 commits February 13, 2026 15:37

Merge branch 'main' into feature/benchmark_sht

3d55bed

add wandb logging for benchmarks

c35fbe3

add benchmarking to CI

e24ee9f

only run benchmarking on commits to main

1fabf00

Merge branch 'main' into feature/benchmark_to_wandb

299240d

Merge branch 'feature/benchmark_to_wandb' into feature/benchmark_sht

517796f

mcgibbon changed the base branch from main to feature/benchmark_to_wandb February 17, 2026 16:05

mcgibbon added 3 commits February 17, 2026 16:10

try adding gantry run script

8e7fcf3

fix run script

f50b4d2

use correct workspace

f51f0ed

simpler lowercase name for benchmark

0e7046e

Base automatically changed from feature/benchmark_to_wandb to main February 18, 2026 16:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of RealSHT / InverseRealSHT#835

Improve performance of RealSHT / InverseRealSHT#835
mcgibbon wants to merge 16 commits intomainfrom
feature/benchmark_sht

mcgibbon commented Feb 13, 2026 •

edited

Loading

Uh oh!

mcgibbon commented Feb 13, 2026

Uh oh!

mcgibbon Feb 13, 2026

Uh oh!

mcgibbon commented Feb 17, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

mcgibbon commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mcgibbon commented Feb 13, 2026

Uh oh!

mcgibbon Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

mcgibbon commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

mcgibbon commented Feb 13, 2026 •

edited

Loading

mcgibbon commented Feb 17, 2026 •

edited

Loading