Updated workflow templating by mranst · Pull Request #656 · GEOS-ESM/swell

mranst · 2025-11-10T16:07:08Z

This is the latest version of the proposal to move the workflow templating to Python. This approach should be more straightforward and easier to understand than the previous version: it keeps the same Jinja templating in the graph section for readability (though it doesn't strictly have to). This PR also merges the task_runtimes and task_questions objects used by the previous PR into one object for simplicity. My aim is to make the tasks more independent of the suite creation system, so that configs can be generated for single tasks and each task can have mock values for code tests, but that part is not ready to show yet.

I still need to merge develop into this and update the docs, but I wanted to open this up as a draft.

mranst · 2026-01-20T22:33:28Z

Setup declarations have been moved into the main task files. I went with method 3, as that seemed to work best. Method 4 seemed to register the inherited classes under each instance that TaskSetup was imported under, so it couldn't be used to track them centrally. Method 5 worked, but there wasn't a robust way to handle the naming of the classes without initializing them. I moved the external imports (eva, r2d2, jedi_bundle) into the execute methods of the tasks that need them, but there may be a better way of handling this.
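A minimal sketch of the local-import pattern described above, for readers unfamiliar with it. The class name `TaskSketch` is hypothetical, and the standard-library `json` module stands in for a heavy optional dependency such as eva so that the example runs anywhere; it is not swell's actual task code.

```python
class TaskSketch:
    """Hypothetical stand-in for a swell task (not the real base class)."""

    def execute(self) -> str:
        # Deferred (local) import: the dependency is resolved only when the
        # task actually runs, so merely importing this module, for example to
        # read the task's setup declaration, does not require it.
        import json  # stand-in for an external package such as eva

        return json.dumps({"status": "ok"})


# Defining or importing the class costs nothing; the import happens here.
result = TaskSketch().execute()
print(result)
```

The trade-off is that a missing dependency surfaces at run time rather than at import time, which is presumably why the review asks for the pattern to be documented.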

shiklomanov-an

Nicely done! I really like how this is looking. I think having the task parameters in the same files as the task definitions will be very useful.

I flagged only a few (hopefully minor) things.

Some of these are type annotation issues flagged by my type checker.
There's also what looks like a bug related to override as a file path vs. a dictionary.
Finally, there are a few documentation requests for implementation details and design decisions, to help future developers (since we're using slightly esoteric Python machinery to get this working).
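To illustrate the kind of path-versus-dictionary ambiguity mentioned above, here is a hedged sketch of normalizing an override argument up front. The function name `resolve_override` and the key `window_length` are hypothetical, and JSON is used only to keep the sketch standard-library-only; swell reads YAML, and this is not the fix that was applied in the PR.

```python
import json
import tempfile
from collections.abc import Mapping
from pathlib import Path
from typing import Union


def resolve_override(override: Union[None, str, Path, Mapping]) -> dict:
    """Return the override as a plain dict, whatever form it arrived in."""
    if override is None:
        return {}
    if isinstance(override, Mapping):
        # Already a dictionary: copy it so callers cannot mutate the input.
        return dict(override)
    if isinstance(override, (str, Path)):
        # A file path: load its contents (JSON here; YAML in swell itself).
        with open(override, "r") as handle:
            loaded = json.load(handle)
        if not isinstance(loaded, dict):
            raise TypeError("override file must contain a mapping")
        return loaded
    raise TypeError(f"unsupported override type: {type(override).__name__}")


# Both forms should resolve to the same dictionary.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "override.json"
    path.write_text(json.dumps({"window_length": "PT6H"}))
    from_file = resolve_override(path)
from_dict = resolve_override({"window_length": "PT6H"})
```

Doing the type check once at the entry point means downstream code can assume a dictionary and does not need to branch again.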

(Re: local import documentation: if we're just using local imports for consistency across modules, that's totally acceptable and you can disregard my comments about those. But maybe add a line to the documentation somewhere about why we're using that general pattern, with reference to specific cases like eva and r2d2.)

@Dooruk @rtodling -- Take a look at the new task definition structure (any of the files under tasks) and let me know what you think. Don't worry too much about the underlying machinery; I'm more interested in what you think of this overall structure (compared to having all the task questions sit in a single giant YAML file / Python script like we had before; #314).

src/swell/suites/base/all_suites.py

src/swell/deployment/create_task_config.py

src/swell/swell.py

src/swell/deployment/create_experiment.py

src/swell/tasks/base/task_setup.py

src/swell/tasks/build_geos.py

src/swell/tasks/build_geos_by_linking.py

src/swell/tasks/build_jedi.py

src/swell/tasks/jedi_c_test.py

Dooruk · 2026-02-05T20:59:28Z

Wow, this is an impressive amount of change. I really appreciate the documentation; it would be quite confusing without it.

When I first started using SWELL, I was able to include config variables inside the task scripts without the task_questions.yaml and suite_questions.yaml interference. Explaining that concept to new users has always been challenging. Overall, I'm happy with the vision and design decisions here. The previous method made it hard to onboard new developers; I hope this will simplify the SWELL entry point.

Overall, I'm happy with these changes. In the previous workflow templating iteration, flow.cylc handling was really confusing, and this approach makes it more straightforward.

Abandoning the single task_questions and suite_questions files is a welcome change. Users would also see that the QuestionDefaults class comes from utilities, so the multi-file ambiguity is inherently handled.

Since we have a limited number of seasoned users, I would be curious to hear from Yonggang and Maryam too, once this is ready for that stage.

Some minor questions/details/nitpicks:

Is this ready for testing yet?
How can workflow.py handle scripts that are not specified under \tasks? Is this PR limiting us to Python-only scripts such as:

(More of a general comment but going forward) Could this new method handle, say, bash scripts?
Or calling outside scripts directly, like so:

    [[RunGeos]]
        script = "{{experiment_path}}/GEOSgcm/forecast/gcm_run.j"
        platform = {{platform}}
        [[[job]]]
            shell = /bin/csh

Model specific tasks organized under \tasks\model, following Michael's PR.

The documentation states that the [scheduling] section of flow.cylc remains the same. Does that mean that part is still meant to handle conditional tasks and loops, since Jinja templating parses it?
What are "messaging parameters"?
I'm going to sound very pedantic and annoying, but the is_model switch might be better named model_specific or model_dep? Considering how widespread this switch is, I can live without this change :) In a similar vein, time_limit -> task_time_limit.

mranst · 2026-02-05T22:01:16Z

@Dooruk Thanks! To answer your questions:

Yes, this should be ready for testing. It's been a week or two, but this PR has previously passed tier1 and tier2.
The runtime section generation is geared to swell tasks for convenience, but you can easily run any kind of script by overriding the TaskSetup object. Your example would look like this (you can just put this in the workflow.py file and reference it there; it doesn't need to be its own task):

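class GcmRunJ(TaskSetup):
    def set_attributes(self):
        # Name of the runtime section in flow.cylc
        self.base_name = "RunGeos"
        # Call the external script directly instead of a swell task
        self.script = "{{experiment_path}}/GEOSgcm/forecast/gcm_run.j"
        # Empty slurm settings for this task
        self.slurm = {}
        # Extra nested section that sets the job shell
        self.additional_sections.append(
            self.create_new_section('job', {'shell': 'csh'})
        )

class Workflow:
...

    def set_tasks(self) -> list:
        self.tasks.append(GcmRunJ)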
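For orientation, the nested text that a helper like create_new_section would need to emit can be sketched in a few lines. The function `format_section` below is hypothetical and is not swell's implementation; it only reproduces the cylc convention, visible in the RunGeos example earlier in the thread, that nesting depth is expressed by the number of square brackets.

```python
def format_section(name: str, settings: dict, depth: int = 2, indent: int = 4) -> str:
    """Format one cylc-style section header plus its key/value settings.

    depth=2 gives [[name]] (a runtime task); depth=3 gives [[[name]]]
    (a subsection nested inside that task).
    """
    pad = " " * (indent * (depth - 1))
    lines = [f"{pad}{'[' * depth}{name}{']' * depth}"]
    for key, value in settings.items():
        # Settings sit one indent level deeper than their section header.
        lines.append(f"{pad}{' ' * indent}{key} = {value}")
    return "\n".join(lines)


# Rebuild the shape of the RunGeos example: a task with a nested job section.
runtime_text = "\n".join([
    format_section("RunGeos", {"platform": "{{platform}}"}),
    format_section("job", {"shell": "/bin/csh"}, depth=3),
])
print(runtime_text)
```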

For development purposes, there's also no reason you couldn't just add the section in plain text to the scheduling

swell/src/swell/suites/3dvar/workflow.py

Lines 101 to 107 in dabe9b2

        # Task defaults
        # -------------
    '''  # noqa

    # --------------------------------------------------------------------------------------------------

^ right here, for instance.

Basically nothing is different in the scheduling section, cylc still handles it in the exact same way once flow.cylc is generated.

In all workflows I maintained that the scheduling section is templated using jinja, like so:

swell/src/swell/suites/3dvar/workflow.py

Lines 111 to 122 in dabe9b2

    def get_workflow_string(self):
        workflow_str = self.default_header()
        workflow_str += template_string_jinja2(logger=self.logger,
                                               templated_string=template_str,
                                               dictionary_of_templates=self.experiment_dict,
                                               allow_unresolved=True)
        for task in self.tasks:
            workflow_str += task.runtime_string(self.experiment_dict,
                                                self.slurm_external)
        return workflow_str

So that section of the file construction still uses the same Jinja logic. But technically, you can define the get_workflow_string method any way you want, so there's room to get more advanced here if we wish.
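The allow_unresolved=True argument in the excerpt suggests that placeholders without a value are left in the text rather than raising an error. A standard-library sketch of that idea follows; `render_partial` is a hypothetical helper that handles only simple `{{name}}` placeholders, not swell's template_string_jinja2 and not a Jinja replacement, and the platform value is made up for illustration.

```python
import re

# Matches simple placeholders of the form {{name}} or {{ name }}.
_PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def render_partial(template: str, values: dict) -> str:
    """Substitute known placeholders and leave unknown ones untouched."""

    def _substitute(match: re.Match) -> str:
        key = match.group(1)
        # Unknown keys keep their original placeholder text so that a later
        # rendering pass can still resolve them.
        return str(values[key]) if key in values else match.group(0)

    return _PLACEHOLDER.sub(_substitute, template)


section = (
    "[[RunGeos]]\n"
    '    script = "{{experiment_path}}/GEOSgcm/forecast/gcm_run.j"\n'
    "    platform = {{platform}}\n"
)
# Only platform is known here; experiment_path stays as a placeholder.
rendered = render_partial(section, {"platform": "example_platform"})
print(rendered)
```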

Messaging parameters are the events for which cylc will email the user if the suite is launched with the -m flag. The relevant ones are failed and submit-failed. Tasks that are designed to fail normally only have submit-failed as an option, so as not to annoy the user. Suites can also have succeeded, which might be relevant as well.
Good call, I'll rename them.

mranst added 30 commits May 1, 2025 18:36

Initial commit

933b142

adding workflow string

d46b22f

working

e1c3816

fixing section formatting

e3e7d74

redo task questions

3788c9c

update task questions

f661549

update task questions again

4d94820

mostly working

6e5fa8d

Functional, fixing indentation

b69e974

pycodestyle fixes

71370dd

Fix slurm formatting in runtime

d0d6a1c

Simplify task runtime definitions

65d97a3

remove print statement

c1bb1fc

Fix slurm setting

0ec6316

Remove duplicate import

d584c5d

Remove setter methods

21828f4

Simplify formatting

d00d382

Simplify workflows and suite configs

b699e87

Clean up and introduce question order test

5be480c

Initial commit

7a88864

Merge branch 'se/mranst/cylc_templating' into feature/mranst/workflow…

d051a2d

…s_for_comparison

remove non-task tasks from tasks

78303bd

Merge branch 'se/mranst/cylc_templating' into feature/mranst/workflow…

c3edeea

…s_for_comparison

add test for jedi

540e088

add compare command for jedi

f9b5cd5

Add workflow pause and event status emails

9c307f0

Add message on pause and fix to messaging

375b40f

pycodestyle fix

b38e0ac

Update slurm code test

eb83c5b

added some suites

f92d4c6

mranst added 2 commits January 20, 2026 17:28

code test fix

a9007b2

Code test fix

8cb8012

mranst added 2 commits January 20, 2026 17:52

fix for jedi_bundle

901bb5a

Increase time limit for build_jedi

c20689b

shiklomanov-an requested changes Jan 27, 2026

mranst added 13 commits January 27, 2026 13:46

cast to list

691444b

Fix override bug and use ruamel

275647b

Type hint fixes

f4a27c7

Add docstrings

613e31d

Remove flow.cylc

9ebf6fd

Add type annotations and remove registry

f745458

add comments for local imports

cb3753c

Update adding a suite docs

537afb6

Resolve override dictionary

2b15bd4

Code test fixes

0a92f44

Merge branch 'develop' into se/mranst/workflow_templating_redux2

3d15d48

Code test fixes

90ded9a

Merge branch 'develop' into se/mranst/workflow_templating_redux2

8476c87

Dooruk mentioned this pull request Jan 29, 2026

Use ruamel exclusively #682

Merged

mranst added 2 commits January 29, 2026 15:09

Merge branch 'develop' into se/mranst/workflow_templating_redux2

a6fd35a

Merge branch 'develop' into se/mranst/workflow_templating_redux2

cdddd54

Dooruk mentioned this pull request Feb 4, 2026

Allow for model-differentiated tasks #626

Merged

mranst added 2 commits February 5, 2026 15:14

Fix for ensemble

d094ac1

Merge branch 'develop' into se/mranst/workflow_templating_redux2

dabe9b2

mranst mentioned this pull request Feb 5, 2026

Search for model-specific tasks #698

Open

Refactor names of time_limit and is_model

d72b275

Dooruk mentioned this pull request Feb 12, 2026

breaking down PR/660 into smaller pieces -- part3, adding hofx_cf experiment #694

Merged

mranst wants to merge 216 commits into develop from se/mranst/workflow_templating_redux2


6 participants
