Plotting rework 1: use axes, use render_plot everywhere #166

casblaauw · 2025-12-19T16:03:28Z

This PR reworks CREsted's plotting functions to be axis-focused according to a few core principles, inspired by scanpy and seaborn. Just in time for Christmas 😅

Core principles

Working with an ax: Core plotting functions should accept an axis and some data and plot the data on that axis.
- This allows for composite plots and easy adding of extra annotations/elements from the user's side, by far my greatest frustration currently.
  - Some plots (inherently multi-panel plots, clustermaps) are of course exempt.
- If no axis is provided, it should create a sensible-sized plot for the user (as it does now already), returning both the fig and axis (if show=False).
- If multiple values are provided (e.g. predictions from multiple models), it should automatically create a figure with multiple axes (like sc.pl.umap(colors=['var1', 'var2']).
- Labels/titles should also be set on axes rather than figures preferably, especially on single-axis plots. Both because they don't look misaligned like suptitle does, but also because a function should not disturb the larger figure without being explicitly instructed to (through suptitle/sup[x/y]label).
Customizing the plotting function: The underlying plotting functions should be exposed through a plot_kws argument (like seaborn does for complicated functions with e.g. sns.lmplot(scatter_kws={}))
- Default arguments in the plot should also be overridable with this, i.e. color in pl.hist.distribution, without requiring all of them to be separate plotting function arguments (which can become overwhelming).
Unified syntax: All plotting functions should use an identical syntax, aligning with each other and with matplotlib.
- Figure size is now always set with width and height, and is set on plot creation rather than post-hoc resizing.
- All plots should use render_plot unless really not possible, and things like setting axis labels, titles, tick label rotations, etc, should as much as possible be handed off to render_plot by putting things in kwargs in the plotting functions. (This allows users to prevent any change to pre-existing properties by simply setting property=None in the plotting function)
- Things like separate unique figsave arguments are all unified to use save_path in render_plot (or manually if it's one of the few functions that doesn't use it).

High-level summary:

All functions now use render_plot (except a few pl.patterns modisco clustermaps)
Almost all functions now accept an axis to plot their data on, if plotting a single panel.
Almost all functions now accept plot_kws to add and customize the underlying plotting function's arguments.
All functions take width and height to set dimensions, and multi-plot functions also take sharex/sharey.
render_plot now can also set ax-level labels and titles, set x/y lims, and add a grid.
Default axis labels now denote whether you're using log_transform or not.

Complete(ish) changelist:

bar:
- All (normalization_weights, region, region_predictions, prediction) now take plot_kws and all (except region_predictions) also an axis.
- region_predictions now uses region to plot its components
- prediction is cleaned up and made consistent with other functions
- All barplots now use a y-only grid by default, since an x-grid is superfluous with a categorical axis.
heatmap
- All (correlations_self, correlations_predictions) now take an axis and plot_kws.
- Colormap is now customizable.
- Colorbar now has a label to show its units (pearson correlation), indicating log1p-transformation if used.
- Heatmaps are now square (sns.heatmap(square=True)) by default, and default fig size was slightly changed to make it fit a square heatmap + a colorbar well.
hist
- The only function, hist.distribution now takes plot_kws and an optional axis.
- custom argument share_y is now renamed to sharey, as in matplotlib.
- Add nice default axis labels, including denoting log-transformation if used.
- Non-used plots in the plot grid (if plotting multiple classes) are now hidden by default.
locus
- locus.locus_scoring now takes an axis (if only plotting the locus scoring and not the extra bigwig track) and separate plot_kws for both the locus and bigwig plots. Previous custom arguments are now folded into the plot_kws or render_plot kwargs. Highlights can now also be customized with highlight_kws.
- locus.track was expanded from the beta function I implemented at some point.
  - Accepts an axis and plot_kws.
  - Now accepts standard track model outputs and a class_idx, instead of requiring the user to subset dimensions before passing in the data.
  - Automatically creates multiple axes for every class provided.
patterns
- contribution_scores:
  - Now takes an axis (if only plotting one sequence/class). Also now takes width/height, sharex/sharey.
  - Class labels are updated to be consistently at 70% of plot height, rather than at 70% of the positive values (which made them inconsistent if negative values in the data), and at 2.5% of the plot width instead of at x=5 (which is the same at default zoom level 200bp, but can vary if zoom level is changed)
  - Highlights can now be customized with highlight_kws.
- _enhancer_design:
  - enhancer_design_steps_predictions now takes an axis (if plotting one class) and plot_kws. Spelling mistake in the arguments fixed. Now always creates a square grid of plots if supplying a lot of classes, following hist.distributions.
  - enhancer_design_steps_contribution_scores now takes sharex/sharey and highlights can now be customized with highlight_kws.
- _modisco_results : These plots are more convoluted/specific (and I very rarely use them), so I didn't touch them beyond the basics.
  - All functions now take width/height, and the non-clustermap functions now all use render_plot. Clustermap functions now use g.savefig() as recommended by seaborn instead of fig.savefig.
  - clustermap_with_pwm_logos pwm positioning logic was slightly adjusted, since they were all overlapping on my test run. Now they're all neatly aligned and separated in my tests at least.
  - selected_instances now takes an axis if plotting a single index.
  - All clustermaps/heatmaps in this module should now have cmap as an argument.
scatter
- class_density can now be customized more and has better defaults (figsize mostly square with or without colorbar, colorbar off by default, nicer labels)
- class_density now has properly colored and properly ranged colorbar.
violin:
- violin.correlations now takes plot_kws and ax. Label adjusted if using log-transformed data.
render_plot
- Now primarily axis-focused, taking and returning axes, and only disturbing the figure if explicitly asked to. (Fig resizing moved to plot creation, rather than post-hoc, to follow this rule).
- Can now set axis titles, x/ylabels, and limits. Can handle both a single value (applying that to all axes) and a list of values (one per axis).
- Rotated labels now align with their ticks, optimized to some heuristics. Primarily important with longer cell type names.
- [x/y]_*arguments aligned with matplotlib and setting arguments (e.g. xlabel's fontsize is now set by xlabel_fontsize rather than x_label_fontsize, also to prevent supx_label_fontsize which looks weird).
- rotation arguments renamed to [x/y]tick, since x/ylabel refers to the axis labels, not the axis tick labels.
- Can now add a grid with nice defaults (behind data). Works both for single-axis and both axes.
create_plot
- New function to replace plt.subplots calls, shorthand for if ax is not None; fig = ax.get_figure(); else; fig, ax = plt.subplots()

Compatibility

I've endeavored to keep code as reverse compatible as possible.

All renamed arguments still work, and raise a warning on how to use them with the renamed version or new syntax.
If using show=False, render_plot does now return both a fig and ax(s), so code previously doing fig = crested.pl.func(show=False); axs = fig.axes or something similar will have to update to fig, axs = crested.pl.func(show=False).
title as a kwarg now refers to the axis title rather than suptitle; suptitle's now under suptitle. This leads to better titles and nicer plots in 90% of cases, but might need some manual changes if doing multi-panel plots where you expected suptitle.
I've tested all base functions (everything except modisco_results) pretty thoroughly (also adjusting plot_kws, etc), but something might've slipped through.
- for _modisco_results , I tested that all functions at least work with an old CREsted-based modisco run I had lying around, but haven't played with parameters a lot. Did not test the two TF expression-based plots (tf_expression_per_cell_type & clustermap_tf_motif), since I didn't have an elegans TF list available, so anyone testing those is appreciated.

Future work

Add tests for all plotting functions! Was once attempted in Render plot used everywhere #66.
bar.region/bar.prediction automatically also plotting multiple regions?
Add range to contribution_scores to plot on genomic coordinates (like with track())?
Expand track() with stuff from other functions, like center-zoom and highlights from contribution_scores -> see Expand pl.track.locus #161
Look into also using render_plot for clustermaps?
Think about log_and_raise: currently looks bad in notebooks because it duplicates the error message, and makes errors uncatchable with try/except. Not sure what the advantages are.

This is the first (and biggest) part of a plotting overhaul. The next parts will add some new plots, rework plot categorisation, and update all tutorials.

…ys included)

…lot docs

…lor to region_predictions

…ameter

…commented code

casblaauw · 2025-12-19T16:58:03Z

Ah shoot, comma'd arguments in docstrings (like sharex, sharey) currently repeat the same message for both in the docs. I was hoping they'd be shared on one line. I should probably split those out into separate arguments with separate explanations then.

casblaauw · 2025-12-22T15:22:28Z

This should be ready to review. Once uv learns how to install packages again, the tests should pass; they are passing for me locally, at least.

casblaauw added 30 commits December 10, 2025 00:41

render_plot rework 1: make axis-focused, add rotated label alignment

41f2852

Declare scipy dependency (was alr used in violin.correlations(), alwa…

ad058bc

…ys included)

create_plot: add docs, checks, allow multi-plot. also expand render_p…

edfd2dc

…lot docs

render_plot: add axis label/title lists, grid support

f0d13bf

pl.bar axis function rework

08dc7ba

pl.heatmap axis function rework

84d9bcb

Simplify correlations_predictions loops

9ec1532

remove bar.prediction whitespace

41b3a04

correlations_predictions make title adjustable

9d3be90

pl.hist axis function rework

8617bb6

fix wrong axis default in distribution

2795d1c

fix duplicated axis calls in region

6dcec09

render_plot: fix axis label/title lists

fb02827

Add plot_kws & kwargs defaults to docstring + add pred_color/truth_co…

8bf15f0

…lor to region_predictions

Add grid to normalization_weights by default

718611e

Add xlim/ylim to render_plot

27ed198

render_plot: fix grid default alias

b38034a

update bar._region docs

3ae9a46

pl.locus.locus_scoring axis function rework

0636fd3

pl.locus.track expansion and axis rework

445ebfc

render_plot: shrink default suptitle slightly

76b7d2b

pl.scatter.class_density expansion and axis rework

66c2ed7

pl.violin.correlations axis rework

af8c8f0

render_plot: grid default color improvement

9847cbe

add create_plot to docs toctree

d05d8e7

clean up class_density disregarded code

53b5020

Clean up violin.correlations whitespace

22e3e6d

render_plot: always apply label size even if label was not changed

7356971

Add log_transformed to all labels (& add heatmap cbar labels in general)

2c8d4d4

render_plot: Make all xlabel/xtick/etc names consistent w setting par…

65795f1

…ameter

casblaauw added 14 commits December 17, 2025 18:01

render_plot: rename tick-args from label to tick

ed8cd84

pl.patterns.modisco_results axis/render_plot rework

2a2bb6e

pl.patterns._enhancer_design render_plot/ax rework

0bad4ce

Pydocstyle updates

ae16747

contribution_socres: Add explicit multi-plot ax error msg and remove …

8287cc2

…commented code

Fix docstring typo in locus_scoring

6d35866

Add highlight_kws to plots with highlights

5d7455a

Export create_plot

d43b2f8

Fix docstring backtick misses

0543fb2

Add bar.prediction to public docs

2d7a201

Fix matplotlib method docstring references

8f3250c

Fix np array type hint

b8e1033

Fix stupid typo in track type hint

af3cbe7

Add missing plotting output type hints

de235a9

casblaauw added 7 commits December 21, 2025 12:31

Add region name as default title in barplots

ba7bbea

class_density: Rename colorbar to cbar to match seaborn

8d09c48

correlations: Add missed cbar_kws argument to docstring

c1c95f0

Clarify default widths for class_density

968b529

heatmap: make cbar optional and adjust plot widths

170d4c6

Update docstring examples & figures

fadd4af

track: Fix bug ignoring class_idxs and update example

3e07ef0

casblaauw mentioned this pull request Dec 22, 2025

Install failing due to too-early numba versions #168

Closed

casblaauw added 2 commits December 22, 2025 14:43

Remove temp file

7449fd3

Split comma arguments in docstrings

6d07ca5

casblaauw added 4 commits January 5, 2026 10:54

Add ytick_va and ytick_rotationmode for consistency

34f1954

handle figure kwargs popping within create_plot

9f5eb4b

Delete accidental ipynb_checkpoints upload

f0e5812

Merge branch 'main' into plotting_axes

d55af67

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Plotting rework 1: use axes, use render_plot everywhere #166

Plotting rework 1: use axes, use render_plot everywhere #166

Uh oh!

casblaauw commented Dec 19, 2025 •

edited

Loading

Uh oh!

casblaauw commented Dec 19, 2025

Uh oh!

casblaauw commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Plotting rework 1: use axes, use render_plot everywhere #166

Are you sure you want to change the base?

Plotting rework 1: use axes, use render_plot everywhere #166

Uh oh!

Conversation

casblaauw commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Core principles

High-level summary:

Complete(ish) changelist:

Compatibility

Future work

Uh oh!

casblaauw commented Dec 19, 2025

Uh oh!

casblaauw commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

casblaauw commented Dec 19, 2025 •

edited

Loading