[compat] wandb by asaiacai · Pull Request #49 · Trainy-ai/pluto

asaiacai · 2026-02-12T03:54:04Z

Adds pluto.compat.wandb module that lets users replace import wandb with import pluto.compat.wandb as wandb to route all logging through pluto with minimal code changes.

Module structure:

init.py: Module-level API (init, log, finish, watch, config, summary, run)
run.py: Run class wrapping pluto.Op with wandb.Run-compatible interface
config.py: Dict-like Config object that syncs mutations to pluto
summary.py: Dict-like Summary object tracking last-logged values
data_types.py: Wrappers (Image, Audio, Video, Table, Histogram, Html, Artifact, AlertLevel) that convert to pluto equivalents

Key features:

commit=False buffering (accumulate data across log calls)
Nested dict flattening with / separator
wandb env var fallbacks (WANDB_PROJECT, WANDB_MODE, WANDB_TAGS, etc.)
Graceful degradation for unsupported features (define_metric, save, etc.)
Context manager support
Disabled-mode fallback when pluto.init() fails

https://claude.ai/code/session_01VTSZKK5UsMqjiADFX57SMY

Tested (run the relevant ones):

Code formatting: bash format.sh
Any manual or new tests for this PR (please specify below)

Adds pluto.compat.wandb module that lets users replace `import wandb` with `import pluto.compat.wandb as wandb` to route all logging through pluto with minimal code changes. Module structure: - __init__.py: Module-level API (init, log, finish, watch, config, summary, run) - run.py: Run class wrapping pluto.Op with wandb.Run-compatible interface - config.py: Dict-like Config object that syncs mutations to pluto - summary.py: Dict-like Summary object tracking last-logged values - data_types.py: Wrappers (Image, Audio, Video, Table, Histogram, Html, Artifact, AlertLevel) that convert to pluto equivalents Key features: - commit=False buffering (accumulate data across log calls) - Nested dict flattening with / separator - wandb env var fallbacks (WANDB_PROJECT, WANDB_MODE, WANDB_TAGS, etc.) - Graceful degradation for unsupported features (define_metric, save, etc.) - Context manager support - Disabled-mode fallback when pluto.init() fails https://claude.ai/code/session_01VTSZKK5UsMqjiADFX57SMY

gemini-code-assist · 2026-02-12T03:54:26Z

Summary of Changes

Hello @asaiacai, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a new pluto.compat.wandb module designed to simplify the migration of existing Weights & Biases (wandb) logging code to Pluto. By allowing users to replace their import wandb statement with import pluto.compat.wandb as wandb, the module routes all logging operations through Pluto with minimal code modifications. This ensures that projects leveraging wandb's API can easily transition to using Pluto for experiment tracking and data visualization, maintaining compatibility while benefiting from Pluto's backend.

Highlights

Wandb Compatibility Module: Introduced pluto.compat.wandb to enable seamless integration for users migrating from wandb by allowing direct replacement of import statements.
API Emulation: Provides compatible interfaces for core wandb functions like init, log, finish, watch, config, summary, and run, ensuring existing code works with minimal changes.
Data Type Conversion: Includes wrappers for wandb.Image, Audio, Video, Table, Histogram, Html, Artifact, and AlertLevel to automatically convert them to their Pluto equivalents.
Robustness and Usability Features: Implements commit=False buffering for accumulating data, nested dictionary flattening, environment variable fallbacks (e.g., WANDB_PROJECT), graceful degradation for unsupported features, and context manager support for wandb.init().
Error Handling: Incorporates a fallback mechanism to create a disabled run that no-ops all operations if pluto.init() fails during initialization.

Changelog

pluto/compat/wandb/init.py
- Added the main compatibility module, exposing core wandb API functions and managing module-level state for run, config, and summary.
pluto/compat/wandb/config.py
- Added a Config class that mimics wandb.config, supporting dict-like and attribute access, and syncing changes to Pluto's underlying operation.
pluto/compat/wandb/data_types.py
- Added wrapper classes for various wandb data types (Image, Audio, Video, Table, Histogram, Html, Graph, Artifact, AlertLevel) to facilitate their conversion into Pluto's native data types.
pluto/compat/wandb/run.py
- Added a Run class that acts as a wandb.Run-compatible wrapper around a pluto.Op, handling logging, configuration, and run lifecycle events.
pluto/compat/wandb/summary.py
- Added a Summary class that emulates wandb.summary, automatically tracking scalar metrics from log calls and allowing manual overrides.
tests/test_wandb_compat.py
- Added comprehensive unit tests to verify the functionality and compatibility of the new wandb compatibility layer, covering Config, Summary, DataTypes, Run, and module-level API interactions.

Activity

No human activity (comments, reviews) was detected for this pull request.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a wandb-compatible layer for Pluto, allowing users to easily switch from wandb to pluto with minimal code changes. The implementation covers core API functions like init, log, finish, and various data types. The module handles environment variable fallbacks and gracefully degrades for unsupported features, which is a good design choice for compatibility. The accompanying tests provide good coverage for the new functionality. I've identified a few areas for improvement related to redundant logic, unused parameters, and potential data handling nuances that could enhance correctness and maintainability.

gemini-code-assist · 2026-02-12T04:10:48Z

pluto/compat/wandb/data_types.py

+
+        if self.np_histogram is not None:
+            # np_histogram is a tuple of (values, bin_edges)
+            return PlutoHistogram(data=self.np_histogram, bins=self.np_histogram)


When np_histogram is provided (which is a tuple of (values, bin_edges)), the _to_pluto() method passes the entire tuple as the bins argument to PlutoHistogram. The PlutoHistogram constructor expects bins to be an integer or a sequence of bin edges. Passing the tuple directly might not correctly extract the bin edges, potentially leading to incorrect histogram representation. It should likely be bins=self.np_histogram[1] to pass only the bin edges.

return PlutoHistogram(data=self.np_histogram, bins=self.np_histogram[1])

gemini-code-assist · 2026-02-12T04:10:49Z

pluto/compat/wandb/data_types.py

+        mode: Optional[str] = None,
+        caption: Optional[str] = None,
+        grouping: Optional[int] = None,
+        classes: Any = None,
+        boxes: Any = None,
+        masks: Any = None,
+        file_type: Optional[str] = None,
+        normalize: bool = True,
+    ) -> None:


Several parameters (grouping, classes, boxes, masks, file_type, normalize) are accepted by the Image constructor but are not stored or used within the class or during conversion to PlutoImage. This can create confusion for users expecting these parameters to have an effect. Consider removing them from the signature or adding a warning if they are provided.

def __init__( self, data_or_path: Any = None, caption: Optional[str] = None, ) -> None:

gemini-code-assist · 2026-02-12T04:10:49Z

pluto/compat/wandb/__init__.py

+    config_dict: Optional[Dict[str, Any]] = None
+    if config is not None:
+        if isinstance(config, dict):
+            config_dict = dict(config)
+        elif hasattr(config, '__dict__'):
+            config_dict = vars(config)
+        else:
+            config_dict = {}
+
+        if config_dict and config_include_keys:
+            config_dict = {
+                k: v for k, v in config_dict.items() if k in config_include_keys
+            }
+        if config_dict and config_exclude_keys:
+            config_dict = {
+                k: v for k, v in config_dict.items() if k not in config_exclude_keys
+            }


The configuration filtering logic could be more robust. If config is None or not a dict/object, config_dict could remain None, leading to errors if config_include_keys or config_exclude_keys are present. It's safer to initialize config_dict to an empty dictionary. Additionally, if both config_include_keys and config_exclude_keys are provided, the order of operations might lead to unexpected results. Consider clarifying the precedence or disallowing both simultaneously.

config_dict: Dict[str, Any] = {} if config is not None: if isinstance(config, dict): config_dict = dict(config) elif hasattr(config, '__dict__'): config_dict = vars(config) if config_include_keys: config_dict = { k: v for k, v in config_dict.items() if k in config_include_keys } if config_exclude_keys: config_dict = { k: v for k, v in config_dict.items() if k not in config_exclude_keys }

gemini-code-assist · 2026-02-12T04:10:49Z

pluto/compat/wandb/__init__.py

+            settings=pluto_settings or None,
+            run_id=run_id,
+        )
+    except Exception as e:


When pluto.init() fails, the exception e is logged directly in the warning message. For better debugging and to capture the full context of the error, it's recommended to log the exception with exc_info=True.

logger.warning('%s: pluto.init() failed (%s), creating disabled run', tag, e, exc_info=True)

gemini-code-assist · 2026-02-12T04:10:49Z

pluto/compat/wandb/config.py

+        # Use object.__setattr__ to avoid triggering our __setattr__
+        object.__setattr__(self, '_op', op)
+        object.__setattr__(self, '_data', {})
+        object.__setattr__(self, '_allow_val_change', True)


The _allow_val_change attribute is initialized but never used within the Config class. Consider removing it if it's not intended for future functionality, to keep the codebase clean.

Suggested change

object.__setattr__(self, '_allow_val_change', True)

object.__setattr__(self, '_data', {})

gemini-code-assist · 2026-02-12T04:10:49Z

pluto/compat/wandb/data_types.py

+        name: Optional[str] = None,
+        checksum: bool = True,
+        max_objects: Optional[int] = None,
+    ) -> 'Artifact':


The checksum and max_objects parameters are accepted by the add_reference method but are not used. Since add_reference is already a no-op, these parameters further highlight unused functionality. They should be removed.

self, uri: str, name: Optional[str] = None, ) -> 'Artifact':

gemini-code-assist · 2026-02-12T04:10:49Z

pluto/compat/wandb/run.py

+        import time
+
+        return time.time()


The start_time property currently returns time.time(), which is the current time, not the actual start time of the run. wandb.run.start_time typically refers to the timestamp when the run was initialized. This property should ideally retrieve the actual run start time from self._op.settings or a similar source to provide accurate information.

Suggested change

import time

return time.time()

@property

def start_time(self) -> float:

return getattr(self._op.settings, 'start_time', time.time())

gemini-code-assist · 2026-02-12T04:10:49Z

pluto/compat/wandb/run.py

+            from pluto.file import Artifact as PlutoArtifact
+
+            art = PlutoArtifact(data=artifact_or_path, caption=name)
+            log_name = name or 'artifact'


When log_artifact is called with a string path and no name is provided, the log_name defaults to 'artifact'. If multiple string paths are logged without explicit names, they will all be logged under the same key, potentially overwriting previous logs or making them indistinguishable. It would be more robust to use the base filename of the path as the default name if name is None.

art = PlutoArtifact(data=artifact_or_path, caption=name) log_name = name or os.path.basename(artifact_or_path)

gemini-code-assist · 2026-02-12T04:10:49Z

pluto/compat/wandb/summary.py

+            if isinstance(v, (int, float)) and not isinstance(v, bool):
+                store[k] = v
+            elif hasattr(v, 'item') and callable(v.item):
+                store[k] = v.item()


The _update_from_log method explicitly excludes boolean values from being stored in the summary (and not isinstance(v, bool)). While booleans are often treated as flags, wandb.summary might include them as scalar values in some contexts. If full wandb.summary compatibility is desired, this exclusion might lead to a slight behavioral difference. Consider if this exclusion is intentional and aligns with wandb's behavior, or adjust to include booleans if appropriate.

Suggested change

if isinstance(v, (int, float)) and not isinstance(v, bool):

store[k] = v

elif hasattr(v, 'item') and callable(v.item):

store[k] = v.item()

for k, v in data.items():

if isinstance(v, (int, float)):

store[k] = v

elif hasattr(v, 'item') and callable(v.item):

store[k] = v.item()

gemini-code-assist · 2026-02-12T04:10:49Z

tests/test_wandb_compat.py

+        op.add_tags.assert_called()
+


The test_tags_get_and_set test only asserts that op.add_tags was called, but it doesn't verify the arguments passed to add_tags or remove_tags. This makes the test less robust. It would be more effective to assert the specific tags that were added and removed.

Suggested change

op.add_tags.assert_called()

# Setting tags

run.tags = ('tag1', 'tag2')

op.remove_tags.assert_called_with(['tag1'])

op.add_tags.assert_called_with(['tag2'])

19 new tests in TestParityContract that pin the exact pluto call sequences for common wandb workflows: - Standard training loop (init → config → log N → finish) - Nested metric namespace flattening (train/loss, val/acc) - commit=False buffering and flush behavior - Duplicate key resolution (later values win) - Explicit step= forwarding - Config mutations (attr, dict, bulk update, argparse namespace) - config_include_keys / config_exclude_keys filtering - Tags lifecycle (init tags, runtime mutation) - Data type conversion in log() (Image, Table, Histogram → pluto) - Summary auto-tracking of last scalar per key - Context manager lifecycle (success and exception exit codes) - reinit finishing previous run - watch/alert forwarding - Full realistic workflow (config+tags+mixed data+summary) - Module state reset after finish - log_artifact call sequence (one op.log per file) https://claude.ai/code/session_01VTSZKK5UsMqjiADFX57SMY

Users can now swap `wandb` for `pluto-ml` in their dependencies and keep `import wandb` unchanged — no source edits needed. How it works: - Top-level `wandb/` package included in pyproject.toml packages list - `wandb/__init__.py` re-exports everything from pluto.compat.wandb - Common submodule stubs so deep imports don't break: - wandb.sdk, wandb.sdk.data_types - wandb.data_types - wandb.plot (no-op stubs for line_series, confusion_matrix, etc.) - wandb.apis (Api stub that raises NotImplementedError on queries) - wandb.util (generate_id, make_artifact_name_safe, to_json) - wandb.integration.lightning (WandbLogger → pluto MLOPLogger) 14 new tests in TestTopLevelWandbPackage verifying all import patterns. https://claude.ai/code/session_01VTSZKK5UsMqjiADFX57SMY

Two ways to compare dashboards side-by-side: 1. pytest-based (tests/test_wandb_visual_parity.py): # Pluto shim side (our wandb package): PLUTO_API_TOKEN=<token> pytest tests/test_wandb_visual_parity.py -k pluto -v -s # Real wandb side (separate venv with pip install wandb): WANDB_API_KEY=<key> pytest tests/test_wandb_visual_parity.py -k real_wandb -v -s 2. Standalone runner (tests/wandb_visual_parity_runner.py): # Same script, auto-detects which backend is installed: PLUTO_API_TOKEN=<token> python tests/wandb_visual_parity_runner.py WANDB_API_KEY=<key> python tests/wandb_visual_parity_runner.py Both run identical training loops (20 epochs, 1000 steps, same seed) with: scalar metrics, nested namespaces (train/, val/), histograms, tables, images, config mutations, summary overrides, and tags. Prints dashboard URLs for visual comparison. https://claude.ai/code/session_01VTSZKK5UsMqjiADFX57SMY

- Fix Histogram._to_pluto() passing full tuple instead of bin edges - Initialize config_dict to {} to prevent None edge cases - Add exc_info=True for better init failure debugging - Remove unused _allow_val_change attribute from Config - Simplify redundant reinit/finish logic in init() - Remove dead run_id reassignment - Record actual start_time at Run init instead of returning time.time() - Use os.path.basename for log_artifact default name - Include booleans in summary (consistent with wandb behavior) - Strengthen test_tags_get_and_set assertion Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Implement define_metric across the full stack: - Op.define_metric() stores definitions and syncs to server (best-effort) - Op.get_metric_definition() with glob pattern support - Summary aggregation (min/max/mean/first/last) in wandb compat layer - Sync process plumbing (RecordType.METRIC_DEF, enqueue, upload, dispatch) - ServerInterface.update_metric_definitions() for direct API calls Also fixes two pre-existing CI failures: - Fix mypy error: __exit__ return type bool -> None in Run - Fix test_table_from_dataframe: add pytest.importorskip('pandas') Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…mpat-igiLd

Enable pluto shim visual parity live test using MLOP_API_TOKEN secret, matching the neptune-compat pattern. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

gemini-code-assist bot reviewed Feb 12, 2026

View reviewed changes

claude and others added 7 commits February 12, 2026 05:10

Merge remote-tracking branch 'origin/main' into claude/pluto-wandb-co…

b66b779

…mpat-igiLd

Add live wandb compat tests to CI workflow

42710b3

Enable pluto shim visual parity live test using MLOP_API_TOKEN secret, matching the neptune-compat pattern. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

asaiacai force-pushed the claude/pluto-wandb-compat-igiLd branch from 2976742 to 42710b3 Compare February 24, 2026 21:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[compat] wandb#49

[compat] wandb#49
asaiacai wants to merge 8 commits intomainfrom
claude/pluto-wandb-compat-igiLd

asaiacai commented Feb 12, 2026

Uh oh!

gemini-code-assist bot commented Feb 12, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

gemini-code-assist bot Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	object.__setattr__(self, '_allow_val_change', True)
	object.__setattr__(self, '_data', {})

-        import time
-        return time.time()
+    @property
+    def start_time(self) -> float:
+        return getattr(self._op.settings, 'start_time', time.time())

-        op.add_tags.assert_called()
+        # Setting tags
+        run.tags = ('tag1', 'tag2')
+        op.remove_tags.assert_called_with(['tag1'])
+        op.add_tags.assert_called_with(['tag2'])

Conversation

asaiacai commented Feb 12, 2026

Uh oh!

gemini-code-assist bot commented Feb 12, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants