Harpo fix for conceptual agent saving #111

harpomaxx · 2025-09-23T12:18:39Z

Description

The model saving was happening inside the testing loop, so every time it hit a save milestone), it was saving the same model 250 times. this fix put this out of the test loop
l

Store the model every --eval_each episodes.

                        # Use episode (training counter) and not test_episode (test counter)
                        if episode % args.store_models_every == 0 and episode != 0:
                            agent.store_q_table(args.models_dir, f'conceptual_q_agent.experiment{args.experiment_id}-episodes-{episode}.pickle')

…ng agent - Add checks for empty actions list in select_action method for both exploration and exploitation branches - Return None action when no valid actions are available instead of crashing - Handle None action in play_game method with graceful episode termination - Fix typo in comment (ation -> action) 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

The model saving logic was incorrectly placed inside the testing loop, causing the same model to be saved 250 times consecutively at each milestone (e.g., every 20,000 episodes). This resulted in: - File size fluctuations as saves were repeatedly overwriting - Extremely long save times due to multiple concurrent operations - Wasted I/O and disk space Moved the save check outside the testing loop to ensure models are saved only once per milestone episode. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>

eldraco and others added 3 commits September 10, 2025 15:50

New way of training the conceptual agent for the 5 rounds

c0bba66

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Harpo fix for conceptual agent saving #111

Harpo fix for conceptual agent saving #111

Uh oh!

harpomaxx commented Sep 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Harpo fix for conceptual agent saving #111

Are you sure you want to change the base?

Harpo fix for conceptual agent saving #111

Uh oh!

Conversation

harpomaxx commented Sep 23, 2025

Description

Store the model every --eval_each episodes.

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants