Data versioning tests #52

Merged
Blazoned merged 74 commits into main from data_versioning_tests on Sep 12, 2025

@maartenjv
Collaborator

No description provided.

maartenjv and others added 22 commits November 29, 2024 15:00
Added an Active Learning module based on the BADGE (Batch Active
Learning by Diverse Gradient Embeddings) strategy.
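
For readers unfamiliar with BADGE, the selection step can be sketched roughly as follows. This is a minimal NumPy illustration, not the plugin's actual code: the function names `gradient_embeddings` and `kmeanspp_select` are hypothetical, and it assumes a softmax classifier whose class probabilities and penultimate-layer features are already available as arrays. The first center is taken to be the largest-norm embedding, which is one common choice; picking it at random is the textbook k-means++ alternative.

```python
import numpy as np


def gradient_embeddings(probs: np.ndarray, features: np.ndarray) -> np.ndarray:
    """Last-layer gradient embeddings using the predicted class as pseudo-label.

    probs:    (n, K) softmax outputs for the unlabeled pool (assumed input)
    features: (n, d) penultimate-layer activations (assumed input)
    returns:  (n, K * d) embeddings
    """
    n, _ = probs.shape
    pseudo = np.zeros_like(probs)
    pseudo[np.arange(n), probs.argmax(axis=1)] = 1.0
    # Cross-entropy gradient w.r.t. the last-layer weights is (p - y) outer h.
    scale = probs - pseudo
    return (scale[:, :, None] * features[:, None, :]).reshape(n, -1)


def kmeanspp_select(embeddings: np.ndarray, batch_size: int,
                    rng: np.random.Generator) -> list:
    """Pick a diverse batch of row indices with k-means++ style seeding."""
    n = embeddings.shape[0]
    batch_size = min(batch_size, n)
    # Start from the embedding with the largest norm (most uncertain point).
    chosen = [int(np.argmax(np.linalg.norm(embeddings, axis=1)))]
    min_sq_dist = np.sum((embeddings - embeddings[chosen[0]]) ** 2, axis=1)
    while len(chosen) < batch_size:
        total = min_sq_dist.sum()
        if total <= 0:
            # Every remaining point coincides with a chosen one: take any unchosen index.
            chosen.append(next(i for i in range(n) if i not in chosen))
            continue
        # Sample proportionally to the squared distance to the nearest chosen point.
        idx = int(rng.choice(n, p=min_sq_dist / total))
        chosen.append(idx)
        new_sq_dist = np.sum((embeddings - embeddings[idx]) ** 2, axis=1)
        min_sq_dist = np.minimum(min_sq_dist, new_sq_dist)
    return chosen
```

The returned indices would be the data points offered for manual annotation; already chosen points have zero distance to themselves and are therefore never drawn twice.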

The PR includes: 

- Active learning plugin 
- Adjustments to the CHIMP frontend and datastore logic to support
manual annotation of selected data points.
- Extended backend routes 
- A scheduler that monitors labeled datapoints and triggers automatic
retraining when a threshold is reached.
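
The threshold-based retraining described in the last bullet could look roughly like the sketch below. It is an illustration under assumptions, not the scheduler from this PR: `RetrainTrigger`, `count_labeled` and `retrain` are hypothetical names, and the real scheduler may count or poll differently.

```python
from typing import Callable


class RetrainTrigger:
    """Calls `retrain` once enough new labels have arrived since the last run."""

    def __init__(self, count_labeled: Callable[[], int],
                 retrain: Callable[[], None],
                 threshold: int, baseline: int = 0) -> None:
        if threshold <= 0:
            raise ValueError("threshold must be a positive integer")
        self._count_labeled = count_labeled  # e.g. a datastore query (assumed)
        self._retrain = retrain              # e.g. a call to the training service (assumed)
        self._threshold = threshold
        self._count_at_last_run = baseline

    def check(self) -> bool:
        """Retrain if the number of newly labeled datapoints reached the threshold."""
        current = self._count_labeled()
        if current - self._count_at_last_run < self._threshold:
            return False
        self._retrain()
        # Only labels added after this point count toward the next run.
        self._count_at_last_run = current
        return True
```

A periodic job would call `check()` at a fixed interval. Tracking the count at the last run, rather than the absolute total, keeps the trigger from firing on every poll once the threshold has been passed.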

**Issues:**

- Automatic retraining based on newly labeled datapoints currently fails
due to inconsistent preprocessing between the loaded image data and the
model input format.
_Error: Invalid input shape for input Tensor("data:0", shape=(64,),
dtype=float32). Expected shape (None, 96, 96, 3)_
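
The error suggests the model received a one-dimensional array of 64 elements instead of a batch of 96x96 RGB images. The root cause is not identified in this PR; as a hedged sketch, a guard like the one below would surface the mismatch before the data reaches the model. The helper name `to_model_batch` is hypothetical, and only the expected shape is taken from the error message.

```python
import numpy as np

# Per-image shape taken from the error message: (None, 96, 96, 3).
EXPECTED_IMAGE_SHAPE = (96, 96, 3)


def to_model_batch(images) -> np.ndarray:
    """Stack images into a float32 batch of shape (N, 96, 96, 3).

    Raises ValueError naming the first image whose shape does not match,
    instead of letting the mismatch surface inside the model call.
    """
    batch = []
    for index, image in enumerate(images):
        array = np.asarray(image, dtype=np.float32)
        if array.shape != EXPECTED_IMAGE_SHAPE:
            raise ValueError(
                f"image {index} has shape {array.shape}, "
                f"expected {EXPECTED_IMAGE_SHAPE}"
            )
        batch.append(array)
    if not batch:
        return np.empty((0, *EXPECTED_IMAGE_SHAPE), dtype=np.float32)
    return np.stack(batch)
```

The sketch deliberately does no resizing or pixel scaling; whatever preprocessing the original training pipeline applies would have to be repeated on the newly labeled images before this check.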
@maartenjv maartenjv self-assigned this Aug 28, 2025
@maartenjv maartenjv linked an issue Aug 28, 2025 that may be closed by this pull request
@maartenjv maartenjv assigned maartenjv and unassigned maartenjv Aug 28, 2025
maartenjv and others added 2 commits August 29, 2025 13:00
Added a .gitignore rule that ignores files inside the logs folder while keeping the folder itself in the repository.
maartenjv and others added 21 commits September 10, 2025 15:17
Added a .gitignore rule that ignores files inside the logs folder while keeping the folder itself in the repository.
Removed the logs file, as logs aren't supposed to be pushed to git.
Keeping the logs folder as a safety precaution, as it is used for logging somewhere in the process. The source still needs to be determined, but it is related to the front-end.
Added a to-do for future reference: a VS Code profile to help with productivity. Not urgent, just a non-breaking nice-to-have.
Notebook file wasn't intended to be included.
Notebook file wasn't intended to be included.
- Added constraint file
- Added requirements install for plugins in docker files
- Added constraint file to pip install call
- Reset requirements files (except the extra protobuf install)
Updated numpy constraint to 1.24.3.
Move init scripts to separate folder.
Create requirements and constraints for init.
Reorganise init script
Fix import init script
Delete logs
Bug in main.js for retrieving username
Changed tensorflow version in constraints
Add debug info for image recognition
Disable frontend buttons that are not implemented yet.
This solves the problem that images weren't processed due to an array error.
@Blazoned
Collaborator

Alright @LegeDoos, the conflicts should be resolved. I'd still like to double-check that everything works, beyond the check reporting that it passed, but the rebase shouldn't have changed anything.

@Blazoned Blazoned requested a review from LegeDoos September 10, 2025 13:31
@LegeDoos
Collaborator

> Alright @LegeDoos, the conflicts should be resolved. I'd still like to double-check that everything works, beyond the check reporting that it passed, but the rebase shouldn't have changed anything.

Thanks, I will look into it soon :)

Collaborator

@LegeDoos LegeDoos left a comment

In training/app/datastore.py there is a duplicate line at line 498. This doesn't break the system, so I will not block the review and we can close this PR. I will remove the line in the next PR with manual changes.

@Blazoned Blazoned merged commit 07cf570 into main Sep 12, 2025
1 check passed
@LegeDoos LegeDoos deleted the data_versioning_tests branch September 15, 2025 14:02

Development

Successfully merging this pull request may close these issues.

Training options

5 participants