Skip to content

Conversation

@f0k
Copy link
Member

@f0k f0k commented Dec 22, 2025

Adds annotations and artist-aware splits for the Osu2MIR dataset. Code to reproduce the splits is included in split_this.py. It assumes that if an artist name is included in another artist name, it's because they collaborated and should be treated as one artist group. In case Osu2MIR had multiple annotations for a track, I picked the second one when sorted by beatmap ID (heuristic: if somebody bothered to do a second version of a track, then probably because they wanted to improve something, but if there is a third or fourth version, I found that it often changed something to the worse).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants