Project directory structure refactor #5

Roj · 2019-03-16T16:28:44Z

This PR organizes the project into the following directories:

data - as before
preprocessing - converting data files into text files for word2vec to use
models - word embedding models and recommendation systems
analysis - benchmarking of models and embedding analysis

PlaylistIterator now accepts a parameter to load track metadata (artist data only*). idomaarReader caches this data if it is loaded, so it doesn't have to be loaded on each new instance. Also fixed some bugs ref. to the load of session data.

Before this commit the model wouldn't actually use the metadata.

the iterator now works without hard-coding dataset values or schema. It is less efficient as it now uses dictionaries instead of vectors whether they are better or not. However, this allows one to not to worry about dataset quirks when parsing metadata. The iterator also has a new registry that servers as a lookup table for existing entities, so if a session has some song it just keeps the reference of the existing entity. This also allows metadata to be preloaded into songs and artists. An important change is that all elements are constructed and persisted in cascade, even if you do not use them (users, for example). It might be a good idea to keep a blacklist or whitelist of entity types to save later. For now it's enough.

(it may not be up-to-date)

Roj added 8 commits December 31, 2018 17:24

Iterator that loads metadata for idomaar.

e1a3fbe

PlaylistIterator now accepts a parameter to load track metadata (artist data only*). idomaarReader caches this data if it is loaded, so it doesn't have to be loaded on each new instance. Also fixed some bugs ref. to the load of session data.

w2v_model passes along metadata parameter.

7d5c0bc

Before this commit the model wouldn't actually use the metadata.

Adjusted PlaylistIterator to use new idomaarReader

6a52d97

Restructuring folders.

ac21232

Add model analysis

5eaa54f

(it may not be up-to-date)

Add lastfm changes.

8fe3907

Rearrange lastfm files.

3764e74

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Project directory structure refactor #5

Project directory structure refactor #5

Uh oh!

Roj commented Mar 16, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Project directory structure refactor #5

Are you sure you want to change the base?

Project directory structure refactor #5

Uh oh!

Conversation

Roj commented Mar 16, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants