Skip to content

Empty data and factors after model finished training #37

@dmalzl

Description

@dmalzl

Hi,

I am frequently using your tool to get insight into the biology of my data and I feel its great. So first of all thank you for providing it.
However, I recently tried to replicated the microbiome analysis detailed here. For the sake of convenience I use the interface provided by muon. This worked great previously but I am experiencing some difficulties with the setup described in the microbiome analysis tutorial which in my mind is either the number of groups, the sparseness of the data or a combination of both. The dataset at hand contains 32 individuals over 7 timepoints and we measured 794 species in at least one of these individual x timepoints. In brief, I do the following:

  1. Filtering the data to retain only species which have at least 8 measurements which retains 441 (I did this to retain only species which are present in at least 8 individuals but I just realized this is not what the actual code does anyway)
  2. I then run MEFISTO via the mu.tl.mofa function (as far as I understand the code there is nothing that would filter the data further, I also tested a couple of things and the dataset gets assigned correctly to the model, which is also confirmed by the model displaying all the right messages and even starting and finishing to train without any errors or warnings)

However, mu.tl.mofa throws an error after the model finished training because it fails to load the data and factors back from the saved model. Looking at it manually with mofax the output is the following:

MOFA+ model: species only quarter 10.h5ad
Samples (cells): 0
Features: 441
Groups: 1 (7), 13 (6), 14 (6), 15 (7), 16 (7), 28 (7), 29 (7), 3 (7), 30 (7), 31 (7), 33 (6), 34 (7), 35 (7), 36 (7), 43 (7), 44 (7), 48 (6), 49 (6), 5 (7), 51 (6), 53 (6), 56 (6), 57 (5), 59 (7), 6 (7), 60 (5), 61 (6), 62 (7), 65 (6), 66 (6), 67 (5), 68 (6)
Views: data (441)
Factors: 10
Expectations: Sigma, W, Z

MEFISTO:
Covariates available: time

and when calling trying to get the data or the factors I only get empty arrays.

My question now is why this is happening and if there is a solution to this. This could also well be a muon related problem but for now I think it is a problem of mofapy2 as the model trains and then saves it with empty factors and data.

Thanks in advance for looking into this

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions