Application to large sparse design matrix

I have a lasso problem with ~10K samples and ~20K features where most of the entries in the design matrix x are zero.  I can store them in a dgCMatrix sparse matrix and use `glmnet` to fit a lasso while taking advantage of the sparse design matrix.  This is can be over 100x faster than using a standard matrix object.

Is there a way to extend `coef`, `fixedLassoInf` and `estimateSigma` to handle sparse matrices??  
This would involve dealing with the centering and scaling of features, but it seems doable.

Also, `fixedLassoInf` that is *very* slow in the high dimensional setting in both the sparse and non-sparse case.  If I do a pre-filtering step, will the p-values and FDR control still be accurate?
 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Application to large sparse design matrix #51

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Application to large sparse design matrix #51

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions