Skip to content

To Do List #18

@SixiangHu

Description

@SixiangHu
  • boxplot for segmentation outlier
  • residual to tree methods (check variable missing)
  • single Tree visualisation on 2 dimension split, which can compare with different tree methods.
  • Travis.yml update to R template from current c template
  • EDA report
  • Quick Xgboost run for important variable
  • Quick abnormal detection run

Done:

Not Now:

  • Correlation matrix calculation for numerical matrix or data frame. Or find a quicker way for correlation calculation on big data. [correlation calculation on numerical dataset #24]
  • Distributed / parallel Calculation
  • Need a model comparison function for all ML methods. A structure or template is needed to conduct this model comparison. A PMML structure would be a good starting point, even though not all ML packages support it. Or use caret package straight away.
  • which terms in the model have been affect by new factors and in what way?
    1. Coefficient
    2. Changes by level (vis)
    3. Is it because of correlation?
    4. What this new factor has explained?
      This is more like feature selection, and may to specific to glm / lm method.
  • Trends analysis:
    1. Within the model to assess the consistency
    2. Between datasets on different times to assess the development
    3. There should have a function to give this feature. A starting point is the AbnormalDetection package and changepoint package.
      More timeseries analysis. Cannot find specific / achievable criteria.
  • historgram of deviance residual (symmetry). (REASON: currentl resiPlot has contour plot for this.)
  • deviance residual (sqrt(weighted deviance) * sign(a-e)). (REASON: different residual function for resiPlot #52 provides an open slot for this.)
  • deviance plot for outlier visualisation. (REASON: currentl resiPlot has contour plot for this.)
  • support multinomial analysis. (REASON: not urgent need.)
  • RcppParalle for CramersV function (REASON: not urgent need. And parallel overheads may have inverse effect.)

Metadata

Metadata

Assignees

Labels

No labels
No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions