Open
Description
- Boxplot for segmentation outliers
- Residuals for tree methods (check for missing variables)
- Single-tree visualisation on a two-dimensional split, so that different tree methods can be compared
- Update `.travis.yml` from the current C template to the R template
- EDA report
- Quick XGBoost run to find important variables
- Quick anomaly detection run
Done:
- Need a function to visualise the interactions or correlations: can use the `networkD3` package for the network plot, or use `DiagrammeR`, `igraph` and `vivagraph` (http://www.buildingwidgets.com/blog/?offset=1435611529118). [3D plot for 2-way interactions #26]
- Code efficiency, through either `Rcpp` or cleaner and more elegant coding in R
- Different residuals can be used on the residual plot (e.g. Studentized, Pearson, Cook's, etc.) [different residual function for resiPlot #52]
- Deviance residual vs predicted target value [different residual function for resiPlot #52]
Not Now:
- Correlation matrix calculation for a numerical matrix or data frame, or a quicker way to calculate correlations on big data. [correlation calculation on numerical dataset #24]
- Distributed / parallel calculation
- Need a model comparison function for all ML methods. A structure or template is needed to conduct this model comparison. A `PMML` structure would be a good starting point, even though not all ML packages support it. Or use the `caret` package straight away.
- Which terms in the model have been affected by new factors, and in what way?
  - Coefficient
  - Changes by level (vis)
  - Is it because of correlation?
  - What has this new factor explained?
  This is more like feature selection, and may be too specific to the glm / lm method.
- Trends analysis:
  - Within the model, to assess the consistency
  - Between datasets at different times, to assess the development
  - There should be a function to provide this feature. A starting point is the `AbnormalDetection` package and the `changepoint` package.
  This is more time-series analysis. Cannot find specific / achievable criteria.
- Histogram of deviance residual (symmetry). (REASON: currently `resiPlot` has a contour plot for this.)
- Deviance residual (sqrt(weighted deviance) * sign(a-e)). (REASON: different residual function for resiPlot #52 provides an open slot for this.)
- Deviance plot for outlier visualisation. (REASON: currently `resiPlot` has a contour plot for this.)
- Support multinomial analysis. (REASON: no urgent need.)
- `RcppParallel` for the `CramersV` function. (REASON: no urgent need, and parallel overheads may have an inverse effect.)
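
As a reading aid for the deviance residual item in the list above, the expression sqrt(weighted deviance) * sign(a-e) can be written out as below. This is only a sketch under assumptions the issue does not state: `a_i` is taken to be the actual (observed) value, `e_i` the expected (fitted) value, `w_i` the observation weight and `d_i` the unit deviance. The Poisson form of `d_i` is one illustrative choice, not necessarily the one used by `resiPlot`.

```latex
% Assumed notation (not defined in the issue):
%   a_i = actual value, e_i = expected value,
%   w_i = observation weight, d_i = unit deviance of observation i.
r_i^{D} = \operatorname{sign}(a_i - e_i)\,\sqrt{w_i \, d_i(a_i, e_i)}

% Illustrative unit deviance for a Poisson model (an assumption),
% with the convention a_i \log(a_i / e_i) = 0 when a_i = 0:
d_i(a_i, e_i) = 2\left[\, a_i \log\frac{a_i}{e_i} - (a_i - e_i) \right]
```

With this definition the squared residuals sum to the total weighted deviance, $\sum_i (r_i^{D})^2 = \sum_i w_i d_i$.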