Maths Study for DS, ML, and NLP A Weekly Plan for MATHS Study
- Week 1: DESCRIPTIVE STATISTICS
- GOALS: Understand and apply the basics of basic summary used to describe datasets
- Measure of Central Tendency
- mean
- median
- mode
- Measure of Dispersion
- Range
- Variance
- Standard Deviation
- Quantile(75th percentile - 25th percentile)
-
Outliers are unusually high or low values in your dataset that differ significantly from most other values.
-
Example:
-
In this list of ages:
- [22, 24, 25, 26, 150] β the 150 is clearly an outlier.
-
The mean (average) increases or decreases significantly if there's an outlier.
-
π Example:
-
Ages:
-
[25, 26, 27, 28, 29] β Mean = 27
-
[25, 26, 27, 28, 100] β Mean = 41.2 β Big change due to one outlier
-
The median is the middle value, so itβs stable even with outliers.
-
π Example:
-
[25, 26, 27, 28, 100] β Median = 27
-
Same median as before!
- Itβs just the most frequent value, so outliers donβt matter here.
-
These measure spread of data.
-
Outliers make the spread appear wider than it actually is.
