Skewness

From Rice Wiki
Revision as of 06:44, 26 April 2024 by Rice (talk | contribs) (Created page with "The '''skewness''' of a dataset determines the direction of the outliers. = Impact = Many models assume the data to be normally distributed. Skewed data in those models will result in inaccurate predictions. = Detection = Data skewness is detected during Exploratory data analysis. The first method is visualization. Just look at a graph lol. Numerically, in a dataset, if the median < the mean, then it is skewed to the right. Vice versa. = Mitigate pr...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

The skewness of a dataset determines the direction of the outliers.

Impact

Many models assume the data to be normally distributed. Skewed data in those models will result in inaccurate predictions.

Detection

Data skewness is detected during Exploratory data analysis.

The first method is visualization. Just look at a graph lol.

Numerically, in a dataset, if the median < the mean, then it is skewed to the right. Vice versa.

Mitigate problems

Skewed data can be transformed to approximate a more symmetric distribution. Examples include logarithmic, square root, and inverse transformations.