World ICT News | Statistics: Skewness and Kurtosis

Skewness in Statistics.

Skewness is used to determine how asymmetrical a distribution is. It tells you whether your data leans toward one side of the mean or the other.

The mean, median, and mode are all equal in a perfectly normal distribution. The curve is symmetrical on both sides. However, most real-world data isn’t perfectly balanced. The values are concentrated at one end, so the tail is pulled towards the other end. Skewness measures that "pull".

A skewness value of zero indicates a perfectly symmetric distribution
Positive values point to a right-leaning tail
Negative values point to a left-leaning tail

The further the value moves from zero, the more asymmetric your data is.

Skewness is important because it directly affects the interpretation of the mean. In a skewed distribution, the mean is pulled towards the tail, and it no longer accurately represents a typical value. Early awareness of the meaning helps you select appropriate statistical methods and avoid drawing conclusions unsupported by the data.

Kurtosis Definition

As skewness tells you of the direction of the lean in your data, kurtosis meaning tells you: "how sharp or how flat the peak of your distribution is, and how much the weight lies in the tails."

To be more precise, kurtosis is used to understand how many extreme values you have, relative to a normal distribution. A high-peaked, sharp-tailed distribution has very different behavior from a broad and flat one, even if both have the same mean and standard deviation. That difference is captured by kurtosis.

The kurtosis value of a normal distribution is 3 and is used as the reference point. Some analysts use excess kurtosis, obtained by subtracting 3 from the raw value, so that the scores of the normal distribution are zero, making comparisons easier to interpret.

When kurtosis is high, you are more likely to find extreme values in your data
When it is low, the distribution has lighter tails, meaning fewer extreme values

This makes kurtosis particularly useful in areas such as finance and risk analysis, where knowing the likelihood of extreme outcomes is not only helpful but necessary.

Did you know? Data is typically considered approximately normal only when skewness and excess kurtosis both fall between -1 and +1. (Source: PMC, Descriptive Statistics and Normality Tests)

Types of Skewness

There are three types, and the distinction is straightforward.

1. Positive Skewness (Right-Skewed)

In a positively skewed (right-skewed) distribution, most values are concentrated on the left side, while the tail extends toward the right. Because of the long right tail, the mean is typically greater than the median, and the median is greater than the mode.

In this distribution, Mean > Median > Mode

Skewness_And_Kurtosis_2.

Figure: Positive Skewness

2. Negative Skewness (Left-Skewed)

In a negatively skewed, or left-skewed, distribution, most data points are concentrated toward the right side, while the tail extends toward the left. Because of this longer left tail, the mean is typically less than the median, and the median is less than the mode.

In this distribution, Mode > Median > Mean

Skewness_And_Kurtosis_3

Figure: Negative Skewness

3. Zero Skewness (Symmetric)

A perfectly balanced distribution on either side gives a zero skewness. Mean, median, and mode are all equal, and neither tail is heavier than the other. This is the normal distribution in its ideal form.

Types of Kurtosis

Kurtosis has three types, based on the value.

1. Leptokurtic (Positive Kurtosis)

A leptokurtic distribution is one with a kurtosis greater than 3.

Leptokurtic (Positive Kurtosis)

It has a tall, sharp peak and thick tails; that is, data is extremely concentrated around the mean; however, when extreme values occur, they may be important. There is a higher likelihood of outliers in this type than in the normal distribution.

2. Platykurtic (Negative Kurtosis)

Platykurtic distributions have a kurtosis value below 3.

Platykurtic (Negative Kurtosis)

The peak is flatter and wider, and the tails are thinner. Here, the data are more widely distributed, with fewer extreme values. The distribution is nearly stretched compared to a normal curve. A good example of platykurtic behavior is a uniform distribution.

3. Mesokurtic (Kurtosis = 3)

Mesokurtic Kurtosis

This is the baseline, the normal distribution itself. The kurtosis of a mesokurtic distribution is exactly 3, with balanced peaks and tails, which is taken as the reference point for comparing leptokurtic and platykurtic distributions.

Did You Know? A 2025 study found that the power and reliability of normality tests vary substantially with skewness and kurtosis, especially in small samples. (Source: Springer Link, BMC Medical Research Methodology, ‘as of Sep 2025’.)

Formula for Skewness and Kurtosis

Having understood the meaning of skewness and kurtosis in principle, the next step is to learn how to calculate them. The formula for skewness and kurtosis is a direct result of the concepts; both of them represent the shape of a distribution in the form of a number that you can calculate, compare, and operate on.

Skewness Formula

Pearson's skewness coefficient is the most commonly employed, and it exists in two forms:

Pearson's First Coefficient of Skewness = (Mean - Mode)/Standard deviation
Pearson's Second Coefficient of Skewness = 3(Mean - Median)/Standard deviation

When the mode is unclear or unstable, as with continuous data, the second coefficient is usually used. The two formulas are used to determine the distance between the mean and the center of the distribution, normalized by the standard deviation, to make the outcome similar across data sets.

Interpreting skewness values:

-0.5 to 0.5 → Approximately symmetric
-1 to -0.5 or 0.5 to 1 → Moderately skewed
Less than -1 or greater than 1 → Highly skewed

Kurtosis Formula

The kurtosis formula quantifies how steep the peak is and the weight of the tails when compared to a normal distribution:

K = [Σ(X - X̄)⁴ / n] / s⁴

Where:

X = each data point
Xˉ= mean of the dataset
n = number of data points
s = standard deviation

This gives you the raw kurtosis value, where 3 is the baseline for a normal distribution. Practically, excess kurtosis has often been used by many analysts, which is computed as:

Kexcess = K-3

This is just a simple re-centering of the scale so that a normal distribution assigns a score of zero, making it easy to quickly determine whether the distribution has heavier or lighter tails than normal.

When excess kurtosis is positive, tails are heavier,
When it is negative, they are lighter.

Difference Between Skewness and Kurtosis

Dimension	Skewness	Kurtosis
What it measures	Asymmetry of the distribution	Peakedness and tail weight
Core question answered	Which direction does data lean?	How extreme are the tails?
Reference value	0 (perfectly symmetric)	3 (normal distribution)
Positive value means	The right tail is longer	Sharper peak, heavier tails (Leptokurtic)
Negative value means	The left tail is longer	Flatter peak, lighter tails (Platykurtic)
Effect on the mean	Mean is pulled toward the tail	Mean may remain centered, but outliers increase
Typical use case	Detecting directional bias in data	Detecting outlier-proneness and tail risk
Real-world example	Income distribution, exam scores	Stock market returns, insurance claims

The key takeaway is that kurtosis and skewness complement each other. Skewness describes the direction of distortion in a distribution, and kurtosis describes the degree to which the distribution's tails extend. Both are required to see the whole picture.

Why Skewness and Kurtosis Matter in Statistics?

Mean and standard deviation are great starting points; however, they don't give you all the information about your data. What those summary statistics lack is filled in by skewness and kurtosis in statistics.

Here's why they matter in practice:

They expose when the mean is misleading. The mean in a skewed distribution is pulled towards the tail. When you are making decisions using it, budgeting, performance, and risk estimation, skewness in statistics informs you on whether the mean is actually credible.
They reveal outlier risk. High kurtosis indicates heavier tails, meaning extreme values are more likely than a normal distribution would suggest. Many financial models that failed during market downturns did so because they assumed normality and overlooked this entirely.
They determine which statistical tests are valid. Tests such as t-tests, ANOVA, and linear regression assume approximate normality. Once skewness or kurtosis deviates significantly, those assumptions fail, and so do your results.
They're essential in machine learning. Highly skewed features can distort model training. Checking and correcting skewness before modeling is a standard preprocessing step that directly affects performance.

Skewness and Kurtosis Quick Diagnosis Checklist

Before running any statistical analysis, run through this:

Calculate the skewness value:

Between -0.5 and 0.5 → proceed normally
Between ±0.5 and ±1 → consider median over mean
Beyond ±1 → apply log or square root transformation before analysis

Calculate kurtosis value:

Close to 3 (excess ≈ 0) → distribution is normal, proceed
Above 3 → flag for outlier review before modeling
Below 3 → data is spread flat; verify test assumptions still hold

Final Check:

If both deviate significantly → avoid t-tests, ANOVA, and standard regression without adjustment

Examples of Skewness and Kurtosis

Real-world data is rarely textbook-perfect, and these examples of kurtosis and skewness show up across different fields.

Income Distribution: Positive Skewness

The majority of people earn less than the national average, and a small group of very high earners lies much farther to the right in the tail.

This pulls the mean upward, rendering it an inaccurate depiction of average income. It's why median income is a more honest benchmark.

Exam Scores: Negative Skewness

When an exam is straightforward, most students score high and only a few score very low. Data clusters toward the upper end with a long left tail, a clean example of negative skewness that teachers encounter regularly.

Stock Market Returns: Leptokurtic

Daily returns have a small average, though large gains or losses are much more frequent than they would be according to a normal distribution.

These include the so-called "fat tails," which are characteristic of high kurtosis and are precisely why conventional risk models tend to underestimate the likelihood of a market crash.

Rainfall Data: Platykurtic

In places where seasonal rains occur regularly, there are no peaks or extreme variations in monthly rainfall. This flat, wide distribution is typical platykurtic behavior, with kurtosis below 3 and no surprises at either end.

Manufacturing Quality Control: Zero Skewness

A well-functioning production process keeps measurements like component dimensions or fill weights symmetrically centered around a target value. Skewness near zero means the process is on track. Any drift signals something's gone wrong.

Key Takeaways

Kurtosis and skewness are not limited to the mean; they describe the true shape of a distribution, making them essential for any honest data analysis
Skewness tells you which direction your data leans; kurtosis tells you how extreme the tails are. You need both to understand a distribution fully
High kurtosis or heavy skewness may invalidate other commonly used statistical tests, making these checks a non-negotiable step before any analysis
In practice, in fields such as finance, healthcare, and machine learning, these two measures are directly involved in decision-making, including risk assessment and model preprocessing

Statistics: Skewness and Kurtosis