Skewness Calculator

Your Data

Select Population if the data contains all measurable values or all values you are interested in.
Select Sample if the data is a sample of a large or unlimited population and you wish to make a statement about the entire population.
Input the data separated by comma, space or enter. To calculate skewness, you need at least three values in your dataset.
Choose the number of decimal places.

Calculating Skewness (Formula)

The Skewness Calculator is a helpful shortcut that speeds up your work. But you can also calculate a population's or a sample’s skewness by hand.

The formulas, however, are slightly different from each other. In practice, sample skewness is more common because it allows us to make inferences about the skew of the underlying population.

The sample skewness formula is the following:

SK=[n(n1)(n2)]i=1n(XiX¯s)3

In this formula, n denotes the number of observations in the sample, X¯ is the sample mean, whereas s is the sample standard deviation.

Meanwhile, population skewness is calculated using the following formula:

γ1=[1N]i=1n(Xiμσ)3

In this formula, N denotes the number of observations in the population, 𝜇 is the population mean, whereas σ is the population standard deviation.

In both equations, the algebraic sign indicates the skew’s direction. A negative sign indicates a negatively skewed distribution, whereas a positive one suggests a positive skew.

How to Calculate Sample and Population Skewness Step by Step

Follow these steps to obtain the skewness of your dataset:

  1. Calculate the sample or population mean of the dataset:

    X¯=i=1nXin

    μ=i=1nXin

  2. Find the sample or population standard deviation of the dataset:

    s=i=1n(XiX¯)2n1

    σ=i=1n(Xiμ)2n

  3. Standardize each data point by subtracting the mean and dividing by the sample or population standard deviation:

    Z=XiX¯s

    Z=Xiμσ

  4. Cube each standardized value by raising it to the power of 3:

    (XiX¯s)3

    (Xiμσ)3

  5. Sum all the cubed standardized values:

    i=1n(XiX¯s)3

    i=1n(Xiμσ)3

  6. Multiply the result by n/(n-1)(n-2) to obtain the sample skewness or divide it by the number of data points in the population (N) to obtain the population skewness:

    SK=[n(n1)(n2)]i=1n(XiX¯s)3

    γ1=[1N]i=1n(Xiμσ)3

This calculation returns a value that represents the skewness. Intuitively, positive values indicate positive skewness and negative values indicate a negative one. The skewness’s magnitude indicates the extent of asymmetry in the distribution. Meanwhile, a result of 0 means the distribution is symmetrical.

Skewness Calculator

What Is Skewness?

Skewness is a measure of probability distribution asymmetry. It indicates whether the observations in a dataset are concentrated on one side. This metric shows how the distribution deviates from normality and helps us comprehend the data’s fundamental structure. In fact, skewness plays a significant role in statistical modeling and hypothesis testing. If the data is skewed, the statistical test may not be accurate, and the results can be misleading.

A distribution can be right-skewed (positively skewed), left-skewed (negatively skewed), or symmetrical (zero skewness). We demonstrate how to determine the skewness type in the following sections.

What Is Zero Skewness?

Zero skew (or zero skewness) refers to a symmetrical, bell-shaped distribution where the probability of observing a value to the left or right of the mean is equal. This means that the distribution is balanced, with no long tail on either side of the mean. In other words, both sides are mirror images.

Zero skewness graph.

In a zero-skewed distribution, the mean, median, and mode are all equal, coinciding at the center of the distribution. This symmetry makes the zero skew easy to work with.

Symmetrical distribution graph.

Examples of distributions with zero skewness include the normal distribution (also known as the Gaussian distribution) and the uniform distribution .

Uniform distribution graph. No skew graph.

It is important to note that a zero-skewed distribution does not necessarily imply it’s a normal one. Zero skew merely indicates a balance or, in other words, an absence of skewness. A normal distribution is a distribution that fits a particular bell-shaped curve and has the following characteristics:

  • It can be described by two parameters: mean and variance.
  • Its mean and median values are equal.
  • Approximately 68% of the data falls within one standard deviation of the mean.
  • About 95% of the data falls within two standard deviations.
  • Nearly 99% of the data falls within three standard deviations.

Men’s height is one example of a zero-skewed dataset. In general, human height and weight tend to follow a bell-shaped normal distribution that is symmetric around the mean. For example, the average height for men in the US is 5’9” (or roughly 69.2 inches). In such cases, the distribution is roughly symmetrical, with some individuals being shorter and some being taller.

Distribution of male height graph.

What Is Positive Skewness?

Positive skew (also known as right skew) occurs when the data is not symmetrical around the mean, forming a long tail on its right side. This means that most of the distribution's observations are concentrated to the left of the peak.

Distribution skewed to the right graph.

The presence of positive skewness can have several implications. In a right-skewed distribution, the mean is larger than the median, whereas the mode has the highest visual representation. That’s because the tail of the distribution pulls the mean to the right.

Positive skewed mean, median, mode graph.

As a result, the mean is no longer a good central tendency indicator, and it cannot accurately reflect the typical value of the dataset. Performing analyses on positively skewed data violates the assumption of normality, which is typical for many statistical tests. This could affect the validity of the results and should be taken into account when interpreting statistical findings.

The population income distribution is one example of a right skew. Typically, most people earn average volumes of income, while a few earn very high earnings. High-earners would be the outliers that skew the distribution toward the right. As a result, the distribution forms a long tail on its right side, suggesting positive skewness.

Distribution of household incomes graph.

What Is Negative Skewness?

With negative skew (also known as left skew), the left tail of the distribution extends to the peak’s left. This means that most of the observations are clustered to the right side. As a result, a negatively skewed distribution is not symmetrical around the mean, with a long tail on the left side of the peak.

Negative skewed graph.

In a negatively skewed frequency distribution, the mean is lower than the median, whereas, once again, the mode defines the highest point. This type is also called left skew because the observations are to the left.

Negative skewness can affect statistical analyses, such as hypothesis testing and regression, and the data interpretation overall. In general, the presence of a high skew makes it difficult to draw conclusions and requires some data transformation or the use of non-parametric methods. Additionally, it may signify that a dataset contains outliers on the left side or that the data is constrained by a minimum value, producing a long left tail.

Negative skewed mean, median, mode graph.

An example of a left skew is the distribution of daily stock market returns. Investors can expect frequent small gains on most days, but the stock market could occasionally deliver huge negative returns. As a result, the distribution is likely to be skewed to the left.

Distribution of daily stock market returns graph.

How to Handle Skewed Data?

Skewed data can lead to biased estimates of central tendency (such as the mean) and affect measures of variability (such as the standard deviation). In addition, it causes incorrect inferences about the population and reduced statistical power.

There are several methods that deal with skewness in statistics:

  1. Transform the data by applying techniques such as log transformation. This method helps fit a very skewed distribution into a normal one.
  2. Remove outliers—observations that lie at an abnormal distance away from the mean. Creating histograms or box plots can help you see the distribution’s shape and identify these outliers.
  3. Use a model that does not assume a normal distribution. Non-parametric tests or generalized linear models, for example, are often more suited to analyze skewed data.

How to Interpret the Skewness Coefficient

You can determine the skewness of your dataset using the Skewness Calculator or the step-by-step guide above. But either way, the value is meaningless unless you know how to interpret it.

The skew value varies depending on the data distribution. If a distribution is perfectly symmetrical, then the skewness is 0. On the other hand, if the tail of a distribution extends to the right, it has positive skewness. Similarly, if the tail extends to the left, the distribution has negative skewness.

The skew’s magnitude measures the extent of asymmetry, with higher magnitudes indicating greater skewness. Although the possible values of skewness range from -3 to 3, values outside the range of -2 to 2 are not commonplace.

Here is a standardly used scale to interpret skewness intensity:

  • A value of 0 indicates a perfectly symmetrical distribution.
  • A value between -0.5 and 0 or between 0 and 0.5 indicates an approximately symmetric distribution.
  • A value between -1 and -0.5 or between 0.5 and 1 indicates a moderately skewed distribution.
  • A value between -1.5 and -1 or between 1 and 1.5 indicates a highly skewed distribution.
  • A value less than -1.5 or greater than 1.5 indicates an extremely skewed distribution.

What Is the Difference Between Skewness and Kurtosis?

Skewness measures the degree of asymmetry in a distribution, with a skew of 0 indicating perfect symmetry.

Kurtosis, on the other hand, measures the degree of tailedness or flatness in a distribution. This tells us what the weight of the tails is relative to the rest of the distribution. Or put simply, how much of the data is in the tails compared to the center. If a distribution has high kurtosis, it means there is more data in the tails, while low kurtosis means that it is more evenly distributed.

You can obtain it using our Kurtosis Calculator and learn more about it in the related article.

How do you know if your data is left or right skewed?

There are multiple ways to check whether your data is skewed to the left or to the right. It’s important to have a firm grasp on these methods, as skewed data can have a significant impact on our statistical analysis. Ultimately, it allows us to get a more complete understanding of the data we are working with.

One way to determine if your data is left or right skewed is to visually inspect the distribution by creating a histogram.

If most of the data points are clustered to the right, then your data is considered right skewed (or positively skewed), indicating a longer right tail:

Positive skew graph.

Conversely, if most of the data points are clustered to the left, then your data is left skewed (or negatively skewed), indicating a longer left tail:

Negative skew graph.

And finally, if the frequency of occurrence is mirrored, then there is zero skewness and the distribution is symmetrical:

Zero skew graph.

Another way to confirm skewness is by calculating the skewness coefficient, which is a measure of the distribution's asymmetry. A positive value indicates right-skewed data, whereas a negative one means the data is skewed to the left.

By being aware of the skewness of your data, you can take appropriate steps to mitigate its effects and ensure accurate statistical analysis.


How to calculate skewness in Excel?

Excel allows you to calculate skewness using one of its two built-in functions: SKEW and SKEW.P. The former is suitable for sample data, whereas the latter returns the skewness of a distribution based on a population.

The function requires only those cells that contain relevant data. Take, for example, the following dataset:

Dataset.

In Excel, type SKEW, then select the cell range (C5:C14):

SKEW in the range (C5:C14). The result is 0.13.

By running the formula, we see that the sample data skewness is 0.13.

Obtaining the population skewness is also straightforward. As previously mentioned, we use the SKEW.P function on the same cell rage (C5:C14):

SKEW.P in the range (C5:C14).

And here, we see that the population data skewness is 0.11.

Whichever calculation method you chose, make sure you double-check your result using the Skewness Calculator above.