Normal distribution

Wiki2Reveal

This page can be displayed as Wiki2Reveal slides. Single sections are regarded as slides and modifications on the slides will immediately affect the content of the slides in RevealJS.

Introduction

A normal distribution can be described by four moments: mean, standard deviation, skewness and kurtosis. Statistical properties of normal distributions are important for parametric statistical tests which rely on assumptions of normality. The normal distribution is often used as assumption of the underlying probability distribution in natural sciences and social sciences^[1]^[2]

Probability density function

The probability density function of the standard normal distribution (with the standard deviation and area under the curve standardized to 1 and the mean and skewness standardized to 0) is given by $f(x)={\frac {e^{-x^{2}/2}}{\sqrt {2\pi }}}$

Antiderivative

This function has no elementary antiderivative, and thus normal distribution problems are mainly limited to using numerical integration to find a probability.

Gaussian integral and Normalization

However, using multivariable calculus, the value of the Gaussian integral can be determined to be exactly $\int _{-\infty }^{\infty }e^{-x^{2}}dx={\sqrt {\pi }}$ , and through the change of variables $u=x{\sqrt {2}},du=dx{\sqrt {2}}$ , we can be assured that the total area beneath the curve is 1, and thus it is normalized correctly.

Standard Deviation

A similar approach can be used to prove that its standard deviation is also 1. By the definition of the standard deviation, $\sigma ^{2}=\Sigma {(x_{k}-\mu )^{2}}$ , so we form the Riemann sum $\Sigma (x-0)^{2}{\frac {e^{-x^{2}/2}}{\sqrt {2\pi }}}\Delta x$ . Taking the limit of the sum leads to the integral $\int _{-\infty }^{\infty }x^{2}{\frac {e^{-x^{2}/2}}{\sqrt {2\pi }}}dx$ .

Integration by Parts

Using integration by parts, we can determine that this is in fact equal to the area under the normal curve, and thus the standard deviation is 1.

Inflection Points

Using differentiation of the probability density function, we find that the inflection points of the normal distribution curve are each exactly one standard deviation away from the mean.

Standardization

Any other normal distribution can be standardized through a change of variables (such as if the mean is not 0).

Learning Task

Learn about the general properties of a Probability Distribution and check why does the normal distribution fullfil these properities.
Create number in R programming language R e.g. with Open Source R/RStudio and plot the generated data with a histogram.
Analyze the Central Limit Theorem and describe, how the normal distribution is related to that theorem. Give an real world example where you would not assume a normal distribution.

Graphs 1 - Normal Distribution

Some normal distributions with various parameters.

Graphs 2 - Normal Distribution

Some normal distributions with various parameters.

Normal probability plots

A normality probability plot - data falling around this straight indicates the degree of departure from normality. For more information, see * Q-Q plot

Symbols

Population mean = μ (Mu)
Population variance = σ² = (Sigma squared)

Testing for normality

No single indicator of normality should be overly relied upon. Graphical, descriptive, and inferential can be used, each with strengths and limitations. The most important result is to actually describe and show the distribution. Simply listing statistical properties does not demonstrate understanding.

Graphical analysis

Whether it is reported as a Figure or not, responses to interval or continuous variables should be visualised as a:

Histogram, with normal curve imposed

This is the single most basic and important way of examining the central tendency and shape of distribution for participants' responses.

It may also be helpful to examine a:

Normal Q-Q plot

Descriptive indicators of normality

A rule of thumb for assessing normality for the purposes of assumption testing for inferential statistical tests such as ANOVA is that if skewness and kurtosis are between -1 and +1 and there is a reasonable sample size (e.g., at least 20 per cell), then you are unlikely to run into issues related to violations of the assumption of normality.

Some authors suggest that variables with skewness and kurtosis values between -2 to +2 or even -3 to +3 can be treated as being drawn from a normally distributed population.

The larger the sample size, the more robust inferential tests are to departure from normality.

The skewness of a Normal Distribution is always 0; +ve scores indicate a tail to the right; -ve scores indicate a tail to the left.
The kurtosis of a Normal Distribution is always 0; +ve scores indicate a peaked distribution; -ve scores indicate a relatively flat distribution.

If concerned about non-normality, then consider recoding data to a lower level of measurement.

For more information, see: Judging severity of skewness and kurtosis

Inferential tests of normality

Significance tests of (non-)normality become overly sensitive when the sample size is large. Thus, do not rely on significance tests of normality alone in making an assessment (e.g., for assumption-testing purposes):

These tests are overly sensitive to minor departures from normality, particularly with large sample samples (e.g., > 200). This doesn't mean that it should be discounted as an indicator, just that a sig. (p < .05) test value does not necessarily indicate a notable or problematic departure from normality. Also check normality using other indicators.

Take-home message

It is recommended that both graphical indicators and descriptive indicators be used for testing the assumption that a sample is derived from a normally distributed population. Inferential normality tests may also be useful.

Dealing with non-normality

Transformations

Standardise data in SPSS

Non-parametric statistics

Non-parametric statistics

External links

Testing for normality using SPSS (Laerd Statistics]]

References

↑ Normal Distribution, Gale Encyclopedia of Psychology
↑ Casella & Berger (2001, p. 102)

Page Information

You can display this page as Wiki2Reveal slides

Wiki2Reveal

The Wiki2Reveal slides were created for the Statistics' and the Link for the Wiki2Reveal Slides was created with the link generator.

This page is designed as a PanDocElectron-SLIDE document type.
Source: Wikiversity https://en.wikiversity.org/wiki/Normal_distribution
see Wiki2Reveal for the functionality of Wiki2Reveal.

[1] Normal Distribution, Gale Encyclopedia of Psychology

[2] Casella & Berger (2001, p. 102)

[1]

[2]

Normal distribution

Contents

Wiki2Reveal

Introduction

Probability density function

Antiderivative

Gaussian integral and Normalization

Standard Deviation

Integration by Parts

Inflection Points

Standardization

Learning Task

Graphs 1 - Normal Distribution

Graphs 2 - Normal Distribution

Normal probability plots

Symbols

Testing for normality

Graphical analysis

Descriptive indicators of normality

Inferential tests of normality

Take-home message

Dealing with non-normality

Transformations

Non-parametric statistics

External links

References

See also

Page Information

Wiki2Reveal