Advanced | Help | Encyclopedia
Directory


Chi-square distribution

(Redirected from Chi-square)
The title of this article is incorrect because of technical limitations. The correct title is chi-square distribution.
chi-square
Probability density function
Cumulative distribution function
Parameters <math>k > 0\,<math> degrees of freedom
Support <math>x \in [0; +\infty)\,<math>
pdf <math>\frac{(1/2)^{k/2}}{\Gamma(k/2)} x^{k/2 – 1} e^{-x/2}\,<math>
cdf <math>\frac{\gamma(k/2,x/2)}{\Gamma(k/2)}\,<math>
Mean <math>k\,<math>
Median approximately <math>k-2/3\,<math>
Mode <math>k-2\,<math> if <math>k\geq 2\,<math>
Variance <math>2\,k\,<math>
Skewness <math>\sqrt{8/k}\,<math>
Kurtosis <math>12/k\,<math>
Entropy <math>\frac{k}{2}\,+\,\ln(2\Gamma(k/2))+(1-k/2)\psi(k/2)<math>
mgf <math>(1–2\,t)^{-k/2}<math> for <math>2\,t<1\,<math>
Char. func. <math>(1–2\,i\,t)^{-k/2}\,<math>

For any positive integer <math>k<math>, the chi-square distribution with <math>k<math> degrees of freedom is the probability distribution of the random variable

<math>X=Z_1^2 + \cdots + Z_k^2<math>

where the <math>Z_i<math> are independent standard normal variables (zero expected value and unit variance). This distribution is usually written

<math>X\sim\chi^2_k<math>

The chi-square test can be used to test independence as well as goodness of fit.

An example of a test of independence would be if sex and political affiliation are connected. So you would gather your sample, your expected value, find your critical value, and if the chi-square test is greater than the critical value, you can reject the null, otherwise, you fail to reject the null. (you never accept the null)

The chi-square probability density function is

<math>

f_k(x)= \frac{(1/2)^{k/2}}{\Gamma(k/2)} x^{k/2 – 1} e^{-x/2} <math>

where <math>x \ge 0<math> and <math>f_k(x) = 0<math> for <math>x \le 0<math>. Here <math>\Gamma<math> denotes the Gamma function. The cumulative distribution function is:

<math>F_k(x)=\frac{\gamma(k/2,x/2)}{\Gamma(k/2)}\,<math>

where <math>\gamma(k,z)<math> is the incomplete Gamma function.

Tables of this distribution — usually in its cumulative form — are widely available (see the External links below for online versions), and the function is included in many spreadsheets (for example OpenOffice.org calc or Microsoft Excel) and all statistical packages.

If <math>p<math> independent linear homogeneous constraints are imposed on these variables, the distribution of <math>X<math> conditional on these constraints is <math>\chi^2_{k-p}<math>, justifying the term "degrees of freedom". The characteristic function of the Chi-square distribution is

<math>\phi(t)=(1–2it)^{-k/2}\,<math>

The chi-square distribution has numerous applications in inferential statistics, for instance in chi-square tests and in estimating variances. It enters the problem of estimating the mean of a normally distributed population and the problem of estimating the slope of a regression line via its role in Student's t-distribution. It enters all analysis of variance problems via its role in the F-distribution, which is the distribution of the ratio of two independent chi-squared random variables.

Table of contents

The normal approximation

If <math>X\sim\chi^2_k<math>, then as <math>k<math> tends to infinity, the distribution of <math>X<math> tends to normality. However, the tendency is slow (the skewness is <math>\sqrt{8/k}<math> and the kurtosis is <math>12/k<math>) and two transformations are commonly considered, each of which approaches normality faster than <math>X<math> itself:

Fisher showed that <math>\sqrt{2X}<math> is approximately normally distributed with mean <math>\sqrt{2k-1}<math> and unit variance.

Wilson and Hilferty showed in 1931 that <math>\sqrt[3]{X/k}<math> is approximately normally distributed with mean <math>1–2/(9k)<math> and variance <math>2/(9k)<math>.

The expected value of a random variable having chi-square distribution with <math>k<math> degrees of freedom is <math>k<math> and the variance is <math>2k<math>. The median is given approximately by

<math>k-\frac{2}{3}+\frac{4}{27k}-\frac{8}{729k^2}<math>

Note that 2 degrees of freedom leads to an exponential distribution.

The chi-square distribution is a special case of the gamma distribution.

The information entropy is given by:

<math>

H = \int_{-\infty}^\infty f(x)\ln(f(x)) dx = \frac{k}{2} + \ln

\left(
 2 \Gamma
 \left(
  \frac{k}{2}
 \right)
\right)

+ \left(1 – \frac{k}{2}\right) \psi(k/2) <math>

where <math>\psi(x)<math> is the Digamma function.

Related distributions

  • <math>X \sim \mathrm{Exponential}(\lambda = 2)<math> is an exponential distribution if <math>X \sim \chi_2^2<math> (with 2 degrees of freedom).
  • <math>Y \sim \chi_k^2<math> is a chi-square distribution if <math>Y = \sum_{m=1}^k X_m<math> for <math>X_i \sim N(0,1)<math> independent that are normally distributed.
  • <math>Y \sim \mathrm{F}(\nu_1, \nu_2)<math> is an F-distribution if <math>Y = (X_1 – \nu_1)/(X_2 – \nu_2)<math> where <math>X_1 \sim \chi_{\nu_1}^2<math> and <math>X_2 \sim \chi_{\nu_2}^2<math> are independent with their respective degrees of freedom.
  • <math>Y \sim \chi^2(\bar{\nu})<math> is a chi-square distribution if <math>Y = \sum_{m=1}^N X_m<math> where <math>X_m \sim \chi^2(\nu_m)<math> are independent and <math>\bar{\nu} = \sum_{m=1}^N \nu_m<math>.

See also

External links








Links: Addme | Keyword Research | Paid Inclusion | Femail | Software | Completive Intelligence

Add URL | About Slider | FREE Slider Toolbar - Simply Amazing
Copyright © 2000-2008 Slider.com. All rights reserved.
Content is distributed under the GNU Free Documentation License.