Inverse-gamma distribution

Two-parameter family of continuous probability distributions

This article needs additional citations for verification . Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Inverse-gamma distribution" – news · newspapers · books · scholar · JSTOR (October 2014) (Learn how and when to remove this message)

Inverse-gamma
Probability density function
Cumulative distribution function
Parameters	$\alpha >0$ {\displaystyle \alpha >0} shape (real) $\beta >0$ {\displaystyle \beta >0} scale (real)
Support	$x\in (0,\infty )\!$ {\displaystyle x\in (0,\infty )\!}
PDF	${\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}x^{-\alpha -1}\exp \left(-{\frac {\beta }{x}}\right)$ {\displaystyle {\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}x^{-\alpha -1}\exp \left(-{\frac {\beta }{x}}\right)}
CDF	${\frac {\Gamma (\alpha ,\beta /x)}{\Gamma (\alpha )}}\!$ {\displaystyle {\frac {\Gamma (\alpha ,\beta /x)}{\Gamma (\alpha )}}\!}
Mean	${\frac {\beta }{\alpha -1}}\!$ {\displaystyle {\frac {\beta }{\alpha -1}}\!} for $\alpha >1$ {\displaystyle \alpha >1}
Mode	${\frac {\beta }{\alpha +1}}\!$ {\displaystyle {\frac {\beta }{\alpha +1}}\!}
Variance	${\frac {\beta ^{2}}{(\alpha -1)^{2}(\alpha -2)}}\!$ {\displaystyle {\frac {\beta ^{2}}{(\alpha -1)^{2}(\alpha -2)}}\!} for $\alpha >2$ {\displaystyle \alpha >2}
Skewness	${\frac {4{\sqrt {\alpha -2}}}{\alpha -3}}\!$ {\displaystyle {\frac {4{\sqrt {\alpha -2}}}{\alpha -3}}\!} for $\alpha >3$ {\displaystyle \alpha >3}
Excess kurtosis	${\frac {6(5,円\alpha -11)}{(\alpha -3)(\alpha -4)}}\!$ {\displaystyle {\frac {6(5,円\alpha -11)}{(\alpha -3)(\alpha -4)}}\!} for $\alpha >4$ {\displaystyle \alpha >4}
Entropy	$\alpha \!+\!\ln(\beta \Gamma (\alpha ))\!-\!(1\!+\!\alpha )\psi (\alpha )$ {\displaystyle \alpha \!+\!\ln(\beta \Gamma (\alpha ))\!-\!(1\!+\!\alpha )\psi (\alpha )} (see digamma function)
MGF	Does not exist.
CF	${\frac {2\left(-i\beta t\right)^{\!\!{\frac {\alpha }{2}}}}{\Gamma (\alpha )}}K_{\alpha }\left({\sqrt {-4i\beta t}}\right)$ {\displaystyle {\frac {2\left(-i\beta t\right)^{\!\!{\frac {\alpha }{2}}}}{\Gamma (\alpha )}}K_{\alpha }\left({\sqrt {-4i\beta t}}\right)}

In probability theory and statistics, the inverse gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, which is the distribution of the reciprocal of a variable distributed according to the gamma distribution.

Perhaps the chief use of the inverse gamma distribution is in Bayesian statistics, where the distribution arises as the marginal posterior distribution for the unknown variance of a normal distribution, if an uninformative prior is used, and as an analytically tractable conjugate prior, if an informative prior is required.^[1] It is common among some Bayesians to consider an alternative parametrization of the normal distribution in terms of the precision, defined as the reciprocal of the variance, which allows the gamma distribution to be used directly as a conjugate prior. Other Bayesians prefer to parametrize the inverse gamma distribution differently, as a scaled inverse chi-squared distribution.

Characterization

[edit ]

Probability density function

[edit ]

The inverse gamma distribution's probability density function is defined over the support $x>0$ {\displaystyle x>0}

f(x;\alpha ,\beta )={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}(1/x)^{\alpha +1}\exp \left(-\beta /x\right)

{\displaystyle f(x;\alpha ,\beta )={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}(1/x)^{\alpha +1}\exp \left(-\beta /x\right)}

with shape parameter $\alpha$ {\displaystyle \alpha } and scale parameter $\beta$ {\displaystyle \beta }.^[2] Here $\Gamma (\cdot )$ {\displaystyle \Gamma (\cdot )} denotes the gamma function.

Unlike the gamma distribution, which contains a somewhat similar exponential term, $\beta$ {\displaystyle \beta } is a scale parameter as the density function satisfies:

f(x;\alpha ,\beta )={\frac {f(x/\beta ;\alpha ,1)}{\beta }}

{\displaystyle f(x;\alpha ,\beta )={\frac {f(x/\beta ;\alpha ,1)}{\beta }}}

Cumulative distribution function

[edit ]

The cumulative distribution function is the regularized gamma function

F(x;\alpha ,\beta )={\frac {\Gamma \left(\alpha ,{\frac {\beta }{x}}\right)}{\Gamma (\alpha )}}=Q\left(\alpha ,{\frac {\beta }{x}}\right)\!

{\displaystyle F(x;\alpha ,\beta )={\frac {\Gamma \left(\alpha ,{\frac {\beta }{x}}\right)}{\Gamma (\alpha )}}=Q\left(\alpha ,{\frac {\beta }{x}}\right)\!}

where the numerator is the upper incomplete gamma function and the denominator is the gamma function. Many math packages allow direct computation of $Q$ {\displaystyle Q}, the regularized gamma function.

Moments

[edit ]

Provided that $\alpha >n$ {\displaystyle \alpha >n}, the $n$ {\displaystyle n}-th moment of the inverse gamma distribution is given by^[3]

\mathrm {E} [X^{n}]=\beta ^{n}{\frac {\Gamma (\alpha -n)}{\Gamma (\alpha )}}={\frac {\beta ^{n}}{(\alpha -1)\cdots (\alpha -n)}}.

{\displaystyle \mathrm {E} [X^{n}]=\beta ^{n}{\frac {\Gamma (\alpha -n)}{\Gamma (\alpha )}}={\frac {\beta ^{n}}{(\alpha -1)\cdots (\alpha -n)}}.}

Characteristic function

[edit ]

The inverse gamma distribution has characteristic function ${\frac {2\left(-i\beta t\right)^{\!\!{\frac {\alpha }{2}}}}{\Gamma (\alpha )}}K_{\alpha }\left({\sqrt {-4i\beta t}}\right)$ {\displaystyle {\frac {2\left(-i\beta t\right)^{\!\!{\frac {\alpha }{2}}}}{\Gamma (\alpha )}}K_{\alpha }\left({\sqrt {-4i\beta t}}\right)} where $K_{\alpha }$ {\displaystyle K_{\alpha }} is the modified Bessel function of the 2nd kind.

Properties

[edit ]

For $\alpha >0$ {\displaystyle \alpha >0} and $\beta >0$ {\displaystyle \beta >0},

\mathbb {E} [\ln(X)]=\ln(\beta )-\psi (\alpha ),円

{\displaystyle \mathbb {E} [\ln(X)]=\ln(\beta )-\psi (\alpha ),円}

and

\mathbb {E} [X^{-1}]={\frac {\alpha }{\beta }},,円

{\displaystyle \mathbb {E} [X^{-1}]={\frac {\alpha }{\beta }},,円}

The information entropy is

{\begin{aligned}\operatorname {H} (X)&=\operatorname {E} [-\ln(p(X))]\\&=\operatorname {E} \left[-\alpha \ln(\beta )+\ln(\Gamma (\alpha ))+(\alpha +1)\ln(X)+{\frac {\beta }{X}}\right]\\&=-\alpha \ln(\beta )+\ln(\Gamma (\alpha ))+(\alpha +1)\ln(\beta )-(\alpha +1)\psi (\alpha )+\alpha \\&=\alpha +\ln(\beta \Gamma (\alpha ))-(\alpha +1)\psi (\alpha ).\end{aligned}}

{\displaystyle {\begin{aligned}\operatorname {H} (X)&=\operatorname {E} [-\ln(p(X))]\\&=\operatorname {E} \left[-\alpha \ln(\beta )+\ln(\Gamma (\alpha ))+(\alpha +1)\ln(X)+{\frac {\beta }{X}}\right]\\&=-\alpha \ln(\beta )+\ln(\Gamma (\alpha ))+(\alpha +1)\ln(\beta )-(\alpha +1)\psi (\alpha )+\alpha \\&=\alpha +\ln(\beta \Gamma (\alpha ))-(\alpha +1)\psi (\alpha ).\end{aligned}}}

where $\psi (\alpha )$ {\displaystyle \psi (\alpha )} is the digamma function.

The Kullback-Leibler divergence of Inverse-Gamma(α_p, β_p) from Inverse-Gamma(α_q, β_q) is the same as the KL-divergence of Gamma(α_p, β_p) from Gamma(α_q, β_q):

$D_{\mathrm {KL} }(\alpha _{p},\beta _{p};\alpha _{q},\beta _{q})=\mathbb {E} \left[\log {\frac {\rho (X)}{\pi (X)}}\right]=\mathbb {E} \left[\log {\frac {\rho (1/Y)}{\pi (1/Y)}}\right]=\mathbb {E} \left[\log {\frac {\rho _{G}(Y)}{\pi _{G}(Y)}}\right],$ {\displaystyle D_{\mathrm {KL} }(\alpha _{p},\beta _{p};\alpha _{q},\beta _{q})=\mathbb {E} \left[\log {\frac {\rho (X)}{\pi (X)}}\right]=\mathbb {E} \left[\log {\frac {\rho (1/Y)}{\pi (1/Y)}}\right]=\mathbb {E} \left[\log {\frac {\rho _{G}(Y)}{\pi _{G}(Y)}}\right],}

where $\rho ,\pi$ {\displaystyle \rho ,\pi } are the pdfs of the Inverse-Gamma distributions and $\rho _{G},\pi _{G}$ {\displaystyle \rho _{G},\pi _{G}} are the pdfs of the Gamma distributions, $Y$ {\displaystyle Y} is Gamma(α_p, β_p) distributed.

{\begin{aligned}D_{\mathrm {KL} }(\alpha _{p},\beta _{p};\alpha _{q},\beta _{q})={}&(\alpha _{p}-\alpha _{q})\psi (\alpha _{p})-\log \Gamma (\alpha _{p})+\log \Gamma (\alpha _{q})+\alpha _{q}(\log \beta _{p}-\log \beta _{q})+\alpha _{p}{\frac {\beta _{q}-\beta _{p}}{\beta _{p}}}.\end{aligned}}

{\displaystyle {\begin{aligned}D_{\mathrm {KL} }(\alpha _{p},\beta _{p};\alpha _{q},\beta _{q})={}&(\alpha _{p}-\alpha _{q})\psi (\alpha _{p})-\log \Gamma (\alpha _{p})+\log \Gamma (\alpha _{q})+\alpha _{q}(\log \beta _{p}-\log \beta _{q})+\alpha _{p}{\frac {\beta _{q}-\beta _{p}}{\beta _{p}}}.\end{aligned}}}

Related distributions

[edit ]

If $X\sim {\mbox{Inv-Gamma}}(\alpha ,\beta )$ {\displaystyle X\sim {\mbox{Inv-Gamma}}(\alpha ,\beta )} then $kX\sim {\mbox{Inv-Gamma}}(\alpha ,k\beta ),円$ {\displaystyle kX\sim {\mbox{Inv-Gamma}}(\alpha ,k\beta ),円}, for $k>0$ {\displaystyle k>0}
If $X\sim {\mbox{Inv-Gamma}}(\alpha ,{\tfrac {1}{2}})$ {\displaystyle X\sim {\mbox{Inv-Gamma}}(\alpha ,{\tfrac {1}{2}})} then $X\sim {\mbox{Inv-}}\chi ^{2}(2\alpha ),円$ {\displaystyle X\sim {\mbox{Inv-}}\chi ^{2}(2\alpha ),円} (inverse-chi-squared distribution)
If $X\sim {\mbox{Inv-Gamma}}({\tfrac {\alpha }{2}},{\tfrac {1}{2}})$ {\displaystyle X\sim {\mbox{Inv-Gamma}}({\tfrac {\alpha }{2}},{\tfrac {1}{2}})} then $X\sim {\mbox{Scaled Inv-}}\chi ^{2}(\alpha ,{\tfrac {1}{\alpha }}),円$ {\displaystyle X\sim {\mbox{Scaled Inv-}}\chi ^{2}(\alpha ,{\tfrac {1}{\alpha }}),円} (scaled-inverse-chi-squared distribution)
If $X\sim {\textrm {Inv-Gamma}}({\tfrac {1}{2}},{\tfrac {c}{2}})$ {\displaystyle X\sim {\textrm {Inv-Gamma}}({\tfrac {1}{2}},{\tfrac {c}{2}})} then $X\sim {\textrm {Levy}}(0,c),円$ {\displaystyle X\sim {\textrm {Levy}}(0,c),円} (Lévy distribution)
If $X\sim {\textrm {Inv-Gamma}}(1,c)$ {\displaystyle X\sim {\textrm {Inv-Gamma}}(1,c)} then ${\tfrac {1}{X}}\sim {\textrm {Exp}}(c),円$ {\displaystyle {\tfrac {1}{X}}\sim {\textrm {Exp}}(c),円} (Exponential distribution)
If $X\sim {\mbox{Gamma}}(\alpha ,\beta ),円$ {\displaystyle X\sim {\mbox{Gamma}}(\alpha ,\beta ),円} (Gamma distribution with rate parameter $\beta$ {\displaystyle \beta }) then ${\tfrac {1}{X}}\sim {\mbox{Inv-Gamma}}(\alpha ,\beta ),円$ {\displaystyle {\tfrac {1}{X}}\sim {\mbox{Inv-Gamma}}(\alpha ,\beta ),円} (see derivation in the next paragraph for details)
Note that If $X\sim {\mbox{Gamma}}(k,\theta )$ {\displaystyle X\sim {\mbox{Gamma}}(k,\theta )} (Gamma distribution with scale parameter $\theta$ {\displaystyle \theta } ) then $1/X\sim {\mbox{Inv-Gamma}}(k,1/\theta )$ {\displaystyle 1/X\sim {\mbox{Inv-Gamma}}(k,1/\theta )}
Inverse gamma distribution is a special case of type 5 Pearson distribution
A multivariate generalization of the inverse-gamma distribution is the inverse-Wishart distribution.
For the distribution of a sum of independent inverted Gamma variables see Witkovsky (2001)

Derivation from Gamma distribution

[edit ]

Let $X\sim {\mbox{Gamma}}(\alpha ,\beta )$ {\displaystyle X\sim {\mbox{Gamma}}(\alpha ,\beta )}, and recall that the pdf of the gamma distribution is

f_{X}(x)={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}x^{\alpha -1}e^{-\beta x}

{\displaystyle f_{X}(x)={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}x^{\alpha -1}e^{-\beta x}},

x>0

{\displaystyle x>0}.

Note that $\beta$ {\displaystyle \beta } is the rate parameter from the perspective of the gamma distribution.

Define the transformation $Y=g(X)={\tfrac {1}{X}}$ {\displaystyle Y=g(X)={\tfrac {1}{X}}}. Then, the pdf of $Y$ {\displaystyle Y} is

{\begin{aligned}f_{Y}(y)&=f_{X}\left(g^{-1}(y)\right)\left|{\frac {d}{dy}}g^{-1}(y)\right|\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left({\frac {1}{y}}\right)^{\alpha -1}\exp \left({\frac {-\beta }{y}}\right){\frac {1}{y^{2}}}\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left({\frac {1}{y}}\right)^{\alpha +1}\exp \left({\frac {-\beta }{y}}\right)\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left(y\right)^{-\alpha -1}\exp \left({\frac {-\beta }{y}}\right)\\[6pt]\end{aligned}}

{\displaystyle {\begin{aligned}f_{Y}(y)&=f_{X}\left(g^{-1}(y)\right)\left|{\frac {d}{dy}}g^{-1}(y)\right|\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left({\frac {1}{y}}\right)^{\alpha -1}\exp \left({\frac {-\beta }{y}}\right){\frac {1}{y^{2}}}\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left({\frac {1}{y}}\right)^{\alpha +1}\exp \left({\frac {-\beta }{y}}\right)\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left(y\right)^{-\alpha -1}\exp \left({\frac {-\beta }{y}}\right)\\[6pt]\end{aligned}}}

Note that ${\beta }$ {\displaystyle {\beta }} is the scale parameter from the perspective of the inverse gamma distribution. This can be straightforwardly demonstrated by seeing that ${\beta }$ {\displaystyle {\beta }} satisfies the conditions for being a scale parameter.

{\begin{aligned}{\frac {f(y/\beta ;\alpha ,1)}{\beta }}&={\frac {1}{\beta }}{\frac {1}{\Gamma (\alpha )}}\left({\frac {y}{\beta }}\right)^{-\alpha -1}\exp(-{\frac {1}{\frac {y}{\beta }}})\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left(y\right)^{-\alpha -1}\exp(-{\frac {\beta }{y}})\\[6pt]&=f(y;\alpha ,\beta )\end{aligned}}

{\displaystyle {\begin{aligned}{\frac {f(y/\beta ;\alpha ,1)}{\beta }}&={\frac {1}{\beta }}{\frac {1}{\Gamma (\alpha )}}\left({\frac {y}{\beta }}\right)^{-\alpha -1}\exp(-{\frac {1}{\frac {y}{\beta }}})\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left(y\right)^{-\alpha -1}\exp(-{\frac {\beta }{y}})\\[6pt]&=f(y;\alpha ,\beta )\end{aligned}}}

Occurrence

[edit ]

Hitting time distribution of a Wiener process follows a Lévy distribution, which is a special case of the inverse-gamma distribution with $\alpha =0.5$ {\displaystyle \alpha =0.5}.^[4]

References

[edit ]

^ Hoff, P. (2009). "The normal model". A First Course in Bayesian Statistical Methods. Springer. pp. 67–88. ISBN 978-0-387-92299-7.
^ "InverseGammaDistribution—Wolfram Language Documentation". reference.wolfram.com. Retrieved 9 April 2018.
^ John D. Cook (Oct 3, 2008). "InverseGammaDistribution" (PDF). Retrieved 3 Dec 2018.
^ Ludkovski, Mike (2007). "Math 526: Brownian Motion Notes" (PDF). UC Santa Barbara. pp. 5–6. Archived from the original (PDF) on 2022年01月26日. Retrieved 2021年04月13日.

Witkovsky, V. (2001). "Computing the Distribution of a Linear Combination of Inverted Gamma Variables". Kybernetika. 37 (1): 79–90. MR 1825758. Zbl 1263.62022.

v
t
e

Probability distributions (list)

Discrete
univariate

with finite support	Benford Bernoulli Beta-binomial Binomial Categorical Hypergeometric Negative Poisson binomial Rademacher Soliton Discrete uniform Zipf Zipf–Mandelbrot
with infinite support	Beta negative binomial Borel Conway–Maxwell–Poisson Discrete phase-type Delaporte Extended negative binomial Flory–Schulz Gauss–Kuzmin Geometric Logarithmic Mixed Poisson Negative binomial Panjer Parabolic fractal Poisson Skellam Yule–Simon Zeta

Continuous
univariate

supported on a bounded interval	Arcsine ARGUS Balding–Nichols Bates Beta Generalized Beta rectangular Continuous Bernoulli Irwin–Hall Kumaraswamy Logit-normal Noncentral beta PERT Raised cosine Reciprocal Triangular U-quadratic Uniform Wigner semicircle
supported on a semi-infinite interval	Benini Benktander 1st kind Benktander 2nd kind Beta prime Burr Chi Chi-squared Noncentral Inverse Scaled Dagum Davis Erlang Hyper Exponential Hyperexponential Hypoexponential Logarithmic F Noncentral Folded normal Fréchet Gamma Generalized Inverse gamma/Gompertz Gompertz Shifted Half-logistic Half-normal Hotelling's T-squared Inverse Gaussian Generalized Kolmogorov Lévy Log-Cauchy Log-Laplace Log-logistic Log-normal Log-t Lomax Matrix-exponential Maxwell–Boltzmann Maxwell–Jüttner Mittag-Leffler Nakagami Pareto Phase-type Poly-Weibull Rayleigh Relativistic Breit–Wigner Rice Truncated normal type-2 Gumbel Weibull Discrete Wilks's lambda
supported on the whole real line	Cauchy Exponential power Fisher's z Kaniadakis κ-Gaussian Gaussian q Generalized normal Generalized hyperbolic Geometric stable Gumbel Holtsmark Hyperbolic secant Johnson's S_U Landau Laplace Asymmetric Logistic Noncentral t Normal (Gaussian) Normal-inverse Gaussian Skew normal Slash Stable Student's t Tracy–Widom Variance-gamma Voigt
with support whose type varies	Generalized chi-squared Generalized extreme value Generalized Pareto Marchenko–Pastur Kaniadakis κ-exponential Kaniadakis κ-Gamma Kaniadakis κ-Weibull Kaniadakis κ-Logistic Kaniadakis κ-Erlang q-exponential q-Gaussian q-Weibull Shifted log-logistic Tukey lambda

Mixed
univariate

continuous- discrete	Rectified Gaussian

Multivariate
(joint)

Discrete:
Ewens
Multinomial
- Dirichlet
- Negative
Continuous:
Dirichlet
- Generalized
Multivariate Laplace
Multivariate normal
Multivariate stable
Multivariate t
Normal-gamma
- Inverse
Matrix-valued:
LKJ
Matrix beta
Matrix normal
Matrix t
Matrix gamma
- Inverse
Wishart
- Normal
- Inverse
- Normal-inverse
- Complex

Directional

Univariate (circular) directional: Circular uniform; Univariate von Mises; Wrapped normal; Wrapped Cauchy; Wrapped exponential; Wrapped asymmetric Laplace; Wrapped Lévy
Bivariate (spherical): Kent
Bivariate (toroidal): Bivariate von Mises
Multivariate: von Mises–Fisher; Bingham

Degenerate
and singular

Degenerate: Dirac delta function
Singular: Cantor

Families

Retrieved from "https://en.wikipedia.org/w/index.php?title=Inverse-gamma_distribution&oldid=1250635066"

Characterization

Probability density function

Cumulative distribution function

Moments

Characteristic function

Properties

Related distributions

Derivation from Gamma distribution

Occurrence

See also

References