The Negative Binomial distribution

tags
Probability distribution

Canonical parametrization

The negative binomial distribution is usually introduced as the probability distribution of the number \(X\) of failures before the \(r\) th success in a Bernoulli process with probability \(p\in [0,1]\). It is expressed as:

\begin{equation} P(X=x|p, r) = \binom{r+x-1}{x} (1-p)^x\, p^r \end{equation}

However, the negative binomial is rarely used in this canonical context but rather to model count data where there seems to be overdispersion, i.e. where \(\sigma > \mu\) (unlike the Poisson distribution where they're equal). In this case, it is more convenient to work directly with \(\mu\) and \(\sigma\).

Mean-variance parametrization

We can indeed reparametrize the distribution as a function of its mean \(\mu\) and variance \(\sigma\):

\begin{equation} r = \frac{\mu^2}{\sigma^{2}-\mu} \end{equation}

and

\begin{equation} p = \frac{r}{r + \mu} \end{equation}

Viewing \(\mu\) and \(\sigma\) as parameters means we abandon the combinatorial motivation of the negative binomial to view it as a model for count data. A very important property of this parametrization is the link between \(\mu\) and the variance:

\begin{equation} \sigma^{2} = \mu + \frac{1}{r} \mu^{2} \end{equation}

Most problems people have with negative binomial models come from ignoring this property. This dependence in \(\mu^2\) is a rather strong constraint; when it becomes problematic we can use:

  • The Skellam distribution
  • The Conway-Maxwell-Poisson distribution
  • The Generalized Poisson distribution

As a Poisson-Gamma compound distribution

The negative binomial distribution can also be seen as a Poisson distribution where the rate parameter follows a Gamma distribution (conjugate distribution):

\begin{align*} X &\sim \operatorname{Poisson}(\Lambda)\\ \Lambda &\sim \operatorname{Gamma}(\alpha, \beta) \end{align*}

Then we can show that the distribution of \(X\) is a negative binomial with \(r = \alpha\) and \(p = 1 / (1 + \beta)\), so that the mean and variance follow:

\begin{equation} \sigma^{2} = \mu + \frac{1}{\alpha}\, \mu^{2} \end{equation}

Which is a nice, less ad-hoc interpretation of the negative binomial with the mean-variance parametrization.

References

Links to this note