Zero-Inflated Binomial Distribution#

Univariate, Discrete, Bounded, Zero-inflated

The Zero-Inflated Binomial (ZIB) distribution is a discrete probability distribution used to model count data characterized by an excess of zeros. It combines two components: a standard Binomial distribution and an additional mechanism that increases the probability of observing zero outcomes.

This distribution is particularly useful for scenarios where the data exhibit more zeros than what the Binomial model alone would predict. For example, in a study of the number of doctor visits in a year, many individuals might not visit a doctor at all, resulting in a higher frequency of zero counts than expected under a traditional Binomial model.

Key properties and parameters#

Support

\(x \in \mathbb{N}_0\)

Mean

\(\psi n p\)

Variance

\(\psi n p (1 - p) + n^2 p^2 (\psi - \psi^2)\)

Parameters:

  • \(\psi\) : (float) Expected proportion of Binomial variates, \(0 \leq \psi \leq 1\).

  • \(n\) : (int) Number of Bernoulli trials, \(n \geq 0\).

  • \(p\) : (float) Probability of success in each trial, \(0 \leq p \leq 1\).

Probability Mass Function (PMF)#

\[\begin{split} f(x \mid \psi, n, p) = \left\{ \begin{array}{l} (1-\psi) + \psi (1-p)^{n}, \text{if } x = 0 \\ \psi {n \choose x} p^x (1-p)^{n-x}, \text{if } x=1,2,3,\ldots,n \end{array} \right. \end{split}\]
/home/docs/checkouts/readthedocs.org/user_builds/preliz/envs/stable/lib/python3.11/site-packages/pytensor/link/numba/dispatch/basic.py:211: UserWarning: Numba will use object mode to run XlogY0's perform method. Set `pytensor.config.compiler_verbose = True` to see more details.
  warnings.warn(
/home/docs/checkouts/readthedocs.org/user_builds/preliz/envs/stable/lib/python3.11/site-packages/pytensor/link/numba/dispatch/basic.py:211: UserWarning: Numba will use object mode to run XlogY0's perform method. Set `pytensor.config.compiler_verbose = True` to see more details.
  warnings.warn(
Zero-Inflated Binomial Distribution PMF

Cumulative Distribution Function (CDF)#

\[\begin{split} F(k \mid \psi, n, p) = \left\{ \begin{array}{l} (1-\psi) + \psi (1-p)^{n}, \text{if } k = 0 \\ (1-\psi) + \psi (1-p)^{n} + \sum_{x=1}^{k} {n \choose x} p^x (1-p)^{n-x}, \text{if } k=1,2,3,\ldots,n \end{array} \right. \end{split}\]
Zero-Inflated Binomial Distribution CDF

See also

Common Alternatives:

  • Binomial Distribution: Models the number of successes in a fixed number of independent Bernoulli trials, each with the same probability of success.

  • Zero-Inflated Poisson Distribution: Extends the Poisson distribution by adding a mechanism to account for excess zeros, often observed in count data.

Related Distributions:

References#