Chapter 2: Binomial Distribution

A binomial distribution describes the number of successes in a fixed number of independent Bernoulli trials, each with the same probability of success \(p\).

We write: \[ X \sim B(n, p) \]

where:

  • \(n\) is the number of trials (e.g., number of droplets in ddPCR),

  • \(p\) is the probability of success in each trial,

  • \(X\) is the number of successful trials (positive droplets).

Example

Let’s consider a simple example using dice. Suppose we roll a die 3 times and count how many times we get a multiple of 3 (i.e., 3 or 6).

This is a Bernoulli trial repeated 3 times, with:

\[ p = \frac{2}{6} = \frac{1}{3}, \quad n = 3 \]

We are interested in the distribution of \(X\) = number of successes (i.e., times we get a 3 or 6).

The binomial probability mass function is:

\[ P(X = x) = \binom{n}{x} p^x (1 - p)^{n - x} \]

R Example: Computing Probabilities

x <- 0:3
px <- dbinom(x, size = 3, prob = 1/3)
px
## [1] 0.29629630 0.44444444 0.22222222 0.03703704

These correspond to:

\(P(X = 0)\) = 0.296 (no successes)

\(P(X = 1)\) = 0.444 (one success)

\(P(X = 2)\) = 0.222 (two successes)

\(P(X = 3)\) = 0.037 (all three are successes)

Visualizing the Binomial Distribution

binominal distribution

Figure 1.1: binominal distribution

Interpretation in ddPCR

In ddPCR, each droplet is a Bernoulli trial: either positive or negative for the target DNA. When considering multiple droplets, the total number of positive droplets follows a binomial distribution. This distribution becomes particularly useful when modeling the number of positive droplets across many replicates, before transitioning to Poisson-based modeling for rare events.