Mean and standard deviation of a binomial random variable
Consider a binomial experiment with parameters and . Thus, the random variable ="number of successes after repetitions" is binomially distributed and can take on one of the values every time the experiment is performed, with the probability
where . We now want to calculate the average number of successes per experiment, , and also the standard deviation from this average. Let's start with the result, and then we give the proof.
Consider the binomial random variable with success probability and repetition number . The mean of the number of
and the standard deviation
Since counts the number of successes per experiment, this means that the observed number of successes per experiment is on average (averaged over a huge number of experiments), and the deviation of the observed number of successes from per experiment is on average (also averaged over a huge number of experiments).
Here is an example.
An experiment consists of flipping times a biased coin with . On average, how many heads do you observe per Experiment? And what is the typical deviation of the observed number of heads per Experiment from this average?
="number of heads" is a binomial variable with and . Thus, on average we observe
heads and typically, the deviation from this value is
heads.
Note that the formula for the mean makes intuitive sense: If we perform the coin experiment from the example above many times (say times), then yes, we flip the coin a total of
times, and therefore observe heads a total of approximately
times (by definition of probability as long-time relative frequency). So per experiment we see on average
heads, or expressed as a formula with and :
But how can we prove these formulas using our general formula for the mean and standard deviation of random variables? Well, applying the general method for calculating and for , we have
and
In the case of binomially distributed random variables, we have
Inserting the formula for in the above expressions for and , and then simplify the resulting expression, we get after many algebraic manipulations and simplifications that and . The example below shows this calculation for the case .
We want to show that for the case it is and .
Solution
With
and we get
and for the variance we get
Thus, we have