Conditional probability
Conditional probability is an important concept, and occurs quite naturally in surveys or more generally in experiments involving multiple events.
We use the example of a survey to explain the concept: we randomly pick people living in New York and inquire about their voting behaviour. Thus, the set of possible outcomes contains all New Yorkers. Let us define the two events
- ="votes for Trump" and
- ="is male"
Picking a New Yorker at random, denotes the probability for selecting a Trump voter, and is the probability for selecting a male. Or expressed as percentages, is the percentage of New Yorkers voting for Trump and is percentage of New Yorkers being male.
As is often the case for surveys, we want to be more specific and, for example, find the percentage of male New Yorkers voting for Trump. In other words, we are interested in the probability to select a Trump voter, where we now randomly select not from all New Yorkers, but only from all male New Yorkers.
To say it in yet another way, we want to know the probability to select a Trump voter given that the selected person is a male New Yorker. This probability is written as
and is called the conditional probability of E given F. The vertical line stands for "given", so is pronounced " of given ". The situation is illustrated in a Venn-diagram below. The percentage is relative to the sample space , while is the percentage relative to . So there is a change in sample space.
Note that there is a subtle difference between and . Both describe the same persons (male Trump voters in New York), but expresses the number of these people as a percentage of (all New Yorkers), while expresses the same number of people as a percentage of (male New Yorkers).
What is the numerical relationship between these two probabilities? How can we convert from one to the other? Here is a small example. If
- the area is half of :
- and the area of is a quarter of :
- then clearly the area of is half of the quarter of :
Written more generally, we have
Or in a picture:
Observe how on the right side the F-circles cancel and the equation is correct. Dividing both sides of the equation by , we get the equation
Similarly we can show that
All basic properties about probabilities also work for the conditional probability (without proof, but should be intuitively clear as is simply a probability with a sample space restricted to ):
Let and be events of a random experiment, then
- if and are mutually exclusive.
Q1
In a town, of the population are male. Of these, are blond. A person is selected at random. Determine the probability that
- the person is blond, given that the person is male.
- the person is blond and male.
Q2
A teacher gave her class two tests. of the students passed at least the first test, and of the students passed both tests. You select a student at random from the ones that have passed test 1. What is the probability that the student has also passed test 2?
Q3
From people in a town, own a house, and of these, own a car. If you select a person from these people at random, what is the probability that the person owns both a house and a car?
Q4
Show that .
Q5
and are two events of a random experiments with , , and . Determine , , and .
Solution
A1
"selected person is blond", "selected person is male". The sample space is formed by the people in town.
- .

A2
"selected student passes test 1", "selected student passes test 2", and sample space is formed by the students in the class. It is , and . With we get .

A3
"House owner", ="car owner", , . Thus,
Or, because people are in the intersection , we can also calculate directly that .
A4
Repeat the experiment many times, and focus on the experiments where occurred. Of these, the percentage of experiments where occurred (as well) is , and the percentage of experiments where did not occur is . Thus, the two percentages add up to .

A5
Venn-Diagram . , and .
