Sample spaces and Pebble World
Definition
The sample space : of an experiment is the set of all possible outcomes of the experiment.
An event : is a subset of the sample space , and we say that occurred if the actual outcome is in .
Notation
Let be a sample space and be the actual outcome of the experiment:
| English | Sets |
|---|---|
| Events and occurrences | |
| sample space | |
| is a possible outcome | |
| is an event | |
| occurred | |
| something must happen | |
| New events from old events | |
| or (inclusive) | |
| and | |
| not | |
| or , but not both | |
| at least one of | |
| all of | |
| Relationships between events | |
| implies | |
| and are mutually exclusive | |
| are a partition of | , for |
Naive definition of probability
Definition
Let be an event for an experiment with a finite sample space . The naive probability of is:
Note
The naive definition is very restrictive in that it requires to be finite, with equal mass for each pebble. There are several important types of problems where the naive definition is applicable:
- when there is symmetry in the problem that makes outcomes equally likely.
- when the outcomes are equally likely by design.
- when the naive definition serves as a useful null model. In this setting, we assume that the naive definition applies just to see what predictions it would yield, and then we can compare observed data with predicted values to assess whether the hypothesis of equally likely outcomes is tenable.
Strategy
A good strategy when trying to find the probability of an event is to start by thinking about whether it will be easier to find the probability of the event or the probability of its complement.
How to count
Theorem
Multiplication rule
Consider compound experiment consisting of two sub-experiments, Experiment and Experiment . Suppose that Experiment has possible outcomes, and for each of those outcomes Experiment has possible outcomes. Then the compound experiment has possible outcomes.
Note
It is often easier to think about the experiments as being in chronological order, but there is no requirement in the multiplication rule that Experiment has to be performed before Experiment .
Theorem
Sampling with replacement
Consider objects and making choices from them, one at a time with replacement (i.e., choosing a certain object does not preclude it from being chosen again). Then there are possible outcomes (where order matters, in the sense that, e.g., choosing object 3 and then object 7 is counted as a different outcome than choosing object 7 and then object 3.)
Theorem
Sampling without replacement
Consider objects and making choices from them, one at a time without replacement (i.e., choosing a certain object precludes it from being chosen again). Then there are possible outcomes for , and 0 possibilities for (where order matters). By convention, for .
Note
It is important to think of the objects or people in the population as named or labeled. For example, if there are n balls in a jar, we can imagine that they have labels from 1 to n, even if the balls look the same to the human eye.
Strategy
Adjusting for overcounting
In many counting problems, it is not easy to directly count each possibility once and only once. If, however, we are able to count each possibility exactly times for some , then we can adjust by dividing by . For example, if we have exactly double-counted each possibility, we can divide by 2 to get the correct count. We call this adjusting for overcounting.
Note
A binomial coefficient counts the number of subsets of a certain size for a set, such as the number of ways to choose a committee of size k from a set of n people. Sets and subsets are by definition unordered, e.g., , so we are counting the number of ways to choose k objects out of n, without replacement and without distinguishing between the different orders in which they could be chosen.
Definition
Binomial coefficient
For any nonnegative integers and , the binomial coefficient , read as “n choose k”, is the number of subsets of size k for a set of size n.
Theorem
Binomial coefficient formula
For , we have:
For we have
Theorem
Binomial theorem
The binomial theorem states that:
Story Proofs
Definition
A story proof is a proof by interpretation. For counting problems, this often means counting the same thing in two different ways, rather than doing tedious algebra. A story proof often avoids messy calculations and goes further than an algebraic proof toward explaining why the result is true. The word “story” has several meanings, some more mathematical than others, but a story proof (in the sense in which we’re using the term) is a fully valid mathematical proof.
Example
Choosing the complement
For any nonnegative integers and with , we have:
Story proof: Consider choosing a committee of size k in a group of n people. We know that there are (n k ) possibilities. But another way to choose the committee is to specify which n − k people are not on the committee; specifying who is on the committee determines who is not on the committee, and vice versa. So the two sides are equal, as they are two ways of counting the same thing.
Non-naive definition of probability
Definition
General definition of probability
A probability space consists of a sample space and a probability function which takes an event as input and returns , a real number between and , as output. The function must satisfy the following axioms:
- If are disjoint events, then:
(Saying that these events are disjoint means that they are mutually exclusive: for .)
Note
The frequentist view of probability is that it represents a long-run frequency over a large number of repetitions of an experiment: if we say a coin has probability 1/2 of Heads, that means the coin would land Heads 50% of the time if we tossed it over and over and over. The Bayesian view of probability is that it represents a degree of belief about the event in question, so we can assign probabilities to hypotheses like “candidate A will win the election” or “the defendant is guilty” even if it isn’t possible to repeat the same election or the same crime over and over again.
Property
Properties of probability
Probability has the following properties, for any events A and B.
- .
- if , then .
- .
Theorem
Inclusion-exclusion
For any events ,