WEBVTT
00:00:02.900 --> 00:00:09.050
In this video, we’re gonna explain the meaning of some of the vocabulary and notation commonly used in probability.
00:00:11.780 --> 00:00:15.200
An experiment is an activity with an identifiable result.
00:00:15.670 --> 00:00:24.010
For example, if we have a six-sided dice and roll it on the table so that it lands with one of the numbers facing upwards, we could say rolling the dice is an experiment.
00:00:26.780 --> 00:00:31.960
A scientific experiment is a procedure which aims to make a discovery or test a hypothesis.
00:00:33.590 --> 00:00:36.260
But a probability experiment is a bit different to that.
00:00:36.670 --> 00:00:44.460
It’s a specific procedure which we can exactly repeat as often as we like, and the randomly occurring set of possible results are always the same.
00:00:46.720 --> 00:00:58.000
Some other examples of probability experiments could be flipping a coin to see if it lands heads or tails side up, or picking a disc at random out of a bag containing a variety of coloured discs.
00:01:00.140 --> 00:01:06.480
An outcome is a specific result of an experiment, and the set of all possible outcomes is called the sample space.
00:01:06.920 --> 00:01:13.030
For example, when we roll a regular six-sided dice, one outcome will be that it lands with the number one face up.
00:01:13.240 --> 00:01:18.520
Another outcome would be landing two face up, and so on with all these different possibilities.
00:01:21.130 --> 00:01:27.200
As we said, sample space is the set of all possible outcomes, so we write it using set notation.
00:01:27.450 --> 00:01:31.170
In this case, it’s all the numbers listed: one, two, three, four, five, six.
00:01:32.820 --> 00:01:38.330
And this is an exhaustive set of outcomes because it covers every possible outcome.
00:01:41.120 --> 00:01:44.280
An event is a particular subset of the sample space.
00:01:44.690 --> 00:01:49.720
So, for example, with rolling a dice, a simple event might be rolling a one, or rolling a three.
00:01:50.440 --> 00:02:00.040
But we can define more complex events which make up larger subsets of the sample space, like getting an even number, or getting a prime number, or getting a multiple of three, and so on.
00:02:03.160 --> 00:02:16.350
We measure the likelihood of an outcome or event occurring using the probability scale, which is a continuous scale from zero, which represents an impossible situation, up to one, which represents something that’s certain to occur.
00:02:16.920 --> 00:02:23.140
So, for example, a probability of a half is something that will occur half of the time that we conduct the experiment.
00:02:25.390 --> 00:02:30.720
We can use either fractions or decimals to represent these numbers between zero and one.
00:02:32.810 --> 00:02:40.660
It’s also okay to use percentages to represent the possib- the probability scale, so zero percent up to a hundred percent instead of zero to one.
00:02:40.920 --> 00:02:43.620
But you do need to include the percentage sign in there as well.
00:02:45.520 --> 00:02:53.850
So one important result is, if 𝑃 is representing the probability of an event occurring, then it must be between zero and one.
00:02:53.890 --> 00:03:00.380
So we can represent that using this inequality, zero is less than or equal to 𝑃 is less than or equal to one.
00:03:03.410 --> 00:03:09.570
There’s an important bit of notation that we commonly use to represent probabilities, and this saves us a lot of writing.
00:03:09.930 --> 00:03:25.360
So, for example, instead of writing the probability of getting a five when I roll a fair six-sided dice is a sixth, I can just write 𝑃 brackets five equals one-sixth because the shorthand way of writing the same thing that all mathematicians will understand.
00:03:27.650 --> 00:03:34.670
A probability model is a mathematical description of an event which lists all of the possible outcomes along with their probabilities.
00:03:35.110 --> 00:03:36.530
This can be summarised in a table.
00:03:37.710 --> 00:03:49.660
For example, if a bag contains nine similar discs, two which are red, three which are blue, and four which are green, if we draw out one at random, each individual disk is equally likely to be selected.
00:03:51.160 --> 00:03:56.250
So there are nine discs in there in total, and I’m just gonna pick one disc out of that.
00:03:56.250 --> 00:03:58.700
So how many ways are there of getting red discs?
00:03:58.770 --> 00:04:01.260
Well there are two ways out of nine of getting a red disc.
00:04:02.210 --> 00:04:05.490
So the probability of a red disc is two over nine.
00:04:05.980 --> 00:04:11.130
There are three blue discs out of the nine, so the probability of getting a blue disc is three out of nine.
00:04:11.560 --> 00:04:15.800
And for a green disc, there are four ways of getting a green disc out of the nine possibles.
00:04:15.970 --> 00:04:19.200
So the probability of drawing a green disc is four-ninths.
00:04:19.420 --> 00:04:22.850
So these are the probabilities of these different outcomes.
00:04:24.840 --> 00:04:29.870
Notice how we wrote the blue disc probability as three over nine, three-ninths.
00:04:30.200 --> 00:04:33.780
We could’ve simplified this to the equivalent fraction of a third, but we don’t have to.
00:04:34.340 --> 00:04:40.370
In fact, three over nine tells us how many ways of selecting a blue disc there are, and how many discs there are in total in the bag.
00:04:40.370 --> 00:04:44.470
So it’s actually more informative than simplifying the fraction to a third.
00:04:46.250 --> 00:04:50.510
Also notice that the table lists all of the possible outcomes for the experiment.
00:04:50.900 --> 00:04:56.560
So the sum of the probabilities must be one, one of those outcomes is certainly going to occur.
00:04:57.410 --> 00:05:00.190
So this table represents a probability model.
00:05:02.850 --> 00:05:08.510
When we roll a regular fair dice, the possible outcomes are one, two, three, four, five, or six.
00:05:08.830 --> 00:05:11.890
And all of these outcomes are equally likely to occur.
00:05:12.330 --> 00:05:15.230
This is the definition of fair in probability.
00:05:22.030 --> 00:05:28.170
When one or more outcomes are more or less likely to occur than others, we call the experiment biased.
00:05:28.940 --> 00:05:34.250
For example, when you buy a lottery ticket, it’ll either be a winning ticket or it’ll be a losing ticket.
00:05:34.320 --> 00:05:35.750
There’s two possible outcomes.
00:05:36.040 --> 00:05:41.780
But in most lotteries, the probability that the ticket will lose is much greater than the probability that it will win.
00:05:42.380 --> 00:05:46.720
So doing the lottery is biased; you’re more likely to lose than you are to win.
00:05:53.490 --> 00:05:59.630
There’s some experiments we may not know the probability of each outcome occurring before we try them out.
00:06:00.020 --> 00:06:08.360
For example, if we drop a drawing pin on the floor from a height of, say, one metre, when it lands and settles, the pin will either be pointing upwards or downwards.
00:06:08.360 --> 00:06:12.810
But we don’t know the theoretical likelihood of either a- scenario occurring.
00:06:14.410 --> 00:06:18.810
In this case, we can carry out the experiment lots of times and record the outcomes.
00:06:19.160 --> 00:06:25.190
We can then use the proportion of occasions on which each outcome occurred, as an estimate of the probability for the outcome.
00:06:26.130 --> 00:06:32.140
We call this proportion the relative frequency, or sometimes the experimental probability of each outcome.
00:06:32.560 --> 00:06:41.170
The more times we repeat the experiment, the more confident we get that our relative frequencies are a reliable estimate of the actual probabilities of each outcome.
00:06:41.720 --> 00:06:51.500
So in this case, we’ve done a thousand trials of the experiment and the pin landed up on six hundred and thirty-two occasions and down on three hundred and sixty-eight occasions.
00:06:51.790 --> 00:07:03.700
So our estimate of the probability of the pin landing up is six three two over a thousand, and the estimate of the probability of the pin landing down is three six eight over a thousand.
00:07:04.030 --> 00:07:13.290
Those relative frequencies, those proportions of the occasions that those things occur, are our estimates of the actual probabilities of those outcomes happening.
00:07:16.140 --> 00:07:17.580
Independent Probability.
00:07:17.850 --> 00:07:23.610
Two events are said to be independent if the outcome of one has no effect at all on the outcome of the other.
00:07:24.240 --> 00:07:29.750
For example, if we flip a coin, it can land heads or tails, and the probability of each of those is a half.
00:07:30.260 --> 00:07:34.750
If we roll a fair dice, the outcomes are one, two, three, four, five, or six.
00:07:34.750 --> 00:07:37.920
And again, they’ve all got equal probabilities of a sixth.
00:07:39.430 --> 00:07:44.420
Each of these two things, flipping a coin or rolling a fair dice, are independent.
00:07:44.890 --> 00:07:53.840
The probability of getting one, two, three, four, or five or six on a dice is not affected by whether or not the coin lands heads or tails side up in the other experiment.
00:07:55.330 --> 00:08:03.500
So another way of putting in it, these two experiments are independent because if we know the result of one, it doesn’t change the probabilities of the outcomes of the other.
00:08:06.350 --> 00:08:06.700
Okay.
00:08:06.700 --> 00:08:10.870
So in an experiment where we roll a fair dice, this is our probability model.
00:08:11.390 --> 00:08:15.040
There are six possible outcomes, one, two, three, four, five, six and the probabilities of all the six.
00:08:15.040 --> 00:08:16.550
And we’ve seen that many times before.
00:08:16.870 --> 00:08:18.550
Now let’s define two events.
00:08:18.550 --> 00:08:23.600
Event one is where the result was an even number, so it could’ve been a two or a four or a six.
00:08:23.840 --> 00:08:29.210
And event number two is where the result was a prime number, so it could’ve been a two or a three or a five.
00:08:29.630 --> 00:08:33.250
Now these two events are not independent events.
00:08:34.790 --> 00:08:43.250
So remember, we said that two events are independent if knowing the result of one doesn’t affect the probabilities of the outcomes of the other.
00:08:43.940 --> 00:08:53.040
That’s not the case here cause if we know that the result was even, we know that the number that came up with either two or four or six.
00:08:53.410 --> 00:08:58.240
So if we know that we’ve got two or four or six, what’s the probability that the result is a prime number?
00:08:58.490 --> 00:09:00.410
Well only one of those three is a prime number.
00:09:00.410 --> 00:09:06.740
So the result there would be, the probability of being prime would only be a third.
00:09:07.500 --> 00:09:17.140
If we didn’t know that event one had occurred, that the result was even, so we didn’t know that was odd or even, there are three different possibilities for prime numbers.
00:09:17.410 --> 00:09:23.550
So the probability of it being a prime is three out of the six possible outcomes, would be a half.
00:09:25.110 --> 00:09:29.520
So knowing the result of one event affects the probability of another event.
00:09:30.030 --> 00:09:38.400
Now that didn’t occur in our last example because if we knew the result on a dice, that didn’t tell us anything about the result of flipping a coin.
00:09:38.400 --> 00:09:40.360
Those two things were completely independent.
00:09:42.860 --> 00:09:48.930
So dependent and independent probabilities will become very important when you start tackling more complicated questions.
00:09:52.250 --> 00:09:57.210
When two events can’t occur at the same time, then we call the mutually exclusive events.
00:09:57.590 --> 00:10:02.130
For example, let’s say we randomly generate an integer from one to ten, inclusive.
00:10:02.630 --> 00:10:10.730
The events, it’s a multiple of two and it’s a factor of nine, are mutually exclusive because there are no multiples of two that are also factors of nine.
00:10:11.040 --> 00:10:16.950
If you know that the number generated is a multiple of two, then it can’t possibly be a factor of nine, and vice versa.
00:10:18.300 --> 00:10:26.180
So events one and two are mutually exclusive because if it’s a multiple of two, then the probability of it being a factor of nine is zero.
00:10:26.550 --> 00:10:31.010
And if it’s a factor of nine, then the probability of it being a multiple of two is also zero.
00:10:33.040 --> 00:10:37.650
One more example of that might be when you roll a dice, thinking about odd numbers or even numbers.
00:10:37.930 --> 00:10:40.580
If it’s an odd number, it can’t possibly be an even number.
00:10:40.630 --> 00:10:43.750
If it’s an even number, it can’t possibly be an odd number.
00:10:43.750 --> 00:10:49.130
So those two things, even and odd, are mutually exclusive outcomes of that experiment.
00:10:51.780 --> 00:10:58.180
So here’s a list of the probability vocabulary that hopefully you should now understand, as a result of watching this video.
00:10:58.990 --> 00:11:00.110
Let’s hope you find it useful.