Copyright © University of Cambridge. All rights reserved.
Here is an example of something called a 'probability distribution'.
What is the probability for no sixes, or for one six only, or two sixes, three, four, or all five?
To understand more about distributions look at the Distribution Maker environment below. Hold down SHIFT while you click audio for a commentary to help you make sense of what you are looking at. Holding down SHIFT opens the audio in a separate window; minimise that window to see this page again.
The audio commentary talks about two other distributions: rolling one die, and the sum of two dice, before discussing the Five Dice context above, and will help you see that a probability distribution is a profile of how the probability varies as the variable we are interested in (for example the numbers of sixes seen each time) ranges randomly across its set of possible values.
Use the distribution maker to throw a single die 100 times. Do this two or three times. Why isn't the graph of actual values a horizontal line like the yellow probability distribution?
Now use the 'copy to clipboard' facility and paste the values to something like a spreadsheet. The 'Copy to Clipboard' button puts the data to the Windows clipboard. If, for example in Excel or Word, you then click on Paste, the data will appear.
If you do that ten times you will have data for 1000 throws. You should be able to sort those in the spreadsheet which will make counting the frequency much easier (in Excel for example, get all the data into one column, select the column and then use Sort from the Data menu)
How many of the samples of 100 are less even (horizontal or rectangular) than the combined sample of 1000?
When you have grasped the connection between a sample and the abstract probability distribution (the conditions under which that sample has been drawn) you are ready for Data Matching as a natural next challenge.