Google Answers Logo
View Question
 
Q: Statistics: What score would I get by guessing alone? ( No Answer,   10 Comments )
Question  
Subject: Statistics: What score would I get by guessing alone?
Category: Science > Math
Asked by: alwayscurious-ga
List Price: $20.00
Posted: 31 Jan 2006 05:09 PST
Expires: 06 Feb 2006 16:55 PST
Question ID: 439581
I need some help with statistics. I want to give my students a
geography test where they will match the names of countries in list
with a map where the countries are labeled by numbers. I will test
them on 50 countries in Africa.

My question for you: What is the average score a student would obtain
through guessing alone? Let's assume that he/she matches one country
on the map to one name in the list with no duplicates.

Please show your work.  = )

Thanks. (PS I tip well for great answers.)

Request for Question Clarification by mvguy-ga on 31 Jan 2006 08:33 PST
The answer is 1.

I can't explain this in a mathematically rigorous way, but I hope I
can make it so you can see how I arrived at the answer.

Let's say there are just two countries on your map. I've made a chart
below where each letter represents a country to show the various
choices that the students can make. In the second column is the number
of correct answers.

A1
B2     2

A2
B1     0

(In these examples, a correct answer is A1, B2, C3, etc.)

As you can see, the number of possible answers the students can choose
is 2. The total number of correct answers among all possibilities is
2, making the average score of 1.

Now let's do the same thing with three countries:

A1
B2      3
C3

A1
C2      1
B3

B1
C2      0
A3

B1
A2      1
C3

C1
A2       0
B3

C1
B2       1
A3

Here we have 6 possible combinations, or 3!, and the number of correct
answers is also 6. Divide the number of combinations by the total of
correct answers, and we end up with 6.

Now let's try it with four countries:

A1      4
B2
C3
D4

A1      2   
B2
D3
C4

A1      2
C2
B3
D4

A1     1    
C2
D3
B4

A1     1
D2
B3
C4

A1     2
D2
C3
B4

B1     2   
A2
C3
D4

B1
A2     0
D3
C4

B1     1
C2
A3
D4

B1     0
C2
D3
A4

B1     0
D2
A3
C4

B1
D2     1
C3
A4

C1     1
A2
B3
D4

C1     0
A2
D3
B4

C1     2
B2   
A3
D4

C1     1
B2
D3
A4

C1     0
D2
A3
C4

C1     1
D2
C3
A4

D1     0
A2     
B3
C4

D1     1
A2
C3
B4

D1     1
B2
A3
C4

D1     1
B2
C3
A4

D1     0
C2
A3
B4

D1     0
C2
B3
A4


The number increases rapidly! Here we have 24 combinations with 24
correct answers, so the average score will be 1.

I hope you can see the pattern here. It will always be 1.

Another way of looking at it, is the point out that with each of the
50 choices, the student has a 1/50 chance of getting the correct
answer. Multiply 50 times 1/50 and you get a total average score of 1.

I don't have the skills to provide a mathematical proof beyond what I
have provided here. If this is sufficient for you, please let me know,
and I will post this as an official answer.

If it isn't sufficient, just say so, and hopefully some other
Researcher (maybe a math teacher?) can come along and flesh out my
answer to your satisfiction.

Thanks!

mvguy-ga

Request for Question Clarification by sublime1-ga on 31 Jan 2006 10:47 PST
alwayscurious...

In reading my colleague mvguy-ga's well-reasoned response,
it struck me that you had asked what would be the average
"score" for your students, and "1" does not translate into
a percentile score, as such.

If we extend what's been posted, however, it's clear that
the average score, in percent of correct answers, will vary
with the number of countries.

In the case of 2 countries, only 2 answers are possible.
One would score 100%, and the other 0%, for an average of
50%.

In the case of 3 countries, only 6 answers are possible.
One would score 100%, three would score 33.3333%, and 
two would score 0%, giving a ratio 200%/6 answers, for
an average of 33.3333%.

It becomes clear that the pattern is that the average
score will be 100% divided by the number of countries,
so, with 4 countries, the average score would be 25%.

In other words, mvguy's "1" out of however many countries
there are.

Therefore, with 50 countries, the average score, based
on guessing, should be a mere 2%, or 1 out of 50.

sublime1-ga

Request for Question Clarification by czh-ga on 31 Jan 2006 11:08 PST
Hello alwayscurious-ga,

I'm wondering if this is an abstract problem or a real question about
an academic testing situation. If you're really asking about a
classroom test then I think the realities of the context will
influence the likely average score. Are you teaching a geography class
or something else? Are you located in an African country or somewhere
else? Are you teaching elementary students or prospective geography
Ph.D.s? Unless this is an abstract probability problem, knowledge
about the context of the question will have a lot to do with figuring
out a probable average score.

So, what is the real problem we're trying to solve?

~ czh ~

Clarification of Question by alwayscurious-ga on 31 Jan 2006 12:16 PST
Wow. I'm impressed with your interest in this question. Thanks.

Let me clarify some points. First, this is a -real- testing situation.
I will be giving a real test as described. My expectation is that many
students will not -know- any of the countries in Africa, but they'll
complete the test anyway. They'll be guessing on every one of them,
though. How many the guessing student like that likely to get right?
The answer is some mean score and there will be some variation because
some students be luckier guessers than others.  = )

I'm looking for an answer that uses more mathematics/statistics than
pure, reasoned logic. I was expecting someone to talk about "sampling
without replacement"* and and then show some formula that for any size
of matching test, the mean correct responses and standard deviation
can be calculated according to the formula.

*Note: I'm not sure this is the actual statistical
assumption/approach. I just use it as an example.

So, MVGUY, I like your reasoning, but I'm not sure if you're right. I
guess I am looking for a mathematical proof of sorts.

SUBLIME is on the right track by noticing the variance.

CZH: I'm dealing with 5th and 6th grade students in the US. I want to
be able to say--if it's true of course--"The average score of students
on the pretest was 4, which on a matching test with 50 items is within
the range of what you would expect to see if every student was
guessing."

Thanks for all of your efforts.

Request for Question Clarification by sublime1-ga on 31 Jan 2006 12:56 PST
alwayscurious...

Given your choice of phraseology:

"The average score of students on the pretest was 4, which on
 a matching test with 50 items is within the range of what you
 would expect to see if every student was guessing."

...it seems to me that mvguy's original choice of wording was
correct, and you can expect to see an -average- score of 1 out
of 50 correctly answered. That's based on the assumption that
his reasoning is correct, even if it's not mathematically 
formulized, and it does seem correct to me.

Request for Question Clarification by mvguy-ga on 31 Jan 2006 18:19 PST
The answer of 1 just doesn't seem very intuitive, does it? It seems
like the average would be higher. Here's another way to look at it:

For the first question, there's a 1/50 chance the student will get the
correct answer. So the average number of points the student will pick
up from the first question is 1/50.

For the second question, the chance the student will get the correct
answer is 1/49 (49 is the number of answers the student can choose
from) times the chance that the right answer is still remaining, or
49/50 (because there's a 1/50 chance that the correct answer has
already been taken).

In other words the chance the student gets question 2 right is 1/49
times 49/50, which is 1/50. So the average number of points the
student gets from the second question is 1/50.

For the third question, the chance the student will get the correct
answer is 1/48 times 48/50 (since there's a 2/50 chance the right
answer has been taken). Again, then, the chance of getting the right
answer is 1/50.

And on it goes. Since the average number of points an average student
will get from each question is 1/50, the overall average is 50/50 or 1
correct answer.

The interesting thing about this, as one of the commenters has already
suggested, is that the average will be 1 correct answer regardless of
how many questions are on the test.

Does this explain things?

Clarification of Question by alwayscurious-ga on 01 Feb 2006 05:44 PST
It looks like the answer is in... Thank you mvguy, sublime, czh,
rracecarr, and ansel001 for your efforts--rracecarr's answer is what I
was looking for.

By the way, I found a presentation online that addresses my question,
and provides the correct answer, but doesn't offer the mathematical
support--pay particular note to the graph on page 15.

www.ipmaac.org/conf05/salyards.pdf

As an experiment, I created an Excel spreadsheet where I listed the
numbers 1 through 50 in column one and in column two. Then I generated
50 random numbers to fill the third column using Excels RAND function.
I sorted the second column and third by the third column--essentially
to randomize the second column. Finally I compared the first and
second columns to see how many matches there were. Sure enough, the
distribution accurately characterized the number of matches.

Request for Question Clarification by mvguy-ga on 01 Feb 2006 06:13 PST
Rracecarr-ga isn't a Google Researcher, so this one's a freebie for you.
Answer  
There is no answer at this time.

Comments  
Subject: Re: Statistics: What score would I get by guessing alone?
From: rracecarr-ga on 31 Jan 2006 14:41 PST
 
I have no proof or rigorous analysis to offer, but...

For a sufficiently large number (M) of questions, the probablility of
N correct responses (where N<<M) is

exp(-1)/N!

exp(-1) is 1/e = 0.367879441171...

So, the probabilities are as follows

# correct    Probability
0            0.3678794
1            0.3678794
2            0.1839397
3            0.0613132
4            0.0153283
5            0.0030657
6            0.0005109
7            0.0000730
8            0.0000091

In a class of 30 students, all guessing randomly, you'd expect a
couple to get 3 right, 5 or so to get 2 right, and the rest of the
class split between 1 right and none right.

The fascinating thing is that these probablilities don't depend on the
number of questions, N (as long as it's large enough).  All the
decimal places shown here (and many more) are correct whether N is 25,
50, or 100.
Subject: Re: Statistics: What score would I get by guessing alone?
From: ansel001-ga on 31 Jan 2006 16:34 PST
 
Rracecarr is correct.  This is a matching without replacement problem.
 As an example, if n married couples are at a party and the men and
women are randomly paired for a dance, how many couples would be
expected to be matched with their partners?  As the number of
questions/couples increases, the distribution approximates a Poisson
distribution with mean of one.  What Rracecarr didn't say is that the
expected number of correct answers implied by his probablities is one.

Mvguy, your answer of one is correct but I would differ a little on
your reasoning.  Because of the matching, for every question that is
answered wrong, there is also another question that cannot be answered
correctly.  Similarly, if one question is answered right, the number
of choices for the remaining questions decreases by one.
Subject: Re: Statistics: What score would I get by guessing alone?
From: ansel001-ga on 31 Jan 2006 16:37 PST
 
I might add one more thing.  The convergence to the Poisson
approximation is rapid.  By the time you get to about ten questions,
the probabilities rracecarr shows won't differ significantly from what
they would be for a million questions.
Subject: Re: Statistics: What score would I get by guessing alone?
From: alwayscurious-ga on 01 Feb 2006 05:49 PST
 
rracecarr:

Please submit your comment as an answer.
Subject: Re: Statistics: What score would I get by guessing alone?
From: ettarzanestheureux-ga on 01 Feb 2006 08:50 PST
 
First of all, my first language is French, so I apologize for any confused term.  

This problem is quit simple and can be resume in those words:  the
student have 50 tasks to do.  In every step, he (she) has a certain
chance of success.  The student is successful at every single step? by
guessing!
 
In the first step, chances are 1/50 and the student is right!
In the second step, chances are 1/49, the student in right? and so on.

Applying the multiplicative rule of probabilities we (simply) find:

Probability of success for all countries (P) = 1/50 x 1/49 x 1/48?

P = 1/(N) x 1/(N-1) x 1/(N-2) ? where N = number of items (50)

P = 1/N!

The average score a student would obtain through guessing alone in
this kind of test with 50 items is equal to one chance on the number
of permutations possible between 50 elements, say 50 factorial (!).

50! = 50 x 49 x 48 ? = 3.04E64

The answer is:  3.29E-65

In other words, I would recommend revising before the exam.
Subject: Re: Statistics: What score would I get by guessing alone?
From: ettarzanestheureux-ga on 01 Feb 2006 11:16 PST
 
The answers (with a "s") provide by mvguy-ga was TOTALLY wrong (what
does this guy doing around here?).  Unfortunately, the answer accept
by alwayscurious-ga was approximative, but also wrong on a
mathematical point of view.

Regards
Subject: Re: Statistics: What score would I get by guessing alone?
From: docanddoc-ga on 01 Feb 2006 11:25 PST
 
I ran a Monte Carlo simulation with 100000 samples and 50 questions and got:
Number   Percentage
Correct
 0       0.3705
 1       0.3652
 2       0.1832
 3       0.0624
 4       0.0150
 5       0.0030
 6       0.0006
 7       0.0001

which compares well with rracecarr.

The mean number correct was 1.00 with a standard deviation of 1.00 
This is reliable to at least 2 significant digits.

Now, the reason that this is so is because for n=50 questions, Each
question has a 1/n chance of being correct, and after n guesses it is
on average 1 correct answer.

I also considered what would happen if a student was guessing on only
40 or 30 questions. This was run with only 10000 samples(its a little
noisier).

Number of     Number    Standard
Questions     Correct   Deviation
Guessed
    1.0000    1.0000         0
    2.0000    1.0008    1.0000
    3.0000    0.9839    0.9894
    4.0000    1.0098    1.0103
    5.0000    0.9691    0.9807
    6.0000    0.9965    1.0086
    7.0000    0.9878    0.9917
    8.0000    1.0171    1.0008
    9.0000    0.9855    0.9757
   10.0000    1.0083    1.0049
   11.0000    1.0150    1.0085
   12.0000    1.0060    0.9908
   13.0000    1.0030    1.0043
   14.0000    0.9965    0.9970
   15.0000    1.0147    1.0138
   16.0000    1.0049    1.0007
   17.0000    1.0064    1.0051
   18.0000    1.0018    0.9963
   19.0000    0.9963    1.0035
   20.0000    1.0026    1.0032
   21.0000    0.9959    0.9914
   22.0000    0.9843    0.9955
   23.0000    1.0136    1.0264
   24.0000    0.9944    0.9946
   25.0000    1.0052    0.9970
   26.0000    0.9933    0.9947
   27.0000    1.0122    1.0172
   28.0000    0.9969    1.0066
   29.0000    1.0043    1.0065
   30.0000    0.9916    0.9963
   31.0000    0.9865    0.9989
   32.0000    1.0122    1.0007
   33.0000    0.9949    1.0021
   34.0000    0.9952    0.9950
   35.0000    0.9934    0.9899
   36.0000    0.9994    0.9991
   37.0000    1.0047    1.0055
   38.0000    1.0113    1.0151
   39.0000    0.9852    0.9877
   40.0000    1.0130    1.0039
   41.0000    0.9892    0.9919
   42.0000    0.9916    0.9888
   43.0000    1.0067    1.0058
   44.0000    1.0091    1.0120
   45.0000    0.9859    0.9898
   46.0000    0.9974    0.9937
   47.0000    1.0196    1.0076
   48.0000    1.0133    1.0090
   49.0000    0.9913    0.9935
   50.0000    0.9888    1.0031

The uninteresting thing is that the number right is 1.
The interesting thing is that the standard deviation is also 1. ?!?!?!?
I checked it by hand up to n=3.
Subject: Re: Statistics: What score would I get by guessing alone?
From: rracecarr-ga on 01 Feb 2006 11:44 PST
 
alwayscurious--

Glad you liked the comment.  As mvguy says, I'm not a researcher.
Subject: Re: Statistics: What score would I get by guessing alone?
From: boquinha-ga on 01 Feb 2006 11:55 PST
 
Not sure if this is helpful in your teaching or not, but for what it's
worth, I *highly* recommend a rather addictive game called "10 Days in
Africa." If you are inclined at all to help reinforce what you teach
using games, this one is a winner. In just a few rounds, you already
start learning/memorizing African geography quite well. And it's a lot
of fun, too!

The game is made by a great game company called "Out of the Box
Publishing" (the creators of the wildly-popular "Apples to Apples"
game). They also have "10 Days in the USA," "10 Days in Europe," and
from what I understand, they're in the works with "10 Days in Asia."

http://www.otb-games.com/africa/index.html

Sincerely,
Boquinha-ga
Subject: Re: Statistics: What score would I get by guessing alone?
From: hfshaw-ga on 02 Feb 2006 14:20 PST
 
As ansel001 pointed out, in the limit of an infinite number of
questions, the distribution in question converges to the Poisson
distribution with a mean of 1:  The probability of obtaining C correct
matches is given by P(C) = exp(-1)/C!

For a finite number of questions, N, the formula for the distribution
is given exactly by:

P(C) = 1/(C!)* SUM{from j = 0 to (N-C)} of [(-1)^j/j!]

See the page at http://www.ds.unifi.it/VL/VL_EN/urn/urn6.html for
hints on how to get this result, and also for a nice online java
applet that lets you calculate and display the distribution for
various choices of N.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy