Let's say you have a jar with X balls in it, say 10,000. The balls are
either red, green, or blue. You reach into the jar, select a ball,
record the color, and return the ball to the jar. How many times do
you need to repeat this (samples) in order to get certain accuracy Y?
I'm looking for the following information:
What is the statistical way to measure accuracy as related to this
problem? I think its something related to the confidence interval, but
I can't remember what that means exactly.
What is standard error as it applies to this problem? (again I
remember the term but not what it means)
Are there any other statistical accuracy terms that apply to this problem?
And the heart of the question is, please make a formula where the
inputs are the number of balls in the population (which will always be
in one of three states), the statiscal terms for the accuracy desired
(as you will explain from above) and outputs the number of samples
needed.
Thanks! Feel free to email any questions to indigoae@gmail.com |
Request for Question Clarification by
elmarto-ga
on
14 Apr 2005 04:11 PDT
Hello azureae,
From your question, it's not clear what you mean by "accuracy". Are
you trying to measure, using a sample, how many balls are red, green
or blue in the population?
Regards,
elmarto
|
Clarification of Question by
azureae-ga
on
14 Apr 2005 14:45 PDT
Yes, I am trying to measure to estimate via a sample how many red,
green, and blue ball there are in the population. There will be all
three, although one of the three, always the blue, will be in a lower
proportion. The red and green will vary in number also, I'm just
saying that blue will always be much less (perhaps 1/6th of the others
for example) then red and green (but still detectable via sampling).
Regarding romanak's comment, the formula provided doesn't have the
population size as an input, an the population size will vary greatly.
I can't imagine the population size doesn't matter, because what if
we're sampling say a jar of a trillion balls, is 2796 still
sufficient?
Also, regarding using the estimate of proportion, that is what I'm
trying to figure out, so if I knew that then I wouldn't need to sample
at all!
|
Request for Question Clarification by
elmarto-ga
on
14 Apr 2005 16:19 PDT
Hi azurae!
OK, I see now what you're interested in. However, the logic of this
problem can change substantially for different population sizes.
Specifically, it's very different if we have a "small" or "large"
population. Let's say that initially 50% of the balls are red. If the
population is is sufficiently large, when we take a first red ball,
the probability of taking another red ball in a subsequent draw is
still 50%.
For example, if there are 100,000 balls (so 50,000 red ones), and the
first I draw is red, then the probability of drawing another red ball
is 49,999/99,999, which is almost the same as 50%. If this is the
case, then the required sample size can be determined quite easily.
However, in an extreme example where the population is 4 balls (so 2
are red), the proportion changes after you take the first red ball. If
you take a red ball on the first draw, then the probability of drawing
another one becomes 1/3, which is quite different to 0.5. This
complicates the problem.
So, do you have a "large" enough population?
Regards,
elmarto
|
Request for Question Clarification by
elmarto-ga
on
14 Apr 2005 16:24 PDT
I forgot to mention this in my question. You ask "if we're sampling
say a jar of a trillion balls, is 2796 still sufficient?". The answer
would be yes (if the 2796 figure is correct in the first place). Once
you have a "large" population, you will need the same quantity of
balls in order to determine the proportions, no matter if the
population size is 1 million or 1 trillion.
|
Clarification of Question by
azureae-ga
on
14 Apr 2005 16:37 PDT
Yes, the populations will be very large. There will be several
populations, the largest of which will be about 10 trillion, the
smallest of which will be about 200k.
Also please note that I mentioned that once the ball is observed, it
is returned to the jar. So in the case of the 4 ball problem (while
not relevant) please notice that the same ball can be drawn multiple
times.
|