Google Answers Logo
View Question
 
Q: Economic statistics ( Answered 5 out of 5 stars,   0 Comments )
Question  
Subject: Economic statistics
Category: Business and Money > Economics
Asked by: makbool-ga
List Price: $20.00
Posted: 18 Aug 2003 16:47 PDT
Expires: 17 Sep 2003 16:47 PDT
Question ID: 246199
HI
I have done economical statistics long time ago.  I would like to get
help to have the following two questions solved.  Can someone help me
please

1. For a particular population of students the average height is 69
inches and the standard deviation is 3.2 inches.

a. if many random samples of size n = 4 were collected, and in each
case the sample mean was calculated, how would these sample mean
fluctuate?

b. Say that with respect to one particular sample the average height
is 70 inches.  Is this a fairly typical sample, or was it a lucky one,
particularly close to the population mean?

c. suppose the sample size is increased four times to n = 16.  If many
random samples of this size n = 16 were collected, and (as in part a.)
in each case the samle mean was calculated, how would these sample
means fluctuate?

d. Answer true or false to the following.  If false correct it.
"A doubling of the sample size quadruples the accuracy of the sample
mean estimating the population mean.  The reason is the square root
divisor in the (formula for) standard error of the sample mean"

2.The following situation was analysed by an Economic Statistics
student:

"On average, an artist sells two art paints per day.  What is the
probability that the artist sells at least one art paint in a given
day?"

The student's analysis correctly deduced the statements and analytical
steps stated below:


N.B. With respect to each statement, you need to provide the
associated justification.  That is, you need to explain why the
statement or step is correct/valid:

a. The given justification describes a case of a Poisson random
experiment.
   Justification:

b. Given that the random variable, X is defined as "number of art
paints sold in an hour interval", the relevant sample space (possible
values of X) is: X = x where x = 0, 1, 2, ...
   Justification:

c.  And their associated probabilities are
   P(X = x) = 

Justification:

D. P(X>=1) = P(X=1)+P(X=2)+P(X=3)+... (a)
= 1 - P(X<1)                          (b)
= 1 - P(X=0)                          (c)
= 1 - 2~0 e~-2 / 0!                   (d) 
= 1 - e~-2                            (e)
= 0.8647

Justification: 
a) :
b) :
c) :
d) :

I thank you for all your help for providing step by step guidelines to
solve the above questions.

makbool

Request for Question Clarification by livioflores-ga on 19 Aug 2003 09:13 PDT
Are you asking about Sampling distribution mean in the first question?
In the affirmative case the answer requested is the mean of the
samples and the standard deviation of the mean of the samples?
The question b is a tricky one, because the height is a continue
variable, and the probability to obtain an exact value is zero, again
this is the kind of answer requested?
One more thing, may be you consider to split the question in two (one
question for each problem), I am confidence to answer the second one
and not really sure about the first problem, but I cannot post a
partial answer and claim the prize, but if you split the question you
will have at least one problem solved.

Thank you.
livioflores-ga
Answer  
Subject: Re: Economic statistics
Answered By: elmarto-ga on 19 Aug 2003 12:56 PDT
Rated:5 out of 5 stars
 
Hi makbool!
Here are the answers to your questions.

1. In order to answer these set of questions, you must first know some
properties of the sample mean. You may want to check them in the
following page.

Sample Mean
http://www.stat.yale.edu/Courses/1997-98/101/sampmn.htm

We're particularly interested in the mean and standard deviation (SD)
of the sample mean. Given a sample of size n, we have n independent
observations, which can be considered random variables. Since all
these observations come from the same population, they are independent
and identically distributed (iid) random variables. Now, the sample
mean is defined as:

M = (X1 + X2 + X3 + ... + Xn) / n

where X1 is the first observation, X2 the second one, etc. What is the
expected value of this statistic?

E(M) = E[(X1 + X2 + X3 + ... + Xn) / n]
     = (1/n) * E[(X1 + X2 + X3 + ... + Xn)
     = (1/n) * [E(X1) + E(X2) + ... + E(Xn)]
     = (1/n) * [ 69   +   69  + ... +  69]
     = (1/n) * [ n*69 ]
     = 69

I have used standard properties of the expected value (E) operator
here. You can find them at:

Expectation (check especially "properties of expectation")
http://bme.bu.edu/senlab/lecture11.ppt

Therefore the first result we have here is that the expected value of
the sample mean is equal to the population mean.

Let's see now what happens with variance of the sample mean (and thus
with SD). We know that the population variance is 3.2^2 = 10.24.

Var(M) = Var[(X1 + X2 + X3 + ... + Xn) / n]
       = (1/n^2)*Var(X1 + X2 + X3 + ... + Xn)
       = (1/n^2)*[Var(X1)+Var(X2)+...+Var(Xn)]
       = (1/n^2)*[ 10.24 + 10.24 +...+10.24 ]
       = (1/n^2)* (n*10.24)
       = (1/n)*10.24

The third line has made use of the fact that the observations are
independent; so the variance of the sum is the sum of variances. Check
some properties of the variance at:

Variance - Properties
http://www.xycoon.com/properties.htm

Variance of the sum of 2 independent variables
http://people.ccmr.cornell.edu/~muchomas/8.04/Lecs/lec_statistics/node14.html#SECTION00053000000000000000

Thus, the variance of the sample mean is the variance of the
population divided by the sample size. This is consistent with the
idea that, as we take a bigger sample size, the sample mean becomes
more accurate. Since the SD is the square root of the variance, we
also have that the SD of the sample mean must be then the SD of the
population divided by the square root of n.

Now let's go to your questions:

a. The mean of the sample means would be 69 and the SD would be 3.2/2
= 1.6. So the sample mean would fluctuate around 69 inches.

b. It is not possible to answer this question because we don't know
the distribution of the population; and therefore we don't know the
distribution of the sample mean, so just knowing the mean and SD is
not enough information to tell wether 70 is a likely observation.
However, if we assume that the population follows a normal (Gaussian)
distribution, then the sample mean does too. For a normal
distribution, 95% of the observations are concentrated within 2 SD's
from the mean. So 95% of the observations fall between 69-3.2 and
69+3.2. Since 70 falls in this interval, we could conclude that 70 is
not unlikely at all. However, we must know (or make a reasonable
assumption about) the population distribution in order to answer this
question for small samples.

c. With n=16, the mean is still 69, but the SD falls to 3.2/4 = 0.8.
Therefore, now the dispersion is smaller, so the sample mean becomes
more accurate.

d. The statement is false. The "double" and "quadruple" are reversed:
by quadrupling the sample size, we find that the SD (a measurement of
"accuracy") falls by a 50%, so the estimate becomes twice as
"accurate". This is because, as the statement says, the square root in
the divisor of the SD of the sample mean, as in the formula I've shown
above.

Question 2

a. It is reasonable to assume that the sales of paint arts follow a
Poisson process. The Poisson process describes the distribution of an
integer number of "arrivals" over a continuous time. If we assume that
the artist can receive a request for a paint any time of the day, this
fits a Poisson process. The assumptions of a Poisson process (given in
the following link) imply that:

1) It may only take integer, non-negative values: this is consistent
with the trivial fact that the artist sells an integer quantity of art
paints, which must be clearly greater than or equal to zero.

2) Each "arrival" is independent: again this could be consistent with
this case: a person calling the artist to request an art paint doesn't
change the probability that another person will call. This could be
violated in this case, as some "arrivals" might be people who will
then tell everybody that this is the best artist in the world, so that
the artist will afterwards receive hundreds of requests for paintings.
In this case, an arrival changed the probability that future arrivals
occur, thus violating the assumptions of a Poisson process. For the
sake of the argument, let's assume that this is not the case and that
every "arrival" is independent.

Poisson Process
http://www.sics.se/~andersa/report/node15.html

b. This was explained in point (1) in question a. The artist can't
sell a negative amount of paintings. Neither can he sell "half" a
painting, or a quarter of a painting. Therefore the relevant sample
space are the natural numbers. On the other hand, there's no upper
limit as to how many paintings he can sell in a day: he could receive
a request for a painting every second, every milisecond, etc. Of
course, these events are highly improbable, but are part of the sample
space.

c. 
a) Since he can only sell an integer number of paintings, then the
probability of selling more than 1 painting is equal to the
probability of selling 1, plus the probability of selling 2, etc. This
is consistent with the result in probability theory that says that

Prob ( A U B) = Prob(A) + Prob(B) - Prob(A I B)
where U is union, I is intersection, and A and B are two events. So,

Prob(X>=1) = Prob(X=1 U X=2 U X=3 U...) = P(X=1)+P(X=2)+...

Notice that the "intersection terms" are gone, since the probability
of selling two different number of paintings in a day is zero (you
either sell a quantity or another one).

b. Another result in probability theory is that:

P(Ac) = 1 - P(A)

where Ac is the complement of A. The complement of X>=1 is X<1,
therefore,

P(X>=1) = 1 - P(X<1)

c. Since X can only take on integer values, then X < 1 is the same
event as X=0, since X can't be 1/2, 3/4, etc.

d. 1 - 2^0*e^(-2)/0!

This formula comes directly from the formula of the Poisson
distribution (check the formula in the page given above).  In your
case, using the notation of this page, lambda*t is equal to 2, since
the artist sells 2 paintings per day (this is what's called
"intensity" in this page).

e. Since 2^0 is equal to 1, and 0! is also equal to 1, the formula
clearly becomes 1 - e^(-2)


Google search terms used:
"sample mean" properties
variance properties
poisson process


I hope this helps! If you have any doubt regarding my answer, please
request a clarification before rating it. Otherwise, I await your
clarification and final rating.

Best wishes!
elmarto
makbool-ga rated this answer:5 out of 5 stars
thanks a lot for your step by step explanation
makbool

Comments  
There are no comments at this time.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.
Search Google Answers for
Google Answers  


Google Home - Answers FAQ - Terms of Service - Privacy Policy