Google Answers: General Statistics & Probabilities

View Question

Q: General Statistics & Probabilities ( Answered 5 out of 5 stars

, 0 Comments )

Question

Subject: General Statistics & Probabilities
Category: Reference, Education and News > Homework Help
Asked by: sunshine2-ga
List Price: $40.00

Posted: 19 Jun 2003 09:05 PDT
Expires: 19 Jul 2003 09:05 PDT
Question ID: 219231

Review:  Looking for simple step-by-step explanations and solutions
(preferably by Sunday afternoon)

--1. Suppose the manager of a large farm wanted to study the impact of
the fruit flies on the orange crops on a daily basis over a 6-week
period. On each day a random sample of orange trees were selected from
within a random sample of acres. The daily average number of damaged
oranges per tree and the proportion of trees having damaged oranges
were calculated. The two main measures calculated each day (i.e.,
average number of damaged oranges per tree and proportion of trees
having damaged oranges) are called?

--2. This table contains the opinions of a sample of 200 people broken
down by gender about the latest congressional plan to eliminate
anti-trust exemptions for professional baseball.
           For  Neutral Against Totals 
Female     38    54         12     104 
Male       12    36         48     96 
Totals     50    90         60     200 
(a), construct a table of total percentages. 
(b) of those FOR the plan, ________ percent were females. 
(c) of the males in the sample, ________ percent were FOR the plan. 

--3. Health care issues are receiving much attention in both academic
and political arenas. A sociologist recently conducted a survey of
citizens over 60 years of age whose net worth is too high to qualify
for Medicaid by who have no private health insurance. The ages of 25
uninsured senior citizens were as follows:

60 61 62 63 64 65 66 68 68 69 70 73 73 
74 75 76 76 81 81 82 86 87 89 90 92 
*Identify the third quartile of the ages of the uninsured senior
citizens.

--4. The stem-and-leaf display represents the number of vitamin
supplements sold by a health food store in a sample of 16 days.
Stem    Leaves 
1H      99 
2L      0023 
2H      567 
3L      034 
3H      568 
4L       1 

For this sample, the sum of the observations is 448, while the sum of
the squares of the observations is 13,356.
Note: 
1H means “high tens”: 15, 16, 17, 18, or 19; 
2L means “low twenties”: 20, 21, 22, 23, or 24; 
2H means “high twenties”: 25, 26, 27, 28, or 29; etc. 

*What is the standard deviation of the number of vitamin supplements
sold in this sample?

--5. The probability that a new advertising campaign will increase
sales is assessed as being 0.80. The probability that the cost of
developing the new ad campaign can be kept within the original budget
allocation is 0.40. Assuming that the two events are independent, the
probability that the cost is kept within budget and the campaign sales
increases is?

--6. Suppose A and B are mutually exclusive events where P(A) = 0.4
and P(B) = 0.5. Then P(A and B) = ?

--7. A survey is taken among customers of a fast-food restaurant to
determine preference for hamburger or chicken. Of 200 respondents
selected, 75 were children and 125 were adults. 120 preferred
hamburger and 80 preferred chicken. 55 of the children preferred
hamburger.  What is the probability that a randomly selected
individual is a child or prefers hamburger?

--8. Suppose there are three events A, B, and C with probabilities: 
P(A) = 0.50, P(B) = 0.40, P(C) = 0.30. 
Also, A and B are independent; A and C are mutually exclusive; P(B and
C) = .05.

(a) P(C') = 
(b) P(A and C) =

--9. For a potential investment of $5000, a portfolio has an EMV of
$1000 and a standard deviation of $100. What is the coefficient of
variation?

--10. If we know that the length of time it takes a college student to
find a parking spot in the library parking lot follows a normal
distribution with a mean of 3.5 minutes and a standard deviation of 1
minute, find the point in the distribution in which 75.8% of the
college students exceed when trying to find a parking spot in the
library parking lot.

--11. A food processor packages orange juice in small jars. The
weights of the filled jars are approximately normally distributed with
a mean of 10.5 ounces and a standard deviation of 0.3 ounce. Find the
proportion of all jars packaged by this process that have weights that
fall below 10.875 ounces.

--12. Suppose Z has a standard normal distribution with a mean of 0
and standard deviation of 1. The probability that Z is between -2.89
and -1.03 is?

--13. The amount of time necessary for assembly line workers to
complete a product is a normal random variable with a mean of 15
minutes and a standard deviation of 2 minutes. So, 70% of the products
would be assembled within how many minutes?

Answer

Subject: Re: General Statistics & Probabilities
Answered By: elmarto-ga on 19 Jun 2003 11:06 PDT
Rated: 5 out of 5 stars

Hello sunshine2, Glad to see you again! Here are the answers to your questions. 1. The average number of damaged oranges per tree is called, by definition, the "sample mean". It's calculated by taking the number of observed damaged oranges and dividing it by the number of trees in the sample. Sample Mean http://www.animatedsoftware.com/statglos/sgxbar.htm The proportion of trees having damaged oranges, is called "sample proportion". It's calculated in the same way as the sample mean, as you take the number of observed trees that contain damaged oranges, and divide thsi number by the number of trees in the sample. 2. In the following table, I take the total sample size. Thus, the percentage you see in the square (Female, For), would mean "what percentage of all people in the mean is BOTH a female and for the anti-trust plan?". For Neutral Against Totals Female 19% 27% 6% 52% Male 6% 18% 24% 48% Totals 25% 45% 30% 100% These percentages are calculated in the following way. For example, the percentage in (Female, For) is calculated by taking the number of females that are for the plan (38), dividing it by 200 (total number of people in the sample) and multiplying it by 100. The same goes for the other elements in the table. 3. The third quartile is "the value of a set of data at the third quarter, 75%, when that data has been arranged in ascending order. It is the median value of the upper half of all the values in the data set" Glossary O-U http://www.tta.gov.uk/training/skillstests/numeracy/glossary/o-u.htm In your case, we first have: 60 61 62 63 64 65 66 68 68 69 70 73 73 74 75 76 76 81 81 82 86 87 89 90 92 The upper half of your data is: 73 74 75 76 76 81 81 82 86 87 89 90 92 And the median of this subset is 81. Thus, the third quartile is 81. For more information on what the median is, check Glossary H-N http://www.tta.gov.uk/training/skillstests/numeracy/glossary/h-n.htm 5. First let's see what "independent" means. Events A and B are said to be independent if Prob(A and B) = P(A)P(B). Independent Random Variables http://www.uwm.edu/~stockbri/14indep.pdf Therefore, A = The campaign will increase sales B = The campaign can be kept within the original budget We know that P(A)=0.8 and P(B)=0.4 and that A and B are independent. Thus, P(A and B) = 0.80.4 = 0.24 = 24% There is a 24% probability that both events will happen. 6. First, let's see what the definition of mutually exclusive events is. "Two events are mutually exclusive if it is not possible for both of them to occur..." Mutually exclusive events http://davidmlane.com/hyperstat/A132677.html Thus, events A and B cannot happen at the same time; so, P(A and B)=0 7. In order to solve this one, we must first build a table like the on in question 2. From the problem, we know the following data: Chicken Hamburguer Totals Children 55 75 Adult 125 Totals 80 120 200 Now, it's easy to fill the blank spaces, since we know that the sum of lines and columns whould add up to the values in "Totals". For example, we know that there are 75 children, 55 of which prefer burguers; therefore 75-55=20 children prefer chicken. Now, we know that a total of 80 people prefer chicken, 20 of which are children, so 80-20=60 adults prefer chicken. Lastly, since there are 125 adults, and 60 prefer chicken, we know that 125-60=65 adults prefer hamburguers. Thus we get the following table Chicken Hamburguer Totals Children 20 55 75 Adult 60 65 125 Totals 80 120 200 From this table, we make one with percentages, just like in question 3. We get Chicken Hamburguer Totals Children 10% 27.5% 37.5% Adult 30% 32.5% 62.5% Totals 40% 60% 100% Finally, let's get to the question. What is the probability that a randomly selected individual is a child or prefers hamburger? Let's see which events fulfill the condition (Children or Hamburguer). Children that like chicken fulfill it, so do children that like hamburguers, and so do adults that like hamburguers. Adults that like chickens do not fulfill the condition. So now we just have to add the probability of the events that fulfill the conditions in order to get the solution. So, the probability of randomly sleecting someone who is a children or likes hamburguers is 10% + 27.5% + 32.5% = 70% 8. First I'll solve P(C'). C' is the notation for "the complement of C", which is equivalent to saying "not C". Thus P(C')=P(not C). Check the following definition: "Definition: The complement of an event A is the set of all outcomes in the sample space that are not included in the outcomes of event A. The complement of event A is represented by A'. Rule: Given the probability of an event, the probability of its complement can be found by subtracting the given probability from 1. P(A') = 1 - P(A)" Therefore, P(C') = 1-P(C) = 1-0.3 = 0.7 Regarding P(A and C), this is just the same as in question 6. We know from the question that A and C are mutually exclusive, so P(A and C) = 0 9. We know that EMV is the mean of the value of the portfolio, and the formula for the coefficient of variation is given in the following page Coefficient of Variation http://engineering.uow.edu.au/Courses/Stats/File1586.html Therefore, the coefficient of variation is: 100/1000=0.1 10. Let X be the number of minutes it takes a student to find a parking spot. We know that X hasa normal distribution with mean 3.5 and standard deviation 1. The question here is find the number a that solves P(X > a) = 0.758 In order to solve this we must use a table for the normal distribution. The tables you will find in books and on the web are for standard normal distribution (that is, with mean 0 and standard deviation 1). So we have to convert our random variable X into a standard normal distributed variable. This is easily done: we have to substract the mean and divide by the standard deviation. Thus, we have that P(X > a) =P(X - 3.5 > a-3.5) But X - 3.5 has a standard normal distribution, so I'll rename it as Z. We are looking for: P(Z > a-3.5) = 0.758 In the following page, you will find an applet that calculates P(Z < a) Z table http://davidmlane.com/hyperstat/z_table.html Thus, we need to rewrite your problem P(Z > a-3.5)= 1-P(Z < a-3.5) = 0.758 So, P(Z < a-3.5) = 1-0.758= 0.242 So, now we write 0.242 in the applet box called "Area below Z", and we find that the z-value that solves this is -0.6996. Thus, a-3.5=-0.6996 a=3.5-0.6996=2.8004 Therefore, 75.8% of the students take more than 2.8004 minutes to find a parking spot. 11. This problem is similar to the previous one. Here we know that X is distributed normally with mean 10.5 and standard deviation 0.3. We have to find P(X<10.875). Again, we have to convert X to standard normal, so we have that: P(X < 10.875) =P(X-10.5 < 10.875-10.5) =P( (X-10.5)/0.3 < (10.875-10.5)/0.3) =P( Z < 1.25) We now have to plug 1.25 in the applet box called Z to find that 89.44% of the observations fall below this value; and thus 89.44% of the jars weigh less than 10.875 ounces. 12. We will use the same applet here. We have to find out: P(-2.89 < Z < -1.03) But this is equal to say P( Z < -1.03) - P( Z < -2.89) So now we just have to plug these numbers in the applet box called Z, finding that: P(Z<-1.03)=0.1515 P(Z<-2.89)=0.0019 So, 0.1515-0.0019=0.1496. Thus, the answer is 14.96% I hope these answers are clear enough. If you have any doubts, please don't hesitate to request a clarification, I'll be glad to explain further. Best regards, elmarto
Clarification of Answer by elmarto-ga on 19 Jun 2003 11:52 PDT I'm sorry, I neglected questions 4, 13, part of question 2 and the search terms I used. Here they are. -------------------------------------------- 2. I'll repost the whole answer to question 2 here and then add the missing answers: In the following table, I take the total sample size. Thus, the percentage you see in the square (Female, For), would mean "what percentage of all people in the mean is BOTH a female and for the anti-trust plan?". For Neutral Against Totals Female 19% 27% 6% 52% Male 6% 18% 24% 48% Totals 25% 45% 30% 100% These percentages are calculated in the following way. For example, the percentage in (Female, For) is calculated by taking the number of females that are for the plan (38), dividing it by 200 (total number of people in the sample) and multiplying it by 100. The same goes for the other elements in the table. (b) of those FOR the plan, ________ percent were females. From the table in the question, we have that there are 50 people for, 38 of which are women. Thus the percentage of those FOR the plan who were females would be (38/50)100 = 76% (c) of the males in the sample, ________ percent were FOR the plan We know from the table that there are 96 males, 12 of which are FOR the plan. Thus the percentages of males who are FOR the plan is (12/96)100 = 12.5% 4. In order to find the standard deviation (SD), we must first find the variance. Then we will take the square root of the variance and that will be the SD. The formula for the sample variance is: N ___ (1/N-1)* \ (xi-x)^2 / --- i=1 where N is the sample size (16), xi is each observation and x is the sample mean. The sample mean in this case would be 448/16= 16. This formula was taken from Variance http://mathworld.wolfram.com/Variance.html Let's "play" with the term: (xi-16)^2 We can expand it to get: (xi-16)^2 =xi^2 - 2xi16 + 16^2 =xi^2 - 32xi + 256 So, N ___ \ (xi-x)^2 / --- i=1 is equal to N ___ \ xi^2-32xi+256 / --- i=1 which is equal to N ___ ___ ___ \ xi^2 - \ 32xi + \ 256 / / / --- --- --- i=1 (All sums are between i=1 and N) We know what the first term is: it's the sum of the squares of the observations, which we know is 13,356. The third term is simply 256 summed N times, so it's 25616=4096. The second term can be rewritten as N ___ 32\ xi / --- i=1 But we know that the sum of xi is 448, so the second term would be 32448=14,336 Therefore, we have that N ___ ___ ___ \ xi^2 - \ 32xi + \ 256 / / / --- --- --- i=1 equals 13,356 - 14,336 + 4096 = 3,116 So, the variance is (1/N-1)3,116 = 3,116/15 = 207.73... Finally, the standard deviation is the square root of the variance, which would be 14.412... 13. In this question, we are looking for "a" such that P( 15-a < X < 15 +a ) = 0.70 Again, we have to convert X to a standard normal in order to use tables. Thus P( 15-a < X < 15 +a ) =P(-a < X-15 < a) =P(-a/2 < (X-15)/2 < a/2) =P(-a/2 < Z < a/2) Now, P(-a/2 < Z < a/2) = 0.70 =P(Z<a/2)-P(Z<-a/2) () But, because the standard normal distribution is symmetric around 0, we have that P(Z<a/2)=P(Z>-a/2) And we also know that P(Z<-a/2)=1-P(Z>-a/2) So, by replacing this in (), we get that P(Z<a/2)-P(Z<-a/2) =P(Z<a/2)- (1-P(Z>-a/2)) =P(Z<a/2)- (1-P(Z<a/2)) =2P(Z<a/2) - 1 = 0.70 Now, 2P(Z<a/2) = 1.70 P(Z<a/2) = 0.85 So now it's easy to calculate a, we've done it in the previous questions. We just plug 0.85 in the applet box "Area below Z", finding that Z=1.0364. Therefore, a/2=1.0364, so a=2.0728. The answer is then that 70% of the products will be assembled in between 15-2.0728 and 15+2.0728 minutes (between 12.92 and 17.07 minutes). And that's all the answers. The Google search terms I used were: sample mean proportion normal table quartile "coefficient of variation" complement "mutually exclusive" Again, if you need any further assistance, don't hesitate to request clarification. Otherwise, I await your rating and final comments. Best wishes! elmarto
Request for Answer Clarification by sunshine2-ga on 19 Jun 2003 12:55 PDT #5 Hi. Thanks for the fast reply. For #5 is it also possible for the answer to be 0.20?
Clarification of Answer by elmarto-ga on 19 Jun 2003 13:25 PDT Hi sunshine2, Unless the question statement is wrong, it's not possible for the answer to be 0.20. However, I did make a calculation mistake and gave the wrong answer to this question. I'm very sorry. The correct answer is not 24%. It is 32%, or 0.32. By definition, to say "A and B are independent" implies P(A and B) equals P(A)P(B). Therefore, since P(A)=0.8 and P(B)=0.4 and they are independent, then P(A and B) MUST be equal to 0.80.4=0.32. Somewhere along the line I said to myself "8 times 4 is 24", which, as most people know, is wrong :-)
Request for Answer Clarification by sunshine2-ga on 19 Jun 2003 14:08 PDT Thanks again elmarto. I am working through the rest of the problems. After I am finished I will post your rating and tip :-) Sunshine
Clarification of Answer by elmarto-ga on 19 Jun 2003 18:05 PDT Thanks a lot!! :-) Just ask if there's anything you don't understand.

sunshine2-ga rated this answer: 5 out of 5 stars

and gave an additional tip of: $2.00

Another great help session.  Thank you.

Comments

There are no comments at this time.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.

Search Google Answers for

Google Home - Answers FAQ - Terms of Service - Privacy Policy