Google Answers: machine selection

View Question

Q: machine selection ( No Answer, 3 Comments )

Question

Subject: machine selection
Category: Science > Math
Asked by: dogwood-ga
List Price: $20.00

Posted: 03 Sep 2004 04:37 PDT
Expires: 03 Oct 2004 04:37 PDT
Question ID: 396333

Among N machines, there are up to k machines in bad / unacceptable condition. After a primitive testing over the N machines, given the probability of the N machines in bad condition is (p_1, p_2, ..., p_N). Question: If we randomly select m machines from N, the probability at least one machine is in bad condition?
Clarification of Question by dogwood-ga on 05 Sep 2004 04:31 PDT Hi, Rececar, Thanks for your another try. My intension is to prove that with the primitive test, the bad machine will be selected in lower probability than without knowing the individual probability (p_1,..p_N). I hope to know the advantage with the individual probability over without that parameter. I knew, without the individual probability, the probability of that one machine is bad among the m is Min(k,m) (j,k)*((m-j),(N-k)) SUM ________________________ j=1 (m,N) With only the individual probability but no constraint in k, the probability m is 1 - PRODUCT(1-p_j) j = 1 I'm trying to find a probability formula of combining both individual probability p_i and constraint (k,N) in order to prove my intension in the above. Honestly, I didnot understand why your question is different from mine. Thanks again for your help. Regards,
Request for Question Clarification by mathtalk-ga on 05 Sep 2004 08:15 PDT Hi, dogwood-ga: In stating the probability for "no constraint in k" as: m 1 - PRODUCT (1 - p_j) j = 1 it appears that you are perhaps using knowledge of the "primitive test" results to choose the m machines with the least probability p_j. If the m machines are selected "at random" without using such knowledge, your chance of getting at least one bad machine would instead be an average (mean) of expressions like the above, but taking into account all possible choices of m machines out of N. So a clarification on your part of how the "primitive test" results are to be used is needed, even to resolve the simpler question with "no constraint in k". regards, mathtalk-ga
Clarification of Question by dogwood-ga on 07 Sep 2004 11:18 PDT Hi, mathtalk-ga, Thanks for your question. As you said, my thought is to pick up the m machines with the least individual probability because it makes the chance of selecting one bad machine lower than other options. Because of the varied m,k,N, I may not be able to prove my intent of that there exists the advantage with the individual probability over without that parameter. I'm thinking, though the primitive testing doesnot consider the constraint k, we may be able to can adjust the individual probability with the consideration of constraint k after then. For example, treating the primitive testing has the error, the constraint (N & k) can be used to correct the error. By doing that, maybe we can associate the (p_1, ..., p_N) with the constraint k. Am I in the right track? Regards,
Clarification of Question by dogwood-ga on 15 Sep 2004 12:39 PDT Hi, Any further thoughts? Thanks, Regards,
Request for Question Clarification by mathtalk-ga on 15 Sep 2004 18:37 PDT Hi, dogwood-ga: I think my initial focus was on trying to pin down what "the" problem is here, but my conclusion, which I hope you share, is that there is enough room for interpretation to make several problems... which of course is what makes it interesting. I could post an Answer that analyzes one quite specific interpretation of the data in a way that I think is consistent with your understanding. If you wish, I'll be glad to proceed. regards, mathtalk-ga
Clarification of Question by dogwood-ga on 16 Sep 2004 07:16 PDT Hi, Mathtalk-ga, As you know that I'm looking for a probability solution which takes into account both the individual probability and the constraint (k,N). I'm glad that you understand the point. Would you please share your specific interpretation of the data and your question behind the interpretation. Thanks, Regards,
Request for Question Clarification by mathtalk-ga on 16 Sep 2004 19:52 PDT Hi, dogwood-ga: Well, here's the story I built around your problem. It's got unnecessary details, of course, but it puts each piece of the data in a context. Your company has N PC's (machines) which are known to be in working order at the end of the business day, dedicated to the computation of digits of Euler's constant. Overnight k persons come into the facility and log onto the Internet with their respective random choices of k machines in order to play a Doom 3 tournament. As they are eliminated they leave and the machine is disconnected from the Internet. In the morning the breach of security is detected, and their exists a risk of viral contamination for each unauthorized use of a machine. The machines risk of infection, conditioned on having been used, can be estimated accurately (the values p_1,..,p_N), but it is not known which machines were used. Therefore at most k machines are now in an unusable state, depending on whether the corresponding risk of viral contamination was realized. We can now compute the likelihood that at least one of m machines chosen at random is infected, as well as the likelihood that at least one of the m machines with the smallest probability values p_j is infected. Your thesis is that you are better off using the information about the p_j's than choosing machines at random. Of course the improved risk mitigation from using this information depends on the p_j's, and specifically the improvement is nil if all the p_j's are equal. regards, mathtalk-ga
Clarification of Question by dogwood-ga on 17 Sep 2004 04:08 PDT Hi, mathtalk-ga, Your interpretation and question are very close to mine. Please go ahead to proceed it. Thanks, Regards,
Clarification of Question by dogwood-ga on 22 Sep 2004 07:00 PDT Hi, mathtalk, I'm looking forward to your answer. Thanks, Regards,
Clarification of Question by dogwood-ga on 27 Sep 2004 14:44 PDT Hello, I wonder if the question is unsolvable or the list price underestimates the time on it. If it is the second case, I'll willing to know the reasonable time to solve this question and may readjust the price with my budget. Thanks,
Request for Question Clarification by mathtalk-ga on 29 Sep 2004 10:39 PDT Hi, dogwood-ga: Let's spell out the results for the case K = 1. Similar but more complicated results can be given for larger K, but this gives a qualitative feel for the situation. Please confirm whether or not such results are useful to you. Assuming that the probabilities p_i are labelled in increasing order and independent of the chance of a machine being chosen for "unauthorized use", then the chance of picking that machine in your group of M is M/N. The chance of picking "at least one machine" in "bad condition" is then: P = (M/N) * (1/N) * SUM p_i FOR i = 1 to N = (M/N) * AVERAGE {p_1,..,p_N} if no use is made of the "susceptibility" probabilities p_i, versus: P' = (M/N) * (1/M) * SUM p_i FOR i = 1 to M = (M/N) * AVERAGE {p_1,..,p_M} if the probabilities p_i are used to select the M machines with least "susceptibility". The reduction of P' with respect to P is proportional to the average of the M least susceptibility values divided by the average of all of them. Since we know nothing about the probabilities {p_1,..,p_N}, the formulas which can be given for larger values of K cannot be simplified very far. However one can certainly show that P' is less than or equal to P, with equality only in the case that all the p_i's are equal. The case K = 1 is typical in this respect. regards, mathtalk-ga

Answer

There is no answer at this time.

Comments

Subject: Re: machine selection
From: mathtalk-ga on 03 Sep 2004 05:49 PDT

It's hard to know from the problem's wording how to combine the
information that "up to k" machines are in bad shape with the
estimated probabilities on individual machines.  Yet the way of
combining them seems central to this question.

Knowing that at most k machines are in bad shape, for k < N, implies
that the actual probabilities of specific machines being "out of
service" are not independent.  For example, any one machine presumably
has a positive probability of being in bad shape but the probability
that all are (k < N) is zero.  So the probability of the conjunction
event is _not_ the product of the individual probabilities (hence not
independent).

If one ignores the constraint that at most k machines are in bad
shape, one might proceed as follows.  The probability at least one of
the chosen machines is in bad shape is complementary to the
probability that all the chosen machines are in good shape.  For any
particular choice of m machines, the probability of all being in good
shape, assuming independence of those events, is the product of the
individual machines being in good shape, i.e.

PRODUCT (1 - p_i(j)) where j = 1,..,m

and i(j) denotes the j'th choice out of m in this particular
selection.  These products then have to be averaged over all possible
choices of m machines out of N.  Assuming that all machines are
equally likely to be chosen, without replacement, this gives a simple
equiprobability model for the chances of selection; the averaging is
then an "equally weighted" mean.

The constraint of at most k machines being in bad condition can be
worked into the solution, but more careful wording is needed to
understand how that relates to the values p_i "given".  For example,
we could have a model in which one independently selects m machines
"to use" and k machines to (potentially) be out of service, followed
by applying the probabilities p_i shown to determine (for any machines
chosen for both groups) if at least one is actually bad.  If this were
a homework problem, most likely that would be the interpretation the
student was expected to pursue.

Of course if one already had the "primitive testing" results, in a
practical sense that knowledge might bias the selection of machines to
use (toward those with the smallest chance of being in bad shape).

regards, mathtalk-ga

Subject: Re: machine selection
From: dogwood-ga on 03 Sep 2004 07:00 PDT

Good analysis. The central point is how to combine the k into the 
probabilities of the selected machines. My feeling is that both
k and N could affect the overall probability of that at least one 
machine in bad shape is among the selected m machines even if the 
individual probability is not based on the k and N. Of course, 
0<m<N and 0<k<N, but both m>k or m<k may be true. Thanks,

Regards,

Subject: Re: machine selection
From: racecar-ga on 03 Sep 2004 17:25 PDT

dogwood--

There is a problem.  You have provided two pieces of information:

1) The probablity each machine is in bad condition is (p_1, p_2, ..., p_N).
      (presumably none of the p's are equal to zero).

2) At most k machines are in bad condition.

These are inconsistent with each other.  Unless you allow that N-k of
the p's are equal to zero, or that the p for each machine depends on
the conditions of the others, this question is unanswerable.

Here is a question that doesn't contradict itself:

The probablities of badness are known to be (p_1, p_2, ..., p_N).  One
day, the machines are checked, and X of them are bad.  I won't tell
you what X is, but I will tell you that X is less than or equal to k. 
Now, if we select m machines at random, what is the probablity that at
least one is bad?

Is this what you mean?

Note: I haven't said anything here that mathtalk didn't already say,
but maybe it's a useful rephrasing.

Important Disclaimer: Answers and comments provided on Google Answers are general information, and are not intended to substitute for informed professional medical, psychiatric, psychological, tax, legal, investment, accounting, or other professional advice. Google does not endorse, and expressly disclaims liability for any product, manufacturer, distributor, service or service provider mentioned or any opinion expressed in answers or comments. Please read carefully the Google Answers Terms of Service.

If you feel that you have found inappropriate content, please let us know by emailing us at answers-support@google.com with the question ID listed above. Thank you.

Search Google Answers for

Google Home - Answers FAQ - Terms of Service - Privacy Policy