Statistical recognition methods. Recognition methods “Physical meaning” and terminology

Among technical diagnostic methods, the method based on the generalized Bayes formula occupies a special place due to its simplicity and efficiency.

Of course, the Bayes method has disadvantages: it requires a large amount of preliminary information, it “suppresses” rare diagnoses, and so on. However, where the volume of statistical data allows the Bayes method to be used, it is advisable to use it as one of the most reliable and effective methods.

Basics of the method. The method is based on the simple Bayes formula. If there is a diagnosis $D_i$ and a simple sign $k_j$ that occurs with this diagnosis, then the probability of the joint occurrence of the two events (the object is in state $D_i$ and exhibits sign $k_j$) is

$$P(D_i k_j) = P(D_i)\,P(k_j / D_i) = P(k_j)\,P(D_i / k_j). \qquad (5.4)$$

Bayes’ formula follows from this equality (see Chapter 11):

$$P(D_i / k_j) = P(D_i)\,\frac{P(k_j / D_i)}{P(k_j)}. \qquad (5.5)$$

It is very important to determine the exact meaning of all quantities included in this formula.

$P(D_i)$ is the probability of diagnosis $D_i$, determined from statistical data (the prior probability of the diagnosis). Thus, if $N$ objects were examined previously and $N_i$ of them were in state $D_i$, then

$$P(D_i) = N_i / N. \qquad (5.6)$$

$P(k_j / D_i)$ is the probability that sign $k_j$ occurs in objects with state $D_i$. If, among the $N_i$ objects with diagnosis $D_i$, sign $k_j$ appeared in $N_{ij}$ of them, then

$$P(k_j / D_i) = N_{ij} / N_i. \qquad (5.7)$$

$P(k_j)$ is the probability that sign $k_j$ occurs in all objects, regardless of the state (diagnosis) of the object. If, out of the total number of $N$ objects, sign $k_j$ was detected in $N_j$ objects, then

$$P(k_j) = N_j / N. \qquad (5.8)$$

No special calculation of $P(k_j)$ is required to establish a diagnosis. As will become clear below, the values $P(D_i)$ and $P(k_j / D_i)$, known for all possible states, determine the value of $P(k_j)$.

In equality (5.5), $P(D_i / k_j)$ is the probability of diagnosis $D_i$ after it has become known that the object in question has sign $k_j$ (the posterior probability of the diagnosis).
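Relations (5.5)-(5.8) can be checked numerically. The following Python sketch is illustrative only: the function name and all counts are hypothetical and are not taken from the statistics discussed in this text.

```python
def posterior_from_counts(n_total, n_diag, n_diag_with_sign, n_sign):
    """Posterior P(D_i / k_j) computed from raw counts, following (5.5)-(5.8)."""
    prior = n_diag / n_total                  # P(D_i) = N_i / N, eq. (5.6)
    likelihood = n_diag_with_sign / n_diag    # P(k_j / D_i) = N_ij / N_i, eq. (5.7)
    evidence = n_sign / n_total               # P(k_j) = N_j / N, eq. (5.8)
    return prior * likelihood / evidence      # Bayes' formula, eq. (5.5)


# Hypothetical sample: 1000 objects, 50 of them with diagnosis D_i,
# 10 of those 50 showing sign k_j, and 40 objects showing k_j overall.
p = posterior_from_counts(n_total=1000, n_diag=50, n_diag_with_sign=10, n_sign=40)
print(f"P(D_i / k_j) = {p:.3f}")
```

Algebraically the result reduces to $N_{ij} / N_j$, the share of objects with sign $k_j$ that have diagnosis $D_i$, which is a useful sanity check on the counts.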

Generalized Bayes formula. This formula applies when the examination is carried out on a complex of signs $K$ that includes the signs $k_1, k_2, \ldots, k_v$. Each sign $k_j$ has $m_j$ ranks ($k_{j1}, k_{j2}, \ldots, k_{js}, \ldots, k_{jm_j}$). As a result of the examination, the realization of each sign,

$$k_j^* = k_{js}, \qquad (5.9)$$

and of the whole complex of signs $K^*$ becomes known. The index $*$, as before, denotes the specific value (realization) of a sign. The Bayes formula for a complex of signs has the form

$$P(D_i / K^*) = P(D_i)\,\frac{P(K^* / D_i)}{P(K^*)} \quad (i = 1, 2, \ldots, n), \qquad (5.10)$$

where $P(D_i / K^*)$ is the probability of diagnosis $D_i$ after the results of the examination on the complex of signs $K$ have become known, and $P(D_i)$ is the prior probability of diagnosis $D_i$ (according to previous statistics).

Formula (5.10) applies to any of the $n$ possible states (diagnoses) of the system. It is assumed that the system is in only one of these states, and therefore

$$\sum_{s=1}^{n} P(D_s) = 1. \qquad (5.11)$$

In practical problems, several states $A_1, \ldots, A_r$ are often allowed to exist, and some of them may occur in combination with each other. In that case the individual states $D_1 = A_1, \ldots, D_r = A_r$ and their combinations $D_{r+1} = A_1 \wedge A_2, \ldots$ should be treated as the distinct diagnoses $D_i$.

Let us now determine $P(K^* / D_i)$. If the complex consists of $v$ signs, then

$$P(K^* / D_i) = P(k_1^* / D_i)\,P(k_2^* / k_1^* D_i) \cdots P(k_v^* / k_1^* \ldots k_{v-1}^* D_i), \qquad (5.12)$$

where $k_j^* = k_{js}$ is the rank of the sign revealed by the examination. For diagnostically independent signs

$$P(K^* / D_i) = P(k_1^* / D_i)\,P(k_2^* / D_i) \cdots P(k_v^* / D_i). \qquad (5.13)$$

In most practical problems, especially with a large number of features, it is possible to accept the condition of independence of features even in the presence of significant correlations between them.
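Under the independence assumption, expression (5.13) is a plain product over the observed ranks. A minimal Python sketch follows; the function name, the nested-list layout, and the numbers are assumptions made for illustration.

```python
from math import prod


def complex_likelihood(rank_probs, observed_ranks):
    """P(K* / D_i) for independent signs, eq. (5.13).

    rank_probs[j][s] is P(k_js / D_i) for one fixed diagnosis D_i;
    observed_ranks[j] is the index s of the rank observed for sign k_j.
    """
    return prod(rank_probs[j][s] for j, s in enumerate(observed_ranks))


# Hypothetical row of a diagnostic matrix for a single diagnosis.
row = [
    [0.2, 0.8],        # sign k_1 with two ranks
    [0.3, 0.5, 0.2],   # sign k_2 with three ranks
]
likelihood = complex_likelihood(row, observed_ranks=(0, 2))
print(likelihood)  # product 0.2 * 0.2
```

If the signs are not independent, the chained form (5.12) has to be used instead, which requires conditional probabilities that this layout does not store.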

The probability of occurrence of the complex of signs $K^*$ is

$$P(K^*) = \sum_{s=1}^{n} P(D_s)\,P(K^* / D_s). \qquad (5.14)$$

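The generalized Bayes formula can be written as follows:

$$P(D_i / K^*) = \frac{P(D_i)\,P(K^* / D_i)}{\sum_{s=1}^{n} P(D_s)\,P(K^* / D_s)}, \qquad (5.15)$$

where $P(K^* / D_i)$ is determined by equality (5.12) or (5.13). From relation (5.15) it follows that

$$\sum_{i=1}^{n} P(D_i / K^*) = 1, \qquad (5.16)$$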

which, of course, should be the case, since one of the diagnoses is necessarily realized, and the realization of two diagnoses at the same time is impossible.

It should be noted that the denominator of the Bayes formula is the same for all diagnoses. This makes it possible to determine first the probabilities of the joint occurrence of the $i$-th diagnosis and the given realization of the complex of signs,

$$P(D_i K^*) = P(D_i)\,P(K^* / D_i), \qquad (5.17)$$

and then the posterior probability of the diagnosis,

$$P(D_i / K^*) = \frac{P(D_i K^*)}{\sum_{s=1}^{n} P(D_s K^*)}. \qquad (5.18)$$
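The two-step procedure of (5.17) and (5.18) can be sketched in Python as follows. The function name and the input numbers are hypothetical; the likelihoods $P(K^* / D_i)$ are assumed to have been computed beforehand.

```python
def posteriors(priors, likelihoods):
    """Posterior probabilities P(D_i / K*) via eqs. (5.17) and (5.18)."""
    # Step 1: joint probabilities P(D_i K*) = P(D_i) P(K* / D_i), eq. (5.17).
    joint = [p * l for p, l in zip(priors, likelihoods)]
    # Step 2: divide by the common denominator, eq. (5.18).
    total = sum(joint)
    if total == 0:
        raise ValueError("the observed complex is impossible under every diagnosis")
    return [x / total for x in joint]


# Hypothetical priors and likelihoods for three diagnoses.
result = posteriors(priors=[0.7, 0.2, 0.1], likelihoods=[0.01, 0.20, 0.30])
print(result, sum(result))
```

Because every joint probability is divided by the same sum, the posteriors add up to one, as required by (5.16).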

Note that it is sometimes advisable to take logarithms of formula (5.15) first, since expression (5.13) contains products of small quantities.
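One possible way to carry out the calculation in logarithms is sketched below in Python. The text does not prescribe a specific scheme; subtracting the maximum log value before exponentiating is an assumption made here to avoid underflow, and all names and numbers are hypothetical.

```python
import math


def log_space_posteriors(priors, sign_probs):
    """Posteriors from (5.15) with the product (5.13) accumulated as a sum of logs.

    sign_probs[i] lists P(k_j* / D_i) for the observed rank of every sign.
    """
    log_joint = []
    for prior, probs in zip(priors, sign_probs):
        if prior == 0 or any(q == 0 for q in probs):
            log_joint.append(-math.inf)   # diagnosis excluded by the data
        else:
            log_joint.append(math.log(prior) + sum(math.log(q) for q in probs))
    top = max(log_joint)
    if top == -math.inf:
        raise ValueError("the observed complex is impossible under every diagnosis")
    # Shift by the largest log value so that the exponentials stay representable.
    weights = [math.exp(x - top) for x in log_joint]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical case in which the direct product underflows to zero in floating point.
tiny = log_space_posteriors([0.5, 0.5], [[1e-200] * 3, [2e-200] * 3])
print(tiny)
```

In this example each direct product is of the order of $10^{-600}$ and would be rounded to zero, while the logarithmic form still recovers the ratio 1 : 8 between the two diagnoses.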

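If the realization of a certain complex of signs $K^*$ is determining for diagnosis $D_p$, then this complex does not occur with other diagnoses:

$$P(K^* / D_s) = 0 \quad (s \ne p), \qquad P(K^* / D_p) \ne 0.$$

Then, by virtue of equality (5.15),

$$P(D_s / K^*) = 0 \quad (s \ne p), \qquad P(D_p / K^*) = 1.$$
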
Thus, the deterministic logic of diagnosis is a special case of probabilistic logic.

Bayes' formula can also be used when some of the signs have a discrete distribution and the others a continuous one. For a continuous distribution, distribution densities are used. Computationally, however, this difference between signs is insignificant if the continuous curve is specified by a set of discrete values.

Diagnostic matrix. To determine the probabilities of diagnoses by the Bayes method, a diagnostic matrix (Table 5.1) must be compiled from preliminary statistical material. This table contains the probabilities of the ranks of the signs for the various diagnoses.

Table 5.1

Diagnostic matrix in the Bayes method

If the signs are two-rank (simple “yes - no” signs), it is enough to list in the table the probability of the sign appearing, $P(k_j / D_i)$. The probability of the sign being absent is $P(\bar{k}_j / D_i) = 1 - P(k_j / D_i)$.

However, it is more convenient to use a uniform form, setting, for example, for a two-rank sign $P(k_j / D_i) = P(k_{j1} / D_i)$; $P(\bar{k}_j / D_i) = P(k_{j2} / D_i)$.

Note that $\sum_{s=1}^{m_j} P(k_{js} / D_i) = 1$, where $m_j$ is the number of ranks of sign $k_j$. The sum of the probabilities of all possible realizations of a sign is equal to one.
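The normalization condition gives a simple consistency check for a diagnostic matrix stored in the uniform form. The Python sketch below is an illustration under assumed conventions: the nested-list layout `matrix[i][j][s]` for $P(k_{js} / D_i)$ and the function name are not from the text, and the values are hypothetical.

```python
import math


def check_diagnostic_matrix(matrix):
    """Verify that the rank probabilities of every sign sum to one for every diagnosis."""
    for i, row in enumerate(matrix):
        for j, rank_probs in enumerate(row):
            if any(p < 0 for p in rank_probs):
                raise ValueError(f"negative probability for diagnosis {i}, sign {j}")
            if not math.isclose(sum(rank_probs), 1.0, abs_tol=1e-9):
                raise ValueError(f"ranks of sign {j} do not sum to 1 for diagnosis {i}")


# Hypothetical matrix: two diagnoses, two two-rank signs (present, absent).
matrix = [
    [[0.2, 0.8], [0.3, 0.7]],
    [[0.4, 0.6], [0.5, 0.5]],
]
check_diagnostic_matrix(matrix)   # passes silently when the matrix is consistent
```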

The diagnostic matrix includes the prior probabilities of the diagnoses. The learning process in the Bayes method consists of forming the diagnostic matrix. It is important to provide for the possibility of refining the table during the diagnostic process. To do this, the computer memory should store not only the values $P(k_{js} / D_i)$ but also the following quantities: $N$, the total number of objects used to compile the diagnostic matrix; $N_i$, the number of objects with diagnosis $D_i$; and $N_{ij}$, the number of objects with diagnosis $D_i$ examined for sign $k_j$. If a new object with diagnosis $D_\mu$ arrives, the previous prior probabilities of the diagnoses are adjusted as follows:

$$P(D_i) = \begin{cases} \dfrac{N_i}{N + 1}, & i \ne \mu, \\[1ex] \dfrac{N_\mu + 1}{N + 1}, & i = \mu. \end{cases}$$

Next, corrections are introduced to the probabilities of the signs. Let rank $r$ of sign $k_j$ be detected in the new object with diagnosis $D_\mu$. Then, for further diagnostics, new values of the probabilities of the ranks of sign $k_j$ under diagnosis $D_\mu$ are adopted:

$$P(k_{js} / D_\mu) = \begin{cases} \dfrac{N_{\mu j}\,P'(k_{js} / D_\mu)}{N_{\mu j} + 1}, & s \ne r, \\[1ex] \dfrac{N_{\mu j}\,P'(k_{js} / D_\mu) + 1}{N_{\mu j} + 1}, & s = r, \end{cases}$$

where $P'$ denotes the value before the new object was added.
Conditional probabilities of signs for other diagnoses do not require adjustment.
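The refinement of the diagnostic matrix can be sketched in Python as follows. This is a minimal illustration, not the implementation described in the text: the class and method names are hypothetical, and the sketch stores rank counts instead of the probabilities themselves, which yields the same values after each update.

```python
class DiagnosticMatrix:
    """Counts from which priors and rank probabilities are recomputed on demand."""

    def __init__(self, n_diagnoses, ranks_per_sign):
        self.n = 0                                   # N, total number of objects
        self.n_diag = [0] * n_diagnoses              # N_i
        # N_ij: objects with diagnosis i that were examined for sign j.
        self.n_examined = [[0] * len(ranks_per_sign) for _ in range(n_diagnoses)]
        # Number of objects with diagnosis i showing rank s of sign j.
        self.rank_counts = [[[0] * m for m in ranks_per_sign]
                            for _ in range(n_diagnoses)]

    def add_object(self, diagnosis, observed):
        """Register a new object; observed maps sign index j to rank index r."""
        self.n += 1
        self.n_diag[diagnosis] += 1
        for j, r in observed.items():
            self.n_examined[diagnosis][j] += 1
            self.rank_counts[diagnosis][j][r] += 1

    def prior(self, i):
        return self.n_diag[i] / self.n               # P(D_i) = N_i / N

    def rank_prob(self, i, j, s):
        return self.rank_counts[i][j][s] / self.n_examined[i][j]


m = DiagnosticMatrix(n_diagnoses=2, ranks_per_sign=[2, 3])
m.add_object(0, {0: 0, 1: 2})
m.add_object(0, {0: 1})          # sign k_2 was not examined for this object
m.add_object(1, {1: 1})
print(m.prior(0), m.rank_prob(0, 0, 0))
```

Only the counts of the diagnosis assigned to the new object change, so the conditional probabilities for the other diagnoses stay as they were.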

Example. Let us illustrate the Bayes method. Suppose two signs are checked when monitoring a gas turbine engine: $k_1$, an increase in the gas temperature behind the turbine by more than 50 °C, and $k_2$, an increase in the time to reach maximum speed by more than 5 s. Assume that for this type of engine the appearance of these signs is associated either with a malfunction of the fuel regulator (state $D_1$) or with an increase in the radial clearance in the turbine (state $D_2$).

When the engine is in normal condition (state $D_3$), sign $k_1$ is not observed, and sign $k_2$ is observed in 5% of cases. Statistical data show that 80% of engines complete their service life in normal condition, 5% of engines have state $D_1$, and 15% have state $D_2$. It is also known that sign $k_1$ occurs in 20% of cases with state $D_1$ and in 40% of cases with state $D_2$; sign $k_2$ occurs in 30% of cases with state $D_1$ and in 50% of cases with state $D_2$. These data are summarized in a diagnostic table (Table 5.2).

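Let us first find the probabilities of the engine states when both signs $k_1$ and $k_2$ are detected. To do this, considering the signs independent, we apply formula (5.15).

The probability of state $D_1$ is

$$P(D_1 / k_1 k_2) = \frac{0.05 \cdot 0.2 \cdot 0.3}{0.05 \cdot 0.2 \cdot 0.3 + 0.15 \cdot 0.4 \cdot 0.5 + 0.8 \cdot 0 \cdot 0.05} = 0.09.$$

Similarly, we get $P(D_2 / k_1 k_2) = 0.91$; $P(D_3 / k_1 k_2) = 0$.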

Let us now determine the probabilities of the engine states if the examination showed no increase in temperature (sign $k_1$ is absent) but the time to reach maximum speed has increased (sign $k_2$ is observed). The absence of sign $k_1$ is the presence of sign $\bar{k}_1$ (the opposite event), and $P(\bar{k}_1 / D_i) = 1 - P(k_1 / D_i)$.

Formula (5.15) is again used for the calculation, but the value $P(k_1 / D_i)$ in the diagnostic table is replaced by $P(\bar{k}_1 / D_i)$. In this case

$$P(D_1 / \bar{k}_1 k_2) = \frac{0.05 \cdot 0.8 \cdot 0.3}{0.05 \cdot 0.8 \cdot 0.3 + 0.15 \cdot 0.6 \cdot 0.5 + 0.8 \cdot 1 \cdot 0.05} = 0.12,$$

and similarly $P(D_2 / \bar{k}_1 k_2) = 0.46$; $P(D_3 / \bar{k}_1 k_2) = 0.41$. Let us calculate the probabilities of the states when both signs are absent. As before, we get

$$P(D_1 / \bar{k}_1 \bar{k}_2) = \frac{0.05 \cdot 0.8 \cdot 0.7}{0.05 \cdot 0.8 \cdot 0.7 + 0.15 \cdot 0.6 \cdot 0.5 + 0.8 \cdot 1 \cdot 0.95} = 0.03,$$

and $P(D_2 / \bar{k}_1 \bar{k}_2) = 0.05$; $P(D_3 / \bar{k}_1 \bar{k}_2) = 0.91$.

Note that the probabilities of states $D_1$ and $D_2$ are different from zero, since the signs under consideration are not determining for them. From these calculations it follows that if signs $k_1$ and $k_2$ are both present, the engine is in state $D_2$, i.e. it has an increased radial clearance, with probability 0.91. In the absence of both signs, the most likely state is the normal one (probability 0.91). If sign $k_1$ is absent and sign $k_2$ is present, the probabilities of states $D_2$ and $D_3$ are approximately equal (0.46 and 0.41), and additional examinations are required to clarify the condition of the engine.
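The three cases of the example can be recomputed with a short Python sketch. The probabilities are those stated in the example; the dictionary layout and the function name are assumptions made for illustration.

```python
# Prior state probabilities and sign probabilities from the example.
PRIORS = {"D1": 0.05, "D2": 0.15, "D3": 0.80}
P_SIGN = {            # (P(k1 / D_i), P(k2 / D_i))
    "D1": (0.20, 0.30),
    "D2": (0.40, 0.50),
    "D3": (0.00, 0.05),
}


def state_probabilities(k1_present, k2_present):
    """Posterior probabilities of D1, D2, D3 by formula (5.15), signs independent."""
    joint = {}
    for state, prior in PRIORS.items():
        p1, p2 = P_SIGN[state]
        l1 = p1 if k1_present else 1 - p1   # absence is the opposite event
        l2 = p2 if k2_present else 1 - p2
        joint[state] = prior * l1 * l2
    total = sum(joint.values())
    return {state: value / total for state, value in joint.items()}


for observed in [(True, True), (False, True), (False, False)]:
    rounded = {s: round(p, 2) for s, p in state_probabilities(*observed).items()}
    print(observed, rounded)
```

Keeping the unnormalized products in `joint` mirrors the two-step calculation of (5.17) and (5.18), so the same denominator is reused for all three states.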

Table 5.2

Feature probabilities and prior state probabilities

D_i     P(k_1/D_i)    P(k_2/D_i)    P(D_i)
D_1     0.2           0.3           0.05
D_2     0.4           0.5           0.15
D_3     0.0           0.05          0.80

Decision rule. The decision rule is the rule according to which the decision on the diagnosis is made. In the Bayes method, an object with the complex of signs $K^*$ is assigned to the diagnosis with the highest (posterior) probability:

$$K^* \in D_i \ \text{ if } \ P(D_i / K^*) > P(D_j / K^*) \quad (j = 1, 2, \ldots, n;\ i \ne j). \qquad (5.22)$$

The symbol $\in$, used in functional analysis, denotes membership in a set. Condition (5.22) indicates that an object possessing the given realization of the complex of signs $K^*$ (or, in short, the realization $K^*$) belongs to diagnosis (state) $D_i$. Rule (5.22) is usually refined by introducing a threshold value for the probability of the diagnosis:

$$P(D_i / K^*) \ge P_i, \qquad (5.23)$$

where $P_i$ is a pre-selected recognition level for diagnosis $D_i$. In this case, the probability of the closest competing diagnosis is not higher than $1 - P_i$. Usually $P_i \ge 0.9$ is adopted. If

$$P(D_i / K^*) < P_i, \qquad (5.24)$$

no decision on the diagnosis is made (refusal to recognize), and additional information is required.
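The decision rule with a recognition threshold can be sketched in Python as follows. The function name is hypothetical, a single threshold is assumed for all diagnoses for simplicity, and the posterior values are the rounded results of the engine example.

```python
def decide(posteriors, threshold=0.9):
    """Return the diagnosis chosen by rule (5.22) if it meets threshold (5.23).

    Returns None when the best posterior is below the threshold, i.e. when
    recognition is refused and additional information is needed.
    """
    best = max(posteriors, key=posteriors.get)
    if posteriors[best] >= threshold:
        return best
    return None


# Both signs present: D2 clears the 0.9 level.
print(decide({"D1": 0.09, "D2": 0.91, "D3": 0.00}))
# Sign k1 absent, k2 present: no diagnosis reaches the level, so recognition is refused.
print(decide({"D1": 0.12, "D2": 0.46, "D3": 0.41}))
```

Returning `None` instead of the best candidate keeps the refusal case explicit, so the caller cannot mistake a weak maximum for a confident diagnosis.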

The decision-making process in the Bayes method when calculating on a computer occurs quite quickly. For example, making a diagnosis for 24 conditions with 80 multi-digit signs takes only a few minutes on a computer with a speed of 10 - 20 thousand operations per second.

As noted, the Bayes method has some disadvantages, for example, errors in recognizing rare diagnoses. In practical calculations, it is advisable to carry out the diagnosis for the case of equally probable diagnoses, setting

$$P(D_i) = 1/n. \qquad (5.25)$$

Then the diagnosis $D_i$ for which $P(K^* / D_i)$ is maximal will have the largest posterior probability:

$$K^* \in D_i \ \text{ if } \ P(K^* / D_i) > P(K^* / D_j) \quad (j = 1, 2, \ldots, n;\ i \ne j). \qquad (5.26)$$

In other words, diagnosis $D_i$ is made if the given complex of signs occurs more often with diagnosis $D_i$ than with other diagnoses. This decision rule corresponds to the maximum likelihood method. It follows from the above that this method is a special case of the Bayes method with equal prior probabilities of the diagnoses. In the maximum likelihood method, “common” and “rare” diagnoses have equal rights.

For reliable recognition, condition (5.26) must be supplemented with a threshold value

$$P(K^* / D_i) \ge P_i, \qquad (5.27)$$

where $P_i$ is a pre-selected recognition level for diagnosis $D_i$.
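The effect of equal priors can be seen by comparing the two rules on the engine example for the case where sign $k_1$ is absent and sign $k_2$ is present. The Python sketch below uses the probabilities stated in the example; the variable and function names are hypothetical.

```python
PRIORS = {"D1": 0.05, "D2": 0.15, "D3": 0.80}
# P(K* / D_i) for "k1 absent, k2 present", signs taken as independent.
LIKELIHOODS = {"D1": 0.8 * 0.3, "D2": 0.6 * 0.5, "D3": 1.0 * 0.05}


def normalize(weights):
    """Scale non-negative weights so that they sum to one."""
    total = sum(weights.values())
    return {key: value / total for key, value in weights.items()}


# Bayes method with the statistical priors, formula (5.15).
bayes = normalize({d: PRIORS[d] * LIKELIHOODS[d] for d in PRIORS})
# Equal priors (5.25) cancel out, leaving the normalized likelihoods.
equal = normalize(LIKELIHOODS)
# Maximum likelihood decision, rule (5.26).
ml_choice = max(LIKELIHOODS, key=LIKELIHOODS.get)

print({d: round(p, 2) for d, p in bayes.items()})
print({d: round(p, 2) for d, p in equal.items()})
print(ml_choice)
```

With equal priors the rare state $D_1$ receives a much larger share than under the Bayes method, while the common normal state $D_3$ loses most of its weight, which is what “equal rights” for rare and common diagnoses means in practice.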

To date, a large number of methods have been developed, the use of which makes it possible to recognize the type of technical condition of the diagnosed object. This paper discusses only some of them, the most widely used in diagnostic practice.

· Basics of the method.

· Generalized Bayes formula.

· Diagnostic matrix.

· Decision rule.

· Fundamentals of the method.

· General procedure of the method.

· Connection of decision boundaries with the probabilities of errors of the first and second types.

The main advantage of statistical recognition methods is the ability to simultaneously take into account signs of different physical nature, since they are characterized by dimensionless quantities - the probabilities of their occurrence under different states of the system.

Among the technical diagnostic methods, the method based on the generalized Bayes formula holds a special place due to its simplicity and efficiency. (Bayes' theorem, or Bayes' formula, is one of the main theorems of probability theory; it allows one to determine the probability that an event (hypothesis) has occurred given only indirect evidence (data), which may be inaccurate.)

The Bayes method has disadvantages: it requires a large amount of preliminary information, it “suppresses” rare diagnoses, and so on. However, where the volume of statistical data allows the Bayes method to be used, it is advisable to use it as one of the most reliable and effective methods.

Basics of the method. The method is based on the simple Bayes formula. If there is a diagnosis $D_i$ and a simple sign $k_j$ that occurs with this diagnosis, then the probability of the joint occurrence of the two events (the object is in state $D_i$ and exhibits sign $k_j$) is

$$P(D_i k_j) = P(D_i)\,P(k_j / D_i) = P(k_j)\,P(D_i / k_j).$$

From this equality follows Bayes' formula

$$P(D_i / k_j) = P(D_i)\,\frac{P(k_j / D_i)}{P(k_j)}. \qquad (3.2)$$

It is very important to determine the exact meaning of all quantities included in this formula.

$P(D_i)$ is the prior probability of hypothesis $D_i$.

$P(k_j / D_i)$ is the probability of event $k_j$ given that hypothesis $D_i$ is true.

$P(k_j)$ is the total probability of occurrence of event $k_j$.

$P(D_i / k_j)$ is the probability of hypothesis $D_i$ given that event $k_j$ has occurred (the posterior probability, i.e. the probability of a random event given that the a posteriori data, those obtained after the experiment, are known).

$P(D_i)$ is the probability of diagnosis $D_i$, determined from statistical data (the prior probability of the diagnosis). Thus, if $N$ objects were examined previously and $N_i$ of them were in state $D_i$, then

$$P(D_i) = N_i / N. \qquad (3.3)$$

$P(k_j / D_i)$ is the probability that sign $k_j$ occurs in objects with state $D_i$. If, among the $N_i$ objects diagnosed with $D_i$, sign $k_j$ appeared in $N_{ij}$ of them, then

$$P(k_j / D_i) = N_{ij} / N_i.$$

$P(k_j)$ is the probability that sign $k_j$ occurs in all objects, regardless of the state (diagnosis) of the object. If, out of the total number of $N$ objects, sign $k_j$ was found in $N_j$ objects, then

$$P(k_j) = N_j / N.$$

In equality (3.2), $P(D_i / k_j)$ is the probability of diagnosis $D_i$ after it has become known that the object in question has sign $k_j$ (the posterior probability of the diagnosis).

Table 1. Individual rows from the table of calculations using Bayesian inversion

It looks like our customer retention isn't great. But we will recalculate the cost of this information, and although it will decrease, it turns out that it still makes sense to take additional measurements. Let's select 40 more buyers, and then there will be a total of 60 people. Of these 60, only 39 will say they will return to our store. Our new 90% CI will be 69-80%. The upper bound now equals our original critical threshold of 80%, giving us 95% confidence that the repeat customer rate is low enough to require us to make major, costly changes.

The calculations turned out to be quite complex, but remember that you can use the tables provided on our support site. And it is quite possible that the previously discussed subjective Bayesian method, applied by calibrated experts, would have worked in this case. Perhaps a customer survey will reveal such qualitative factors that our calibrated specialists will be able to take into account. However, the cost of these important measurements is high enough to justify our additional effort.

Avoid Observation Inversion

Many people ask the question: “What conclusion can I draw from this observation?” But Bayes showed us that it is often more useful to ask, “What should I observe if condition X holds?” The answer to the last question allows us to understand the first.

Although Bayesian inversion may seem very labor intensive at first glance, it is one of the most efficient measurement methods at our disposal. If we can formulate the question “What is the probability of seeing X if Y is true?” and turn it into “What is the probability that Y is true if we observe X?”, then a huge number of measurement problems can be solved. In fact, this is how we find answers to most scientific questions. If the proposed hypothesis is correct, what should we observe?

On the contrary, many managers seem to believe that all measurement comes down to finding answers to the question: “What should I conclude from what I see?” When it seems that an observational error has been committed, people decide that no conclusions can be drawn on this basis, no matter how low the probability of such an error. However, Bayesian analysis shows that the errors imagined by managers are extremely unlikely and that measurement would still significantly reduce existing uncertainty. In other words, the lack of at least a theoretical understanding of Bayesian inversion leads to the inversion of the question and the belief that low-probability errors reduce the value of measurements to zero - that is, to the most unfortunate form of “observation inversion.”


