1 Introduction
In the quality inspection, since we do not know whether the fraction defective (or FD) judged by an inspector is equal to a given FD or not, we are likely to be suspicious of inspectors. Whenever automatic inspection machines replaced manual inspection operations, many people thought that the probability of either overestimation or underesti- mation by inspection machines could not exist at all. However, it turned out immediately that they were wrong. There always exist nonzero errors, i.e., type I and II errors, in even a well-automated inspection machine. The assumption of nonzero errors from unavoidable events, for example due to hardware, software, and deterioration, could not be denied. According to Handbook of biometrics [2], the ranges of type I and type II errors have been measured for biometric inspections respectively : for fingerprints (optical scanners) as (0.0005%, 0.0015%) and (1%, 3%), for hand geometry (whole hand) as (0.05%, 0.15%) and (1%, 3%), for voice as (1%, 3%) and (7.5%, 22.5%), for iris as (0.0005%, 0.0015%) and (6%, 18%), and so on.
Sylla and Drury [4] studied the FD judged by an imperfect inspector, calling it the “apparent fraction nonconforming”, i.e., qe = (1 - q)α + q(1 - β), where q is a true FD, α is the probability of a type I error, and β is the probability of a type II error. They found the sample size and the cut-off value for single sampling by attributes, and error-related payoffs, and proposed the concept of liability, which is an inspector’s ability to respond to information. Burk et al. [1] studied the true FD, , and found that for very good process, qe is actually a type I error. They suggested a procedure for estimating the type I and II errors, and gave an industrial example. Ko [3] analyzed a special inspection process that allows up to two times of consecutive testing for each product to decrease type I inspection errors. Using a Markov chain to model the steps of the inspection process and a product unit’s quality states during inspection, he demonstrated that his inspection process could help reduce unnecessary rejects and consequently decrease material and production costs.
There are, however, a few papers dealing with the probability of overestimating a true FD, q, by an imperfect inspector, which we are interested in. The research article by Yang and Cho [6] is partially related with our paper. In order to attain a pre-specified quality rate at the end of an assembly line, he suggested a K-stage inspection-rework (K-IR) system, which was composed of a series of K stages, each of which included an inspection process and a rework process. He suspected the effectiveness of the K-IR system, and proved mathematically that FDA (FD after inspection) is always greater than FDB (FD before inspection), if FDB is less than a value that depends on an FD of rework and inspection errors. Based on his assumptions, he suggested a necessary condition for inspection effectiveness: the sum of two errors must be smaller than one.
Yang and Chang [5] studied the probability of overestimation (or PO) by an imperfect inspector with nonzero inspection errors, under the assumptions that (1) the inspector classified one-by-one an infinite sequence of items with a given FD, but which was unknown to the inspector, (2) inspection error is considered as either a constant or a random variable distributed with a “uniform” distribution. They proved that (1) the inspector overestimates the given FD with PO being more than 50%, if and only if the FD is smaller than a value called “critical FD”, which gives the 50-50 chance of overestimation and underestimation, and (2) PO increases to 100% as the given FD decreases to zero. However, no one knows whether their result still holds true or not, regardless of any distribution. If so, their result might have great economic impact on commercial inspection plans. Since proving it for the case of any distribution may not be mathematically tractable, we are interested in proving it in the case of a “symmetrical triangular” distribution.
Assuming that inspection error is regarded as either a constant or a random variable distributed with a triangular distribution (except the case that two types of errors are constant at the same time), we construct and describe three statistical models in Section 2. In Section 3 through 5, we derive formulas for PO and a critical FD satisfying PO = 50% for each model. Furthermore, we compare our results from the symmetrical triangular distribution with the previous results from the uniform distribution. In Section 6, we integrate our findings to one fundamental theorem.
2 Problem Statement
For the convenience of the readers, we summarize the paper of Yang and Chang [5] as follows. Assuming that an imperfect inspector with nonzero inspection errors classifies one-by-one an infinite sequence of items with a true FD, denoted by a constant q, which is unknown to the inspector, they derived the FD, denoted by Q, of an infinite number of items judged by an imperfect inspector and the probability of overestimation, denoted by PO(q), by an imperfect inspector respectively as follows :
where
n = n-th item inspected,
-
Ai = a constant or a random variable representing the type I error, the probability that the i-th conforming item is misclassified as nonconforming and falsely rejected by the inspector, and
-
Bi = a constant or a random variable representing the type II error, the probability that the i-th nonconforming item is misclassified as conforming and falsely accepted by the inspector.
Note that Q can be regarded as a random variable if and only if both Ai and Bi are not constants at the same time. Also note that PO(q) can be applied for any type of distribution. Since PO(q) depends ultimately upon an error distribution, for convenience, we will use the notation POU(q) instead of PO(q) when errors follows the “uniform” distribution. They defined CFU to be a critical FD satisfying POU(CFU) = 50%, i.e., the fraction defective that the probability of overestimation is exactly equal to the probability of underestimation.
Assuming that
-
(1) Ai is either a constant α for all i or i.i.d (independently identically distributed) with the uniform density function, , where an indicator function with one for 0 < x ≤ xu, and zero otherwise,
-
(2) Bi is either a constant β for all i or i.i.d with the uniform density function
-
(3) E[A] = α, E[B] = β, where E[X] is the expectation of a random variable X,
they constructed four statistical models and proved the theorem as follows.
-
(1) An imperfect inspector with has his/her/its own POU curve and CFU,
-
(2) POU is a function of two variables q and ρ, denoted by POU(q, ρ),
-
(3) POU is a decreasing function of q with POU(0, ρ) = 1 and POU(1, ρ) = 0.
-
(4) POU is a decreasing function of ρ with POU(q, 0) = 1 and POU(q, 1) = 0.
-
(5) There always exists a unique , which depends on only inspection errors and not q.
-
(6) The inspector overestimates q with PO > 0.5 for 0 ≤ q < CFU, estimates q with PO = 0.5 with PO = 0.5 for q = CFU, and underestimates q with PO < 0.5 for CFU < q ≤ 1.
It can be observed from the third statement above that POU increases to 100% as q decreases to 0% regardless of any value of ρ. Based on their findings, they conjectured that their theorem could still be true, regardless of any distribution. Since their theorem is restricted only to the uniform distribution, we are interested in proving whether or not their conjecture still holds true even for the symmetrical triangular distribution as shown in <Figure 1>. For 0 < αu , βu ≤ 1, we define,
In order to compare the results of Yang and Chang [5], we will use the notation POT(q) and CFT instead of POU(q) and CFU respectively, which were derived from the “uniform” distribution. Since inspection error can be regarded as either a constant or a random variable, we construct three statistical models as shown in <Table 1>. Since the case that two types of errors are constants at the same time was discussed in Yang and Chang [4], we exclude this case. Hence, our problem can be described as “Derive POT(q) and CFT for each model, and compare our results with the theorem.”
Throughout this paper, the first and second derivatives of a function f(x) will be expressed as f’(x) and f”(x), respectively.
3 Analysis of Model T (R, C)
Suppose that Bi = β for all i where β is a constant with 0 < β ≤ 1, and that Ai’s are i.i.d with fA (a) . Since , and β can be regarded as the average of type II error, ρ can be obtained as , and it turns out that POT(q) of this case, i.e., POTrc(q) is a function of two variables ρ and q as proved in the following proposition, and as shown in <Figure 2(a)>. In order to compare different results derived from two different distributions, we attach the previous proposition based on the uniform distribution at the end of the following proposition after revising the previous proposition with our notations.
Proposition 1. Under the assumptions that Bi = β for all i, where β is a constant with 0 < β ≤ 1, and Ai’s are i.i.d with fA (a) for 0 < a ≤ αu and 0 < αu ≤ 1, we have,
-
(1)
-
(2) POTrc1(q) is a strictly decreasing concave function of q, and POTrc2(q) is a strictly decreasing convex function of q,
-
(3) POTrc(q) is continuous and differentiable at
-
(4)
-
(5) the inspector with ρ overestimates q with PO > 0.5 for 0 ≤ q < CFTrc, estimates q with PO = 0.5 for q = CFTrc, and underestimates q with PO < 0.5 for CFTrc < q ≤ 1.
Proof : (1) From (1) and (2), we have,
If q = 1, POTrc(1) becomes zero since Qrc = (1 - β ) is always smaller than q = 1. If q = 1 and β = 0, then since Qrc = q = 1, every inspector with β = 0 always estimates correctly regardless of any distribution, and by the definition of PO, POTrc(1) = 0.5. Hence, POTrc(1) has two values, either 0.5 (when β = 0) or zero (when 0 < β ≤ 1) depending on β. Since we assume 0 < β ≤ 1, we have, POTrc(1) = 0. If q≠1, (3) can be further reduced to
where . Note that the value of depends upon the value of .
For (or ),
we have,
(2) Since , POTrc1(q) is a strictly decreasing concave function of q for . Since , POTrc2(q) is a strictly decreasing convex function of q for .
(3) and (4) Since ,, (3) and (4) hold true.
(5) From (1), (2), (3) and (4), it follows that (5) holds true. □
Proposition 2. (Yang and Chang [5]) Under the assumptions that Bi = β for all i, where β is a constant with 0 < β ≤ 1, and Ai’s are i.i.d with for 0 < a ≤ αu and 0 < αu ≤ 1, they have,
-
(1)
-
(2) POUrc(q) is a strictly decreasing concave function of q for 0 ≤ q ≤ ,
-
(3) , and
-
(4) the inspector with ρ overestimates q with PO > 0.5 for 0 ≤ q < CFUrc, estimates q correctly with PO = 0.5 for q = CFUrc, and underestimates q with PO < 0.5 for CFUrc < q ≤ 1.
It can be observed from two propositions above that
-
(1) POTrc(q)≠POUrc(q).
-
(2) CFTrc = CFUrc.
-
(3) In both cases, PO is zero if . That is, the inspector with ρ “always” underestimates q if and only if
-
(4) POUrc(q) is concave for while POTrc(q) is concave for , and convex for .
As shown in <Figure 2(c)>, it can be observed that POTrc(q) ≥ POUrc(q) for 0 ≤ q ≤ 16.1% and POTrc(q) ≤ POUrc(q) for 16.1% ≤ q ≤ 27.7%. This property generally holds as proved in the following proposition. That is, PO by an inspector with (fA(a), β) is greater than or equal to that by an inspector with (gA(a), β) for and vice versa for .
Proposition 3.
-
Proof : Since equivalent to , we have, . Thus, (1) holds true. Similarly, is equivalent to , we have, . Thus, (2) holds true. □
A representative graph of POTrc(q) is shown in <Figure 2(a)>. <Figure 2(b)> shows the changing shape of POTrc(q) as ρ increases from 0.1 to 10. The CFTrc’s for ρ = 0.1, 1, and 10, are computed as 9.1%, 50%, and 90.9% respectively. For example, even if (αu, β) = (2 PPM, 1 PPM), which are extremely small values but correspond to ρ = 1, q is overestimated with more than 50% as long as q is smaller than 50%. <Figure 2(c)> shows the real case of a BLU (backlight unit) company in Korea, where POUrc(q) and POTrc(q) for ρ = 5.23 can be drawn as the dotted curve and the solid curve respectively.
Suppose that both q and ρ are input variables. Since POTrc(q) > 0.5 for , it follows that POTrc(q) > 0.5 if and only if . Hence, both Proposition 1 and 2 implies that every point (q, ρ) in two-dimensional region Rrc gives overestimation with PO > 0.5, where Rrc can be represented by . That is, an inspector with ρ overestimates q if (q, ρ) ∈Rrc, estimates q with PO = 0.5 if , and underestimates q otherwise. Note that if β = 0, then every point (q, 0) for 0 < q < 1 on the line ρ = 0 gives overestimation with PO > 0.5 since Qrc = (1 - q)A + q is always greater than q. Suppose that both αu and β are input variables and q is given as a constant. In the same manner above, both Proposition 1 and 2 implies that every point (αu , β) in twodimensional region Rrc′ gives overestimation with PO > 0.5 where . That is, an inspector with (αu , β) overestimates with PO > 0.5 if (αu, β )∈Rrc ′ , estimates q with PO = 0.5 if , and underestimates q with PO < 0.5 otherwise.
4 Analysis of Model T(C, R)
Suppose that Ai = α for all i where α is a constant with 0 < α≤ 1, and that Bi’s are i.i.d with fB(b). Since , and α can be regarded as the average of type I error, ρ can be obtained as , and it turns out that POTcr(q) is a function of two variables ρ as proved in the following proposition. In order to compare different results derived from two different distributions, we attach the previous proposition based on the uniform distribution.
Proposition 4. Under the assumption that Ai = α for all i where α is a constant with 0 < α ≤ 1 and Bi’s are i.i.d with fB(b) for 0 < b ≤ βu, and 0 < βu ≤ 1, we have,
-
(1)
-
(2) POTcr1(q) is a strictly decreasing concave function of q for and POTcr2(q) is a strictly decreasing convex function of q for ,
-
(3) POTcr(q) is continuous and differentiable at ,
-
(4) CFTcr, and
-
(5) the inspector with ρ overestimates q with PO > 0.5 for 0 ≤ q < CFTcr, estimates q with PO = 0.5 for q = CFTcr, and underestimates q with PO < 0.5 for CFTcr < q ≤ 1.
Proof : (1) From (1) and (2), we have,
If q = 0, POTcr(0) becomes one since Qcr = α is always greater than q = 0. If q = α = 0, then since Qcr(= 0) is always equal to q(= 0), every inspector with α = 0 always estimates correctly regardless of any distribution, and by the definition of PO, POTcr(0) is not one but 50% in this case. This result implies that POTcr(0) can be either one or 50% depending on the value of α . Since we assume that 0 < α ≤ 1, we have, POTcr(0) = 1.
If q ≠ 0, (5) can be further reduced to
where . Note that the value of depends upon the value of .
For , we have,
For , we have,
(2) Since where h(q) = 2q(2ρq-1+q)+1 < 0 for . POTcr1(q) is a strictly decreasing concave function of q. Since and POTcr2(q) is a strictly decreasing convex function of q for .
(3) and (4) Since and (3) and (4) hold true.
(5) From (1), (2), (3) and (4), it follows that (5) holds true. □
Proposition 5. (Yang and Chang [5]) Under the assumption that Ai = α for all i where α is a constant with 0 < α ≤ 1 and Bi’s are i.i.d with , and , they have,
(1)
(2) POUcr(q) is a strictly decreasing convex function of ,
(3) , and
(4) the inspector with overestimates q with PO > 0.5 for 0 ≤ q < CFUcr, estimates q with PO = 0.5 for q = CFUcr, and underestimates q with PO < 0.5 for CFUcr < q ≤ 1.
It can be observed from two propositions above that
(1) POTcr(q) ≠ POUcr(q)
2) In both cases, PO is one if .
(3) CFTcr = CFUcr
(4) POUcr(q) is convex for while POTcr(q) 5 is concave for , and convex for .
As shown in <Figure 3(c)>, it can be observed that POTcr1(q) ≥ POUcr(q) for 8.7% ≤ q ≤ 16% and POTcr2(q) ≤ POUcr(q) for 16% ≤ q ≤ 100%. This property generally holds as proved in the following proposition. That is, PO by an inspector with (α , fB(b)) is greater than or equal to that by an inspector with (α, gB(b)) for and vice versa for Note that equality holds when
Proposition 6.
Proof : Since is equivalent to , we have,
Thus, (1) holds true. Similarly, is equivalent to , we have, . Thus, (2) holds true. □
A representative graph of POTcr(q) is shown in <Figure 3(a)>. <Figure 3(b)> shows the changing shape of POTcr(q) as ρ increases from 0.1 to 10. The values of CFTcr’s for ρ = 0.1, 1, and 10, are computed as 9.1%, 50%, and 90.9% respectively. Note that they are exactly the same as those values of the CFTrc’s as proved in Proposition 1. <Figure 3(c)> shows the case of the BLU company in Korea, where POUcr(q) and POTcr(q) for ρ = 5.23 can be drawn as the dotted curve and the solid curve respectively.
Suppose that both q and ρ are input variables. Since POTcr(q) > 0.5 for (or equivalently, ), both Proposition 4 and 5 implies that every point (q, ρ) in two-dimensional region Rcr gives overestimation with PO > 0.5 where Rcr = . That is, an inspector with ρ overestimates q if (q, ρ) ∈Rcr, estimates q with PO = 0.5 if , and underestimates q with PO < 0.5 otherwise. Note that if α = 0, then every point (q, 0) for 0 < q < 1 gives underestimation since Pr{Qcr > q} = Pr{B < 0} = 0.
Suppose that both α and βu are input variables and q is given as a constant. Similarly, since the condition above is equivalent to , both Proposition 4 and 5 implies that every point (α, βu ) in two-dimensional region R’cr gives overestimation with PO > 0.5 where . That is, an inspector with (α, βu ) overestimates with PO > 0.5 if (α, βu )∈R’cr, estimates q with PO = 0.5 if , and underestimates q with PO < 0.5 otherwise.
5 Analysis of Model T (R, R)
Suppose that Ai’s and Bi’s are i.i.d with fA(a) and fB(b) respectively. Since and , ρ can be obtained as , and it turns out that POTrr(q) is a function of two variables ρ and q as shown in <Figure 4(a)>. In order to compare different results derived from two different distributions, we attached the previous result based on the uniform distributions at the end of Proposition 7.
Proposition 7. Under the assumption that for all i, Ai’s and Bi’s are i.i.d with fA(a) and fB(b) respectively for 0 < a ≤ αu , 0 < b ≤ βu and 0 < αu , βu ≤ 1, we have,
(1)
(2) POTrr(q) is a strictly decreasing function of q, and POTrr1(q) is concave while POTrr4(q) is convex.
(3) POTrr(q) is continuous and differentiable in the open interval (0, 1), i.e., we have
(4) , and
(5) the inspector with ρ overestimates q with PO > 0.5 for 0 ≤ q < CFTrr, estimates q with PO = 0.5 for q = CFTrr, and underestimates q with PO < 0.5 for CFTrr < q ≤ 1.
Proof : Since the proof is too long, it is attached in Appendix. □
Proposition 8. (Yang and Chang [5]) Under the assumption that for all i, Ai’s and Bi’s are i.i.d with respectively for , they have,
(2) POUrr1(q) is a strictly decreasing concave function of q with POUrr1(0) = 1, and POUrr2(q) is a strictly decreasing convex function of q with POUrr2(1) = 0,
(3) , and
(4) the inspector with ρ overestimates q with PO > 0.5 for 0 ≤ 0 < CFUrr, estimates q with PO = 0.5 for q = CFUrr, and underestimates q with PO < 0.5 for CFUrr < q ≤ 1.
It can be observed from two propositions above that
(1) POTrr(q) ≠ POUrr(q).
(2) CFTrr = CFUrr.
(3) POUrr1(q) is concave for and POUrr1(q) is convex for while POTrr1(q) is concave for and POTrr4(q) is convex for
As shown in <Figure 4(c)>, it can be observed that POTrr(q) ≥ POUrr(q) for 0 ≤ q ≤ 16% and POTrr(q) ≤ POUrr(q) for 16% ≤ q ≤ 100%. This property generally holds as proved in the following proposition. In other words, PO by an inspector with (fA(a), fB(b)) is greater than or equal to that by an inspector with gA(a), gB(b) for 0 ≤ q ≤ , and vice versa for . Note that equality holds when .
Proposition 9.
A representative graph of POTrr(q) can be drawn as shown in <Figure 4(a)>. <Figure 4(b)> shows the changing shape of POTrr(q) as ρ increases from 0.1 to 10. The CFTrr’s for ρ = 0.1, 1, and 10, are computed as 9.1%, 50%, and 90.9% respectively. These results are exactly the same as those of two previous models. Even though it may be not easy to detect from the graphs only with the eye, the convexity and concavity of POTrr2(q) and POTrr3(q), it turns out that (1) for ρ = 0.1, POTrr2(q) is concave while POTrr3(q) is concave in the first part of its own interval and convex in the last part, (2) for ρ = 1, POTrr2(q) is concave while POTrr3(q) is convex, and (3) for ρ = 10, POTrr2(q) is concave in the first part of its own interval and convex in the last part while POTrr3(q) is convex. In other words, POTrr2(q) and POTrr3(q) can be either concave or convex or both depending upon the value of ρ. <Figure 4(c)> shows the case of the BLU company in Korea, where POUrr(q) and POTrr(q) for ρ = 5.23 can be drawn as the dotted curve and the solid curve respectively.
Suppose that both q and ρ are input variables. Since (or equivalently, ), both Proposition 7 and 8 implies that every point (q, ρ) in two-dimensional region Rrr gives overestimation with PO > 0.5 where . That is, an inspector with ρ overestimates q if (q, ρ) ∈ Rrr, estimates q with PO = 0.5 if ρ = qs , and underestimates q with PO < 0.5 otherwise. Note that if α = 0, then every point (q, 0) for 0 < q < 1 gives underestimation since Pr{Qrr > q} = Pr{B < 0} = 0.
Suppose that both αu and βu are input variables and q is given as a constant. In the similar manner above, both Proposition 7 and 8 implies that every point (αu , βu ) in twodimensional region R’rr gives overestimation with PO > 0.5 where . That is, an inspector with (αu , βu) overestimates with PO > 0.5 if (αu, βu) ∈, estimates q with PO = 0.5 if and underestimates q with PO < 0.5 otherwise.
6 Summary
From the propositions proved or attached in sections through 3 to 6, the following theorem holds true.
Theorem 7. Assuming an infinite sequence of items with a known FD q, and nonzero inspection errors with either the uniform or the triangular distribution, we have,
-
(1) an imperfect inspector with ρ has his/her/its own PO curve and CF,
-
(2) PO is a function of two variables q and ρ,
-
(3) PO is a decreasing function of q, PO(q), with PO(0) = 1 and PO(1) = 0,
-
(4) there always exists a unique which depends only on inspection errors and not q, and
-
(5) the inspector overestimates q with PO > 0.5 for 0 ≤ q < CF, estimates q with PO = 0.5 for q = CF, and underestimates q with PO < 0.5 for CF < q ≤ 1,
-
(6) PO by an inspector with (fA (a), β) or (α, fB (b)) or ( fA(a), fB (b)) is greater than or equal to that by an inspector with (gA (a), β ) or (α, gB (b)) or (gA (a), gB (b) ) for 0 ≤ q ≤ CF, and the former is smaller than or equal to the latter for CF ≤ q ≤ 1. Note that equality holds when q = CF.
As conjectured by Yang and Chang [4], we have proved that their conjecture still holds true for the case of the symmetrical triangular distribution. However, statement (2) may be changed to a statement that PO is a function of one combined variable, for example, since all formulas above can be expressed as the function of the combined variable. Note that this conversion gives x ≥ 0 and that can be converted to 0.5, 1, and 2 respectively. For example, POTrr2(q) can be expressed as
This implies that PO depends ultimately upon , and that when the values of (ρ, q) and (ρ′, q′) are given respectively, the value of PO is same if and only if
7 Discussion and Concluding Remarks
We started this research expecting both at least significantly different results from Yang and Chang [4]. However, regardless of the uniform or the triangular error distribution, the critical fraction defectives were same except the value of PO and the concavity/convexity of a PO curve. Our strong conjecture is that this phenomenon might result from two assumptions : the symmetry of the probability density functions and the error intervals with a zero lower bound. In other words, our strong conjecture is that the critical fraction defective could be derived differently if the symmetry is not satisfied and/or the interval with a nonzero lower bound would be assumed. Further research may be concentrated on a skewed triangular distribution defined in an interval with a nonzero lower bound or a zero lower bound, at the cost of mathematical complexity. If PO is to be greater than 50% even under the new assumptions, we cannot help but agree with their conclusion that all commercial inspection plans should be revised with the concept of PO in the near future, for the fairness of commercial trades. Furthermore, as mentioned in Yang and Chang [4], since our mathematical models do not consider any related costs, a cost-based optimization model with the PO concept could be constructed to determine a trade-off point between buyers and sellers. We hope that the concept of PO should become one of the major criteria in the future, and that our PO functions would be widely used since they are more accurate than the previous PO functions.