论文信息 - Performance bounds on the splitting algorithm for binary testing

Performance bounds on the splitting algorithm for binary testing

SummaryIn machine fault-location, medical diagnosis, species identification, and computer decisionmaking, one is often required to identify some unknown object or condition, belonging to a known set of M possibilities, by applying a sequence of binary-valued tests, which are selected from a given set of available tests. One would usually prefer such a testing procedure which minimizes or nearly minimizes the expected testing cost for identification. Existing methods for determining a minimal expected cost testing procedure, however, require a number of operations which increases exponentially with M and become infeasible for solving problems of even moderate size. Thus, in practice, one instead uses fast, heuristic methods which hopefully obtain low cost testing procedures, but which do not guarantee a minimal cost solution. Examining the important case in which all M possibilities are equally likely, we derive a number of cost-bounding results for the most common heuristic procedure, which always applies next that test yielding maximum information gain per unit cost. In particular, we show that solutions obtained using this method can have expected cost greater than an arbitrary multiple of the optimal expected cost.

Ronald L. Graham | M. R. Garey | Dr. M. R. Garey | Dr. R. L. Graham

[1] Herbert Y. Chang. A distinguishability criterion for selecting efficient diagnostic tests , 1968, AFIPS '68 (Spring).

[2] R. J. Pankhurst,et al. A Computer Program for Generating Diagnostic Keys , 1970, Comput. J..

[3] S. E. LaMacchia. Diagnosis in Automatic Checkout , 1962, IRE Transactions on Military Electronics.

[4] Michael Randolph Garey. Optimal binary decision trees for diagnostic identification problems , 1970 .

[5] Herbert Y. Chang. An Algorithm for Selecting an Optimum Set of Diagnostic Tests , 1965, IEEE Trans. Electron. Comput..

[6] J A Barnett. Selection of tests for identifying yeasts. , 1971, Nature: New biology.

[7] Sang Joon Kim,et al. A Mathematical Theory of Communication , 2006 .

[8] L. Goddard. Information Theory , 1962, Nature.

[9] Sundaram Seshu,et al. On an Improved Diagnosis Program , 1965, IEEE Trans. Electron. Comput..

[10] David A. Huffman,et al. A method for the construction of minimum-redundancy codes , 1952, Proceedings of the IRE.

[11] R. Tosic. An optimal search procedure , 1980 .

[12] M. Garey. Optimal Binary Identification Procedures , 1972 .