Stochastic voting algorithms for Web services group testing

This paper proposes a stochastic voting for testing a large number of Web services (WS) under group testing. In the future, a large number of WS would be available and they need to be tested and evaluated in real time. While numerous test input generation techniques are available to generate test inputs, the oracle or the expected output of these test input is often difficult to obtain. One way to obtain the oracle in this case is to give the same input to multiple WS and to establish the oracle by a majority voting. This is based on the assumption that faulty WS often would not produce consistent results, and thus if a majority can be reached, the oracle can be established statistically. However, even correct WS may still produce slightly different outputs, and thus the majority-voting scheme must be carefully designed to distinguish correct but slightly variant output from truly incorrect output. This paper proposes a hierarchical classification based on simulated annealing and multi-dimensional Chi-square statistical techniques to analyze data to see if a majority can be reached. The algorithm is evaluated by a comprehensive simulated data as well as actual data. The data show that the proposed algorithm is effective even in a difficult situation where clusters of WS produce clusters of output.

[1]  Khaled S. Al-Sultan,et al.  A Tabu search approach to the clustering problem , 1995, Pattern Recognit..

[2]  D. W. Cooper Adaptive testing , 1976, ICSE '76.

[3]  Ben Choi,et al.  Bidirectional hierarchical clustering for Web mining , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[4]  Jon G. Rokne AN EASY BOUNDING CIRCLE , 1991 .

[5]  Richard C. Dubes,et al.  Experiments in projection and clustering by simulated annealing , 1989, Pattern Recognit..

[6]  Khaled S. Al-Sultan,et al.  Computational experience on four algorithms for the hard clustering problem , 1996, Pattern Recognit. Lett..

[7]  Jiawei Han,et al.  Efficient and Effective Clustering Methods for Spatial Data Mining , 1994, VLDB.

[8]  Wei-Tek Tsai,et al.  Voting multi-dimensional data with deviations for Web services under group testing , 2005, 25th IEEE International Conference on Distributed Computing Systems Workshops.

[9]  Cor J. Veenman,et al.  A Maximum Variance Cluster Algorithm , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  C. McDiarmid SIMULATED ANNEALING AND BOLTZMANN MACHINES A Stochastic Approach to Combinatorial Optimization and Neural Computing , 1991 .

[11]  Emile H. L. Aarts,et al.  Simulated Annealing: Theory and Applications , 1987, Mathematics and Its Applications.

[12]  Hans-Peter Kriegel,et al.  A distribution-based clustering algorithm for mining in large spatial databases , 1998, Proceedings 14th International Conference on Data Engineering.

[13]  Shokri Z. Selim,et al.  A simulated annealing algorithm for the clustering problem , 1991, Pattern Recognit..

[14]  Raymond A. Paul,et al.  Adaptive testing, oracle generation, and test case ranking for Web services , 2005, 29th Annual International Computer Software and Applications Conference (COMPSAC'05).

[15]  Georges Gardarin,et al.  Gradual clustering algorithms , 2001, Proceedings Seventh International Conference on Database Systems for Advanced Applications. DASFAA 2001.

[16]  Darrell Whitley,et al.  Scheduling problems and traveling salesman: the genetic edge recombination , 1989 .

[17]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[18]  Raymond A. Paul,et al.  Testing Web Services Using Progressive Group Testing , 2004, AWCC.

[19]  Andrew S. Glassner,et al.  Graphics Gems , 1990 .

[20]  Raymond A. Paul,et al.  A software reliability model for web services , 2004, IASTED Conf. on Software Engineering and Applications.

[21]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[22]  Jiawei Han,et al.  CLARANS: A Method for Clustering Objects for Spatial Data Mining , 2002, IEEE Trans. Knowl. Data Eng..

[23]  Chih-Ping Wei,et al.  Empirical comparison of fast partitioning-based clustering algorithms for large data sets , 2003, Expert Syst. Appl..

[24]  D. Du,et al.  Combinatorial Group Testing and Its Applications , 1993 .

[25]  GeneticAlgorithmsVladimir Estivill Spatial Clustering for Data Mining with , 1997 .

[26]  Donald R. Jones,et al.  Solving Partitioning Problems with Genetic Algorithms , 1991, International Conference on Genetic Algorithms.

[27]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[28]  Jack Ritter,et al.  An efficient bounding sphere , 1990 .

[29]  James Arvo,et al.  Graphics Gems II , 1994 .

[30]  Donald E. Brown,et al.  A practical application of simulated annealing to clustering , 1990, Pattern Recognit..

[31]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[32]  Fred W. Glover,et al.  Future paths for integer programming and links to artificial intelligence , 1986, Comput. Oper. Res..

[33]  Chih-Ping Wei,et al.  Empirical comparison of fast clustering algorithms for large data sets , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[34]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .