A Coupon-Collector Model of Machine-Aided Discovery

Empirical studies of scientific discovery, so-called Eurekometrics, have indicated that the output of exploration follows a logistic growth curve. Although logistic functions are prevalent in explaining population growth that is resource-limited to a given carrying capacity, their derivations do not apply to discovery processes. This paper develops a generative model for logistic \emph{knowledge discovery} using a novel extension of coupon collection, in which an explorer interested in discovering all unknown elements of a set is supported by technology that can respond to queries. This discovery process is parameterized by the novelty and quality of the set of discovered elements at every time step, and randomness is demonstrated to improve performance. Simulation results provide further intuition on the discovery process.
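As a point of reference, the classical coupon-collector process (not the extension developed in this paper) can be simulated in a few lines. The sketch below assumes uniform sampling with replacement from a set of `n` elements; the names `collect_coupons` and `expected_distinct` are illustrative and do not come from the paper. Under these assumptions the expected number of distinct elements found after `t` draws is $n\,(1 - (1 - 1/n)^t)$, a curve that saturates at `n`.

```python
import random


def collect_coupons(n, steps, seed=0):
    """Simulate classical coupon collection (an assumed baseline).

    Each step draws one element uniformly at random, with replacement,
    from a set of n elements. Returns the cumulative number of distinct
    elements discovered after each step.
    """
    rng = random.Random(seed)
    seen = set()
    curve = []
    for _ in range(steps):
        seen.add(rng.randrange(n))
        curve.append(len(seen))
    return curve


def expected_distinct(n, t):
    """Expected number of distinct elements after t uniform draws."""
    return n * (1.0 - (1.0 - 1.0 / n) ** t)


if __name__ == "__main__":
    n, steps = 100, 600
    curve = collect_coupons(n, steps)
    # Compare one simulated run against the closed-form expectation.
    for t in (50, 100, 200, 400, 600):
        print(t, curve[t - 1], round(expected_distinct(n, t), 1))
```

This baseline curve is concave from the first draw, so it does not by itself show the slow initial phase of a logistic curve; it is included only to fix ideas about the underlying collection process.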
