9. A Mathematical Approach to Categorization and Labeling of Qualitative Data: The Latent Categorization Method

As text databases increasingly become available to researchers, the limits to human cognition are rapidly reached. Focusing on examining objective realities, this paper introduces the latent categorization method, a novel new research method for analysis of large and midsize data sets. This method clusters text artifacts and extracts the words that were most important in creating the clusters. Further, it demonstrates a set of techniques for extracting knowledge from a representative data set involving 6135 abstracts from a variety of business-related journals.

[1]  Chris Ding,et al.  On the Use of Singular Value Decomposition for Text Retrieval , 2000 .

[2]  Steven A. Sloman,et al.  Categorization versus similarity: the case of container names , 2001, Similarity and Categorization.

[3]  Jesper B. Sørensen,et al.  Aging, Obsolescence, and Organizational Innovation , 2000 .

[4]  C. Eckart,et al.  The approximation of one matrix by another of lower rank , 1936 .

[5]  Robert Krovetz,et al.  Viewing morphology as an inference process , 1993, Artif. Intell..

[6]  Laura B. Cardinal Technological Innovation in the Pharmaceutical Industry: The Use of Organizational Control in Managing Research and Development , 2001 .

[7]  David A. Hull Stemming Algorithms: A Case Study for Detailed Evaluation , 1996, J. Am. Soc. Inf. Sci..

[8]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[9]  Andrew H. Van de Ven,et al.  Learning the Innovation Journey: Order out of Chaos? , 1996 .

[10]  Susan T. Dumais,et al.  Using Linear Algebra for Intelligent Information Retrieval , 1995, SIAM Rev..

[11]  David A. Hull,et al.  A Detailed Analysis of English Stemming Algorithms , 2006 .

[12]  Earl R. Babbie,et al.  The practice of social research , 1969 .

[13]  Stephen E. Robertson,et al.  Overview of the Okapi projects , 1997, J. Documentation.

[14]  G. Lakoff Women, fire, and dangerous things : what categories reveal about the mind , 1989 .

[15]  Julie Beth Lovins,et al.  Development of a stemming algorithm , 1968, Mech. Transl. Comput. Linguistics.

[16]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[17]  Sabine Kuester,et al.  Retaliatory Behavior to New Product Entry , 1999 .

[18]  J. William Ahwood,et al.  CLASSIFICATION , 1931, Foundations of Familiar Language.

[19]  E. Rosch,et al.  Cognition and Categorization , 1980 .

[20]  Stephen E. Robertson,et al.  Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.

[21]  Paul Thompson,et al.  Finding Out About: A Cognitive Perspective on Search Engine Technology and the WWW , 2002, Information Retrieval.

[22]  E. B. Swanson,et al.  Information systems innovation among organizations , 1994 .

[23]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[24]  David A. Hull Stemming algorithms: a case study for detailed evaluation , 1996 .

[25]  Benoit B. Mandelbrot,et al.  Fractal Geometry of Nature , 1984 .

[26]  Eric Walden,et al.  The Impact of E-Commerce Announcements on the Market Value of Firms , 2001, Inf. Syst. Res..

[27]  Jacob Cohen,et al.  Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[28]  Elizabeth R. Jessup,et al.  Matrices, Vector Spaces, and Information Retrieval , 1999, SIAM Rev..

[29]  R. Chandy,et al.  Organizing for Radical Product Innovation: The Overlooked Role of Willingness to Cannibalize , 1998 .

[30]  Claudia Bird Schoonhoven,et al.  Community, Population, and Organization Effects on Innovation: A Multilevel Perspective , 1996 .

[31]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[32]  P. Sopp Cluster analysis. , 1996, Veterinary immunology and immunopathology.

[33]  Michael W. Berry,et al.  Understanding search engines: mathematical modeling and text retrieval (software , 1999 .

[34]  Garrison W. Cottrell,et al.  Representing documents using an explicit model of their similarities , 1995 .

[35]  R. Chandy,et al.  The Incumbent's Curse? Incumbency, Size, and Radical Product Innovation , 2000 .

[36]  Garrison W. Cottrell,et al.  Representing Documents Using an Explicit Model of Their Similarities , 1995, J. Am. Soc. Inf. Sci..

[37]  William M. Pottenger,et al.  Detecting Patterns in the LSI Term-Term Matrix , 2002 .

[38]  Peter J. Carnevale,et al.  Professional mediators' judgments of mediation tactics: Multidimensional scaling and cluster analyses. , 1991 .

[39]  Peter R. Monge,et al.  Communication and Motivational Predictors of the Dynamics of Organizational Innovation , 1992 .

[40]  K. Klein,et al.  The Challenge of Innovation Implementation , 1996 .

[41]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[42]  Michael R. Anderberg,et al.  Cluster Analysis for Applications , 1973 .

[43]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[44]  Frobisher Crescent Change and Complementarities in the New Competitive Landscape: A European Panel Study, 1992-1996 , 1999 .

[45]  James A. Hampton,et al.  Testing the Prototype Theory of Concepts , 1995 .

[46]  J. Kruschke,et al.  ALCOVE: an exemplar-based connectionist model of category learning. , 1992, Psychological review.

[47]  William K. Estes,et al.  Classification and cognition , 1994 .

[48]  Hans Peter Luhn,et al.  A Statistical Approach to Mechanized Encoding and Searching of Literary Information , 1957, IBM J. Res. Dev..

[49]  Matthew B. Miles,et al.  Qualitative Data Analysis: An Expanded Sourcebook , 1994 .

[50]  Luciana Libutti Building competitive skills in small and medium-sized enterprises through innovation management techniques: overview of an Italian experience , 2000, J. Inf. Sci..

[51]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[52]  Uta Wille,et al.  Qualitative Text Analysis Supported by Conceptual Data Systems , 1999 .

[53]  Mark A. Mone,et al.  Organizational Decline and Innovation: A Contingency Framework , 1998 .

[54]  Nelson P. Repenning,et al.  A Simulation-Based Approach to Understanding the Dynamics of Innovation Implementation , 2002, Organ. Sci..