Information categorisation: an emergent approach

The explosion of information and of naive users on the Internet has highlighted problems of effective access to information. One response to the problem of effective access to information is to classify the information into categories based on the nature of the information being classified. Existing information classifications are typically developed by committees or imposed by organisations and have proved difficult to maintain. This investigation developed a two phase method to systematically determine and analyse information categories in a specific domain as perceived by domain experts. The initial phase, the Term Extraction Phase, applied the librarianship approach of literary warrant guided by Ingarden’s Ontology of Literature to research papers from a specific domain to discover what is studied in the domain. The approach is significant in that it draws upon rigorous and philosophically compatible bodies of work in two areas. Firstly, from work addressing the nature, existence, and categorisation of literary expression found in research papers. Secondly, from qualitative research methods addressing how meaningful terms can be analysed in text and related to each other. We have found that such a guiding ontological theory can be used to seed coding families giving rise to a viable method for generating categorisations for further research. We have also found that the key guiding unit of analysis operationalising Ingarden’s approach is the “reported research activity” and that the process is practical although labour intensive. The second phase, the Term Categorisation Phase, used the librarianship approach of consensus to have domain experts form categories from the terms generated in the first phase. Examining those categories using pairwise comparisons allowed the identification of similar categories based on the common categorisation of terms in the coding family. The pairwise comparisons were undertaken manually, but the development of an automated tool to perform these comparisons would enhance this aspect of the phase. Boisot’s Social Learning Cycle (SLC) was used as a model with which to explain category variations. The single performance of the Term Categorisation Phase undertaken in this investigation demonstrated the value of the SLC for explaining the variations between domain experts, and showed the potential for explaining category changes over time using the SLC and repeated performances of the Term Categorisation Phase. This investigation makes a number of contributions. The investigation demonstrated that the two librarianship approaches of literary warrant and consensus are not necessarily mutually exclusive and that both have much to offer at different stages of the categorisation process. A method was devised which provides a more rigorous and systematic approach to analysing and categorising text. The method consists of two phases which are loosely coupled and could be used independently. A very significant aspect is the ability to view categorisation as a dynamic process. That enables the examination of categorisation and classification schemes and for the identification of areas within those schemes which require attention. The method is not a tool to develop a complete classification scheme, but seeks to contribute insights on how to progress the development of mature schemes.

[1]  Ron Weber,et al.  Editor’s comments , 2003, MIS Q..

[2]  Russell L. Ackoff,et al.  On purposeful systems , 1972 .

[3]  P. Beynon-Davies E-Business , 2004 .

[4]  Raghu Ramakrishnan,et al.  Database Management Systems , 1976 .

[5]  Eleanor Rosch,et al.  Principles of Categorization , 1978 .

[6]  Ted Boren,et al.  Thinking aloud: reconciling theory and practice , 2000 .

[7]  Ian C. MacMillan,et al.  Delineating a forum for business policy scholars , 1987 .

[8]  Rudy Hirschheim,et al.  Crisis in the IS Field? A Critical Reflection on the State of the Discipline , 2003, J. Assoc. Inf. Syst..

[9]  G. Lakoff,et al.  Women, Fire, and Dangerous Things: What Categories Reveal about the Mind , 1988 .

[10]  Dennis F. Galletta,et al.  Mis research directions: a survey of researchers' views , 1991, DATB.

[11]  B. Hofer Epistemological Understanding as a Metacognitive Process: Thinking Aloud During Online Searching , 2004 .

[12]  Steven Walczak,et al.  A Re-Evaluation of Information Systems Publication Forums , 1999 .

[13]  Graham Pervan,et al.  The status of information systems research in Australia: preliminary results , 2001 .

[14]  Willard Van Orman Quine,et al.  From a Logical Point of View , 1955 .

[15]  Barry Smith AN ESSAY IN FORMAL ONTOLOGY , 1978 .

[16]  Simon K. Milton An ontological comparison and evaluation of data modelling frameworks , 2000 .

[17]  M. D. Myers,et al.  Qualitative Research in Business & Management , 2008 .

[18]  Michael C. Calver,et al.  What makes a journal international? A case study using conservation biology journals , 2010, Scientometrics.

[19]  David Rooney,et al.  Handbook on the Knowledge Economy , 2008 .

[20]  Peter Weill,et al.  The Implications of Information Technology Infrastructure for Business Process Redesign , 1999, MIS Q..

[21]  Elin K. Jacob,et al.  Classification and Categorization: A Difference that Makes a Difference , 2004, Libr. Trends.

[22]  Roman Ingarden,et al.  Vom Erkennen des literarischen Kunstwerks , 1970 .

[23]  Robert J. Kauffman,et al.  50th Anniversary Article: The Evolution of Research on Information Systems: A Fiftieth-Year Survey of the Literature in Management Science , 2004, Manag. Sci..

[24]  Detmar W. Straub,et al.  IS Bibliographic Repository (ISBIB): A Central Repository of Research Information for the IS Community , 2002, Commun. Assoc. Inf. Syst..

[25]  Nikolaos A. Mylonopoulos,et al.  Global perceptions of IS journals , 2001 .

[26]  S. Debowski Knowledge Management , 2005 .

[27]  G. E. Moore Proof of an external world , 1939 .

[28]  Mark L. Gillenson,et al.  Academic Issues in MIS: Journals and Books , 1991, MIS Q..

[29]  D. Rondinelli,et al.  Panacea, common sense, or just a label?: The value of ISO 14001 environmental management systems , 2000 .

[30]  Jody Bales Foote,et al.  Reclassification in Academic Research Libraries: Is It Still Relevant in an E-Book World? , 2011 .

[31]  Barry Smith,et al.  Formal ontology, common sense and cognitive science , 1995, Int. J. Hum. Comput. Stud..

[32]  Rex G. Cammack,et al.  Basic-Level Geographic Categories∗ , 1996 .

[33]  P. Liamputtong Qualitative Research Methods , 2005 .

[34]  Cathy Urquhart,et al.  Putting the ‘theory’ back into grounded theory: guidelines for grounded theory studies in information systems , 2009, Inf. Syst. J..

[35]  Alan Gilchrist,et al.  Thesauri, taxonomies and ontologies - an etymological note , 2003, J. Documentation.

[36]  Bill C. Hardgrave,et al.  Forums for management information systems scholars , 1995, CACM.

[37]  Merilyn Annells,et al.  Grounded Theory Method: Philosophical Perspectives, Paradigm of Inquiry, and Postmodernism , 1996 .

[38]  Bill C. Hardgrave,et al.  Forums for information systems scholars: III , 2001, Inf. Manag..

[39]  J. Daniel Couger,et al.  IS '95: guidelines for undergraduate IS curriculum , 1995 .

[40]  Amie L. Thomasson Fiction and Intentionality , 1996 .

[41]  Nicola Guarino,et al.  Formal ontology, conceptual analysis and knowledge representation , 1995, Int. J. Hum. Comput. Stud..

[42]  Ephraim R. McLean,et al.  Information Systems Success: The Quest for the Dependent Variable , 1992, Inf. Syst. Res..

[43]  Barry Smith,et al.  Ontology with Human Subjects Testing: An Empirical Investigation of Geographic Categories , 1998 .

[44]  J. Daniel Couger,et al.  IS'95: Guideline for Undergraduate IS Curriculum , 1995, MIS Q..

[45]  C. Hall,et al.  Publish and perish? Bibliometric analysis, journal ranking and the assessment of research quality in tourism , 2011 .

[46]  Rudy Hirschheim,et al.  Towards a distinctive body of knowledge for Information Systems experts: coding ISD process knowledge in two IS journals , 2004, Inf. Syst. J..

[47]  Roland Holten,et al.  Deriving an IS-Theory from an Epistemological Position , 2007 .

[48]  W. Montague,et al.  Category norms of verbal items in 56 categories A replication and extension of the Connecticut category norms , 1969 .

[49]  K. A. Ericsson,et al.  Protocol Analysis: Verbal Reports as Data , 1984 .

[50]  Björn Niehaves,et al.  Epistemological perspectives on IS research: a framework for analysing and systematizing epistemological assumptions , 2007, Inf. Syst. J..

[51]  Debra Howcroft,et al.  Grounded Theory: never knowingly understood , 2000 .

[52]  Thomas R. Gruber,et al.  Ontolingua: a mechanism to support portable ontologies , 1991 .

[53]  A. Strauss Basics Of Qualitative Research , 1992 .

[54]  Yair Wand,et al.  Ontology as a foundation for meta-modelling and method engineering , 1996, Inf. Softw. Technol..

[55]  J. Rée,et al.  The Translation of Philosophy , 2001 .

[56]  Yair Wand,et al.  Theoretical foundations for conceptual modelling in information systems development , 1995, Decis. Support Syst..

[57]  E. Rosch,et al.  Cognition and Categorization , 1980 .

[58]  Guy G. Gable,et al.  The Information Systems Academic Discipline in Australia , 2011, Commun. Assoc. Inf. Syst..

[59]  Ulrich Frank,et al.  Different Paths of Development of Two Information Systems Communities: A Comparative Study Based on Peer Interviews , 2008, Commun. Assoc. Inf. Syst..

[60]  B. Glaser Doing grounded theory : issues and discussions , 1998 .

[61]  Bamshad Mobasher,et al.  Personalized recommendation in social tagging systems using hierarchical clustering , 2008, RecSys '08.

[62]  Werner Kuhn,et al.  Semantic interoperability: A central issue for sharing geographic information , 1999 .

[63]  Simon K. Milton,et al.  Indexing research: an approach to grounding ingarden's ontological framework , 2006 .

[64]  Wanda J. Orlikowski,et al.  CASE Tools as Organizational Change: Investigating Incremental and Radical Changes in Systems Development , 1993, MIS Q..

[65]  T. Saaty Relative measurement and its generalization in decision making why pairwise comparisons are central in mathematics for the measurement of intangible factors the analytic hierarchy/network process , 2008 .

[66]  Barry Smith,et al.  Framework for formal ontology , 1983 .

[67]  Nick Haslam,et al.  Possible research area bias in the Excellence in Research for Australia (ERA) draft journal rankings , 2010 .

[68]  Juhani Iivari,et al.  A Paradigmatic Analysis of Information Systems As a Design Science , 2007, Scand. J. Inf. Syst..

[69]  Melvil Dewey,et al.  A Classification and Subject Index for Cataloguing and Arranging the Books and Pamphlets of a Library , 2006 .

[70]  Alan Sangster,et al.  The ERA: A Brave New World of Accountability for Australian University Accounting Schools , 2010 .

[71]  Peter Tarasewich,et al.  Global perceptions of journals publishing e-commerce research , 2002, CACM.

[72]  M. Ferguson-Hessler,et al.  Cognitive structures of good and poor novice problem solvers in physics , 1986 .

[73]  Ron Weber,et al.  Research Commentary: Information Systems and Conceptual Modeling - A Research Agenda , 2002, Inf. Syst. Res..

[74]  Ian Horrocks,et al.  OIL in a Nutshell , 2000, EKAW.

[75]  P. Winch The Idea of a Social Science and Its Relation to Philosophy , 1960 .

[76]  Ken Herold,et al.  Librarianship and the Philosophy of Information. , 2001 .

[77]  Victor R. Prybutok,et al.  Relationship among organizational support, JIT implementation, and performance , 2001, Ind. Manag. Data Syst..

[78]  Simon K. Milton,et al.  The reality of information systems research , 2004 .

[79]  Amie L. Thomasson Fiction and metaphysics , 1998 .

[80]  Dirk S. Hovorka,et al.  Analyzing unstructured text data: Using latent categorization to identify intellectual communities in information systems , 2008, Decis. Support Syst..

[81]  A. Strauss,et al.  The discovery of grounded theory: strategies for qualitative research aldine de gruyter , 1968 .

[82]  Kalle Lyytinen,et al.  Nothing At The Center?: Academic Legitimacy in the Information Systems Field , 2004, J. Assoc. Inf. Syst..

[83]  Mark Keil,et al.  Theorizing in information systems research: A reflexive analysis of the adaptation of theory in information systems research , 2006, J. Assoc. Inf. Syst..

[84]  Ulrich Frank,et al.  Towards a pluralistic conception of research methods in information systems research , 2006 .

[85]  H. Klein,et al.  Information systems research: contemporary approaches and emergent traditions , 1991 .

[86]  Jonathan Grudin,et al.  Enterprise Knowledge Management and Emerging Technologies , 2006, Proceedings of the 39th Annual Hawaii International Conference on System Sciences (HICSS'06).

[87]  B. Tversky,et al.  Categories of environmental scenes , 1983, Cognitive Psychology.

[88]  Richard Baskerville,et al.  Diversity in information systems action research methods , 1998 .

[89]  Joyce J. Elam,et al.  The Nature of DSS literature Presented in major is Conference Proceedings (1980-1985) , 1986, ICIS.

[90]  N. Mohaghegh,et al.  WHY THE IMPACT FACTOR OF JOURNALS SHOULD NOT BE USED FOR EVALUATING RESEARCH , 2005 .

[91]  M. Toleman,et al.  The long march: a novice researcher's journey of discovery through the research methodological and philosophical maze and haze , 2006 .

[92]  Gordon B. Davis,et al.  A Framework for Research in Computer-Based Management Information Systems , 1980 .

[93]  E. Trauth Qualitative Research in IS: Issues and Trends , 2001 .

[94]  Moonja P. Kim,et al.  The Method of Sorting as a Data-Gathering Procedure in Multivariate Research. , 1975, Multivariate behavioral research.

[95]  Simon Linacre,et al.  Producing Spaces for Academic Discourse: The Impact of Research Assessment Exercises and Journal Quality Rankings , 2010 .

[96]  Paula M. C. Swatman,et al.  Information Systems Research Methods: The Technology Transfer Problem 1 , 1994 .

[97]  Henry Evelyn Bliss,et al.  The Organization of Knowledge in Libraries and the Subject-Approach to Books , 1933 .

[98]  Ann Majchrzak,et al.  Perceived Individual Collaboration Know-How Development Through Information Technology-Enabled Contextualization: Evidence from Distributed Teams , 2005, Inf. Syst. Res..

[99]  Michael Gorman,et al.  Anglo-American Cataloguing Rules , 1967 .

[100]  William M. K. Trochim,et al.  An introduction to concept mapping for planning and evaluation. , 1989 .

[101]  Peter Checkland,et al.  Systems Thinking, Systems Practice , 1981 .

[102]  Ron Weber,et al.  On the ontological expressiveness of information systems analysis and design grammars , 1993, Inf. Syst. J..

[103]  Ian C. MacMillan,et al.  The emerging forum for business policy scholars , 1991 .

[104]  Ernest Sosa,et al.  A Companion to Metaphysics , 1995 .

[105]  Shirley Gregor,et al.  A Theory of Theories in Information Systems , 2002 .

[106]  K. A. Ericsson,et al.  Verbal reports as data. , 1980 .

[107]  A. C. Foskett,et al.  The subject approach to information , 1969 .

[108]  George M. Giaglis,et al.  A research framework for analysing eBusiness models , 2004, Eur. J. Inf. Syst..

[109]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[110]  Pairin Katerattanakul,et al.  IS Journal Rankings Versus Citation Analysis: Consistency and Concerns , 2003, AMCIS.

[111]  M. Boisot Information space : a framework for learning in organizations, institutions and culture , 2013 .

[112]  Nigel P. Melville,et al.  Theories Used in Information Systems Research: Identifying Theory Networks in Leading IS Journals , 2009, ICIS.

[113]  K. Peffers,et al.  Identifying and Evaluating the Universe of Outlets for Information Systems Research: Ranking the Journals , 2003 .

[114]  Bg Glaser,et al.  The grounded theory perspective Theoretical coding. , 2005 .

[115]  Ron Weber,et al.  On the deep structure of information systems , 1995, Inf. Syst. J..

[116]  Shirley Gregor,et al.  The struggle towards an understanding of theory in information systems , 2005 .

[117]  Venkataraman Ramesh,et al.  Research in Information Systems: An Empirical Study of Diversity in the Discipline and Its Journals , 2002, J. Manag. Inf. Syst..

[118]  Anne Lazaraton,et al.  Quantitative Research Methods , 2005 .

[119]  Nick F. Pidgeon,et al.  The Use of Grounded Theory for Conceptual Analysis in Knowledge Elicitation , 1991, Int. J. Man Mach. Stud..

[120]  A. Lin Knowledge Assets: Securing Competitive Advantage in the Information Economy , 2001 .

[121]  Zdeněk Salzmann,et al.  Language, Culture, and Society: An Introduction to Linguistic Anthropology , 1993 .

[122]  R. Steiner,et al.  The Three Worlds , 2011 .

[123]  Jonny Holmström Theorizing in IS Research: What Came Before and What Comes Next? , 2005, Scand. J. Inf. Syst..

[124]  W. Quine On What There Is , 1948 .

[125]  Brian Fitzgerald,et al.  A systemic framework for the field of information systems , 2001, DATB.

[126]  Weixiong Zhang Search techniques , 2002 .

[127]  Suzanne Rivard,et al.  A Keyword Classification Scheme for IS Research Literature: An Update , 1993 .

[128]  Maarten van Someren,et al.  The Think Aloud Method: A Practical Guide to Modelling Cognitive Processes , 1994 .

[129]  Clare Beghtol Domain analysis, literary warrant, and consensus: the case of fiction studies , 1995 .

[130]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[131]  C. Urquhart An encounter with grounded theory: tackling the practical and philosophical issues , 2001 .

[132]  Björn Niehaves,et al.  On Episemological Diversity in Design Science: New Vistas for a Design-Oriented IS Research? , 2007, ICIS.

[133]  M. Boisot Exploring the information space: a strategic perspective on information systems , 2004 .

[134]  J. Daniel Couger,et al.  IS '97: model curriculum and guidelines for undergraduate degree programs in information systems , 1996, IS '97.

[135]  Shirley Gregor,et al.  The Nature of Theory in Information Systems , 2006, MIS Q..

[136]  Barry Smith,et al.  The Basic Tools of Formal Ontology , 1998 .

[137]  Michael Baur,et al.  Blackwell Companions to Philosophy , 2011 .

[138]  Henny P. A. Boshuizen,et al.  Expertise-related differences in conceptual and ontological knowledge in the legal domain , 2008 .

[139]  Ron Weber,et al.  An Ontological Model of an Information System , 1990, IEEE Trans. Software Eng..

[140]  Bernard J. Jaworski,et al.  E-Commerce , 2021, Strategic International Restaurant Development.

[141]  Walter Daniel Fernandez,et al.  Metateams in major information technology projects : a grounded theory on conflict, trust, communication, and cost , 2003 .

[142]  Frances G. Livingston Dewey Decimal Classification and Relative Index. , 1966 .

[143]  Detmar W. Straub,et al.  Normative standards for IS research , 1994, DATB.

[144]  Alan L. Rector,et al.  Web ontology segmentation: analysis, classification and use , 2006, WWW '06.

[145]  Deborah Bunker,et al.  Australian Eclecticism and Theorezing in Information Systems Research , 2007, Scand. J. Inf. Syst..

[146]  Gordon B. Davis,et al.  Model Curriculum and Guidelines for Undergraduate Degree Programs in Information Systems , 1997 .

[147]  Björn Lundell,et al.  2G.">On the adaptation of Grounded Theory procedures: insights from the evolution of the 2G , 2005, Inf. Technol. People.

[148]  J. H. Muirhead,et al.  A DEFENCE OF COMMON SENSE , 2004 .

[149]  I. Nonaka,et al.  The Knowledge Creating Company , 2008 .

[150]  Robert D. Galliers,et al.  Trans-disciplinary research in information systems , 2004, Int. J. Inf. Manag..

[151]  Soongoo Hong,et al.  Objective quality ranking of computing journals , 2003, CACM.

[152]  Ephraim R. McLean,et al.  The DeLone and McLean Model of Information Systems Success: A Ten-Year Update , 2003, J. Manag. Inf. Syst..

[153]  Fatemeh Zahedi,et al.  The Analytic Hierarchy Process—A Survey of the Method and its Applications , 1986 .

[154]  Richard Baskerville,et al.  Information Systems as a Reference Discipline , 2002, MIS Q..

[155]  Simon K. Milton,et al.  An Exploratory Study of Information Systems Subject Indexing , 2003 .

[156]  Walter Fernandez,et al.  The Grounded Theory Method and Case Study Data in IS Research: Issues and Design , 2005 .

[157]  Andrew B. Whinston,et al.  Operationalizing the Essential Role of the Information Technology Artifact in Information Systems Research: Gray Area, Pitfalls, and the Importance of Strategic Ambiguity , 2004, MIS Q..

[158]  Mike Metcalfe,et al.  Theory: Seeking a Plain English Explanation , 2004 .

[159]  Suzanne Rivard,et al.  An Information Systems Keyword Classification Scheme , 1988, MIS Q..

[160]  Nicola Guarino,et al.  Sweetening Ontologies with DOLCE , 2002, EKAW.

[161]  Anthony R. Hendrickson,et al.  Research Commentary. Academic Rewards for Teaching, Research, and Service: Data and Discourse , 1999, Inf. Syst. Res..

[162]  Ruth French Strout The Development of the Catalog and Cataloging Codes , 1956, The Library Quarterly.