Editorial: Evaluating knowledge engineering techniques

We argue below that knowledge engineers (KEs) use a range of techniques, and more are constantly being developed. How sure are we that any current KE technique works at all, or works better than past practice? Currently, this basic question has no clear answer. It is not easy to evaluate software development techniques, be they from software engineering or knowledge engineering (see the list of problems in [Shadbolt, O’Hara & Crow 2000]). However, many successful evaluations have been performed, as evidenced by numerous KE case studies (Figure 1) and the wealth of material on KE evaluation (Figure 2). In our experience, the core obstacle to performing evaluations is a concern that the whole principle of evaluation is misguided. KE researchers are often reluctant to design and perform rigorous evaluations, asking “what is the general point of such studies?” That is, how can results from one study be relevant to anything else? For detailed surveys of evaluation techniques, see [Fenton 1991, Cohen 1995].

[1]  Tim Menzies,et al.  Evaluating a Qualitative Reasoner , 1997, Australian Joint Conference on Artificial Intelligence.

[2]  Bob J. Wielinga,et al.  Using explicit ontologies in KBS development , 1997, Int. J. Hum. Comput. Stud..

[3]  Hector J. Levesque,et al.  An Essential Hybrid Reasoning System: Knowledge and Symbol Level Accounts of KRYPTON , 1985, IJCAI.

[4]  Marvin Minsky,et al.  A framework for representing knowledge , 1974 .

[5]  Robert L. Glass,et al.  Science and substance: a challenge to software engineers , 1994, IEEE Software.

[6]  Nigel Shadbolt,et al.  Models Exposed: An Empirical Study , 1995 .

[7]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[8]  Casimir A. Kulikowski,et al.  A Model-Based Method for Computer-Aided Medical Decision-Making , 1978, Artif. Intell..

[9]  John McDermott,et al.  VT: an expert elevator designer that uses knowledge-based backtracking , 1992 .

[10]  Eleni Stroulia,et al.  Evaluating PSMs in evolutionary design: the AUTOGNOSTIC experiments , 1999, Int. J. Hum. Comput. Stud..

[11]  Penelope M Sanderson,et al.  State-space and verbal protocol methods for studying the human operator in process control. , 1989, Ergonomics.

[12]  Gregg R. Yost Implementing the Sisyphus-93 task using Soar/TAQL , 1996, Int. J. Hum. Comput. Stud..

[13]  Ralph Johnson,et al.  Design Patterns: Elements of Reusable Object-Oriented Software , 2019 .

[14]  John P. McDermott,et al.  Easy programming: empowering people to build their own applications , 1992, IEEE Expert.

[15]  Tim Menzies On the Practicality of Abductive Validation , 1996, ECAI.

[16]  Ramanathan V. Guha,et al.  CYC: A Midterm Report , 1990, AI Mag..

[17]  Luc Steels,et al.  Components of Expertise , 1990, AI Mag..

[18]  Yolanda Gil,et al.  Explicit Representations of Problem-Solving Strategies to Support Knowledge Acquisition , 1996, AAAI/IAAI, Vol. 1.

[19]  William J. Clancey,et al.  Model Construction Operators , 1992, Artif. Intell..

[20]  Tim Menzies,et al.  Critical success metrics: evaluation at the business level , 1999, Int. J. Hum. Comput. Stud..

[21]  David Wright,et al.  Some Conservative Stopping Rules for the Operational Testing of Safety-Critical Software , 1997, IEEE Trans. Software Eng..

[22]  James M. Crawford,et al.  Experimental Results on the Application of Satisfiability Algorithms to Scheduling Problems , 1994, AAAI.

[23]  John P. McDermott,et al.  SALT: A Knowledge Acquisition Language for Propose-and-Revise Systems , 1993, Artif. Intell..

[24]  Yuval Shahar,et al.  Task Modeling with Reusable Problem-Solving Methods , 1995, Artif. Intell..

[25]  Robert A. Kowalski,et al.  The early years of logic programming , 1988, CACM.

[26]  Michael Uschold,et al.  Ontologies: principles, methods and applications , 1996, The Knowledge Engineering Review.

[27]  A. T. Schreiber,et al.  The Sisyphus-VT initiative , 1996 .

[28]  Paul Compton,et al.  A 2000 Rule Expert System Without Knowledge Engineers , 1993 .

[29]  Victor R. Basili,et al.  Experimental Software Engineering Issues: Critical Assessment and Future Directions , 1993, Lecture Notes in Computer Science.

[30]  Nils J. Nilsson,et al.  Logic and Artificial Intelligence , 1991, Artif. Intell..

[31]  Paul R. Cohen,et al.  A Survey of the Eighth National Conference on Artificial Intelligence: Pulling Together or Pulling Apart? , 1991, AI Mag..

[32]  B. Chandrasekaran,et al.  Towards a Taxonomy of Problem Solving Types , 1983, AI Mag..

[33]  Allen Newell,et al.  A Problem Space Approach to Expert System Specification , 1989, IJCAI.

[34]  William J. Clancey,et al.  Heuristic Classification , 1986, Artif. Intell..

[35]  Todd R. Johnson,et al.  Task-structure analysis for knowledge modeling , 1992, CACM.

[36]  Marc Linster,et al.  A Review of Sisyphus 91 & 92: Models of Problem-Solving Knowledge , 1993, EKAW.

[37]  Timothy W. Finin,et al.  Enabling Technology for Knowledge Sharing , 1991, AI Mag..

[38]  Nigel Shadbolt,et al.  The experimental evaluation of knowledge acquisition techniques and methods: history, problems and new directions , 1999, Int. J. Hum. Comput. Stud..

[39]  P. Compton,et al.  A philosophical basis for knowledge acquisition , 1990 .

[40]  Carole D. Hafner,et al.  The State of the Art in Ontology Design: A Survey and Comparative Review , 1997, AI Mag..

[41]  Victor R. Basili,et al.  The Experimental Paradigm in Software Engineering , 1992, Experimental Software Engineering Issues.

[42]  W. Chase,et al.  Visual information processing. , 1974 .

[43]  V. R. Benjamins,et al.  Problem-Solving Methods for Diagnosis and their Role in Knowledge Acquisition , 1996 .

[44]  Allen Newell,et al.  A Universal Weak Method: Summary of Results , 1983, IJCAI.

[45]  R. A. Brooks,et al.  Intelligence without Representation , 1991, Artif. Intell..

[46]  Alistair Sutcliffe,et al.  Domain Knowledge for Interactive System Design , 1996, IFIP Advances in Information and Communication Technology.

[47]  Bob J. Wielinga,et al.  CommonKADS: a comprehensive methodology for KBS development , 1994, IEEE Expert.

[48]  John Mylopoulos,et al.  Readings in Artificial Intelligence and Databases , 1988 .

[49]  Bob J. Wielinga,et al.  KADS: a modelling approach to knowledge engineering , 1992 .

[50]  Lawrence M. Fagan,et al.  Antimicrobial selection by a computer. A blinded evaluation by infectious diseases experts. , 1979, JAMA.

[51]  Intelligent Testing can be Very Lazy , 2000 .

[52]  Marvin Minsky,et al.  A framework for representing knowledge, in The Psychology of Computer Vision , 1975 .

[53]  Caroline C. Hayes,et al.  QUEM: An Achievement Test for Knowledge-Based Systems , 1997, IEEE Trans. Knowl. Data Eng..

[54]  Charles L. Forgy,et al.  Rete: a fast algorithm for the many pattern/many object pattern match problem , 1991 .

[55]  Juan Pedro Caraça-Valente,et al.  Knowledge-based systems' validation: when to stop running test cases , 1999, Int. J. Hum. Comput. Stud..

[56]  Raymond Reiter,et al.  On Inheritance Hierarchies With Exceptions , 1983, AAAI.

[57]  Kunkel Jm,et al.  Spontaneous subclavian vein thrombosis: a successful combined approach of local thrombolytic therapy followed by first rib resection. , 1989 .

[58]  Gregg R. Yost TAQL: a problem space tool for expert system development , 1992 .

[59]  Kim J. Vicente,et al.  Supporting operator problem solving through ecological interface design , 1995, IEEE Trans. Syst. Man Cybern..

[60]  Shari Lawrence Pfleeger,et al.  Software Metrics , 1991 .

[61]  Yoram Reich,et al.  Measuring the value of knowledge , 1995, Int. J. Hum. Comput. Stud..

[62]  Yolanda Gil,et al.  Flexible Knowledge Acquisition Through Explicit Representation of Knowledge Roles , 1996 .

[63]  Masahiro Hori,et al.  Empirical evaluation of a domain-oriented component library based on an embedded case study design , 1999, Int. J. Hum. Comput. Stud..

[64]  Tim Menzies OO Patterns: Lessons from Expert Systems , 1997 .

[65]  Tim Menzies Evaluation Issues for Problem Solving Methods , 1998 .

[66]  Robert M. O'Keefe,et al.  The Effect of Knowledge Representation Schemes on Maintainability of Knowledge-Based Systems , 1996, IEEE Trans. Knowl. Data Eng..

[67]  Philip Klahr,et al.  Evaluation of expert systems: issues and case studies , 1983 .

[68]  Paul R. Cohen,et al.  The DARPA High-Performance Knowledge Bases Project , 1998, AI Mag..

[69]  Gary S. Kahn,et al.  Strategies for Knowledge Acquisition , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[70]  Paul R. Cohen,et al.  Does Prior Knowledge Facilitate the Development of Knowledge-based Systems? , 1999, AAAI/IAAI.

[71]  S.J.J. Smith,et al.  Empirical Methods for Artificial Intelligence , 1995 .

[72]  Terry Winograd,et al.  FRAME REPRESENTATIONS AND THE DECLARATIVE/PROCEDURAL CONTROVERSY , 1975 .

[73]  Frederick Hayes-Roth,et al.  Building expert systems , 1983, Advanced book program.

[74]  Lawrence Birnbaum,et al.  Rigor Mortis: A Response to Nilsson's "Logic and Artificial Intelligence" , 1991, Artif. Intell..

[75]  Brian R. Gaines,et al.  Knowledge acquisition for knowledge-based systems , 1991, IEEE Expert.

[76]  Tim Menzies Object-oriented patterns: lessons from expert systems , 1997 .

[77]  Hector J. Levesque,et al.  A New Method for Solving Hard Satisfiability Problems , 1992, AAAI.

[78]  Gheorghe Tecuci Building Intelligent Agents: An Apprenticeship, Multistrategy Learning Theory, Methodology, Tool and Case Studies , 1998 .

[79]  Tim Menzies,et al.  hQkb- The High Quality Knowledge Base Initiative (Sisyphus V: Learning Design Assessment Knowledge) , 1999 .