Automating hierarchical document classification for construction management information systems

The widespread use of information technologies for construction is considerably increasing the number of electronic text documents stored in construction management information systems. Consequently, automated methods for organizing and improving the access to the information contained in these types of documents become essential to construction information management. This paper describes a methodology developed to improve information organization and access in construction management information systems based on automatic hierarchical classification of construction project documents according to project components. A prototype system for document classification is presented, as well as the experiments conducted to verify the feasibility of the proposed approach.

[1]  Renate Fruchter,et al.  A/E/C Teamwork: A Collaborative Design and Learning Space , 1999 .

[2]  Chimay J. Anumba,et al.  A Taxonomy for Communication Facets in Concurrent Life‐Cycle Design and Construction , 1999 .

[3]  Raimar J. Scherer,et al.  Retrieval of Project Knowledge from Heterogeneous AEC Documents , 2000 .

[4]  Thomas Froese,et al.  INTEGRATING HETEROGENEOUS DATA REPRESENTATIONS IN MODEL-BASED AEC/FM SYSTEMS , 2000 .

[5]  Lucio Soibelman,et al.  Generating Construction Knowledge with Knowledge Discovery in Databases , 2000 .

[6]  Eric Brill,et al.  Text Classification in USENET Newsgroups: A Progress Report , 1996 .

[7]  Eddy M. Rojas,et al.  WEB-CENTRIC SYSTEMS: A NEW PARADIGM FOR COLLABORATIVE ENGINEERING , 1999 .

[8]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[9]  Lachmi Khemlani Building Product Models: Computer Environments Supporting Design and Construction , 2002 .

[10]  Yacine Rezgui,et al.  An information management model for concurrent construction engineering , 1996 .

[11]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[12]  Jiawei Han,et al.  AUTOMATED CLASSIFICATION OF CONSTRUCTION PROJECT DOCUMENTS , 2002 .

[13]  Yacine Rezgui,et al.  Open System for Inter-enterprise Information Management in Dynamic Virtual Environments , 1999 .

[14]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[15]  Boyd C. Paulson,et al.  Adaptability of information classification systems for civil works , 1997 .

[16]  Maria C. Yang,et al.  Data Mining for Thesaurus Generation in Informal Design Information Retrieval , 1998 .

[17]  Mary Lou Maher,et al.  Ontology-Based Multimedia Data Mining for Design Information Retrieval , 1998 .

[18]  Yimin Zhu,et al.  Web-Based Construction Document Processing via Malleable Frame , 2001 .

[19]  William J. O'Brien IMPLEMENTATION ISSUES IN PROJECT WEB SITES: A PRACTIONER'S VIEWPOINT , 2000 .

[20]  A Zarli,et al.  A survey of internet-oriented technologies for document-driven applications in construction open dynamic virtual environments , 2000 .

[21]  Martin Fischer,et al.  The Circle: Architecture for Integrating Software , 1995 .

[22]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[23]  J FenvesS,et al.  A WWW-based regulation broker , 1996 .

[24]  Lucio Soibelman,et al.  DISTRIBUTED MULTI-REASONING MECHANISM TO SUPPORT CONCEPTUAL STRUCTURAL DESIGN , 2000 .

[25]  William H. Wood The Development of Modes in Textual Design Data , 2000 .

[26]  Boyd C. Paulson,et al.  INFORMATION CLASSIFICATION FOR CIVIL ENGINEERING PROJECTS BY UNICLASS , 2000 .

[27]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.