An Ontology-Based Text-Mining Method to Cluster Proposals for Research Project Selection

Research project selection is an important task for government and private research funding agencies. When a large number of research proposals are received, it is common to group them according to their similarities in research disciplines. The grouped proposals are then assigned to the appropriate experts for peer review. Current methods for grouping proposals are based on manual matching of similar research discipline areas and/or keywords. However, the exact research discipline areas of the proposals cannot often be accurately designated by the applicants due to their subjective views and possible misinterpretations. Therefore, rich information in the proposals' full text can be used effectively. Text-mining methods have been proposed to solve the problem by automatically classifying text documents, mainly in English. However, these methods have limitations when dealing with non-English language texts, e.g., Chinese research proposals. This paper presents a novel ontology-based text-mining approach to cluster research proposals based on their similarities in research areas. The method is efficient and effective for clustering research proposals with both English and Chinese texts. The method also includes an optimization model that considers applicants' characteristics for balancing proposals by geographical regions. The proposed method is tested and validated based on the selection process at the National Natural Science Foundation of China. The results can also be used to improve the efficiency and effectiveness of research project selection processes in other government and private research funding agencies.

[1]  Dieter Fensel,et al.  Ontologies: A silver bullet for knowledge management and electronic commerce , 2002 .

[2]  Liana Razmerita An Ontology-Based Framework for Modeling User Behavior—A Case Study in Knowledge Management , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[3]  Jian Ma,et al.  A multilingual ontology framework for R&D project management systems , 2010, Expert Syst. Appl..

[4]  Qiong Zhang,et al.  The Smart Architect: Scalable Ontology-Based Modeling of Ancient Chinese Architectures , 2008, IEEE Intelligent Systems.

[5]  Hsin-Chang Yang,et al.  A text mining approach for automatic construction of hypertexts , 2005, Expert Syst. Appl..

[6]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[7]  Jun Wang,et al.  A Hybrid Knowledge and Model Approach for Reviewer Assignment , 2007, 2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07).

[8]  Douglas J. Morrice,et al.  A Multiple Attribute Utility Theory Approach to Ranking and Selection , 2001, Manag. Sci..

[9]  Weiguo Fan,et al.  An integrated two-stage model for intelligent information routing , 2006, Decis. Support Syst..

[10]  Amy J. C. Trappey,et al.  A Fuzzy Ontological Knowledge Document Clustering Methodology , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[11]  Prabir Bhattacharya,et al.  A fuzzy-logic-based approach to project selection , 2000, IEEE Trans. Engineering Management.

[12]  Bin Zhu,et al.  Newsmap: a knowledge map for online news , 2005, Decis. Support Syst..

[13]  Jian Ma,et al.  Decision support for proposal grouping: A hybrid approach using knowledge rule and genetic algorithm , 2009, Expert Syst. Appl..

[14]  Norman P. Archer,et al.  Project portfolio selection through decision support , 2000, Decis. Support Syst..

[15]  John W. Fowler,et al.  A hybrid approach using the analytic hierarchy process and integer programming to screen weapon systems projects , 2003, IEEE Trans. Engineering Management.

[16]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[17]  Christoph H. Loch,et al.  Dynamic Portfolio Selection of NPD Programs Using Marginal Returns , 2002, Manag. Sci..

[18]  Yongtae Park,et al.  R&D proposal screening system based on text-mining approach , 2006 .

[19]  Boaz Golany,et al.  Optimal Allocation of Proposals to Reviewers to Facilitate Effective Ranking , 2005, Manag. Sci..

[20]  K. Zhang,et al.  ManuHub: A Semantic Web System for Ontology-Based Service Management in Distributed Manufacturing Environments , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[21]  Hsin-Chang Yang,et al.  A method for multilingual text mining and retrieval using growing hierarchical self-organizing maps , 2009, J. Inf. Sci..

[22]  Xiaohua Hu,et al.  Exploiting the Social Tagging Network for Web Clustering , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[23]  Chih-Ping Wei,et al.  Discovering Event Evolution Patterns From Document Sequences , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[24]  Jun Wang,et al.  A Group Decision Support Approach to Evaluate Experts for R&D Project Selection , 2008, IEEE Transactions on Engineering Management.

[25]  Michael J. Pazzani,et al.  Mining for proposal reviewers: lessons learned at the national science foundation , 2006, KDD '06.

[26]  Huan-Chao Keh,et al.  The Chinese text categorization system with association rule and category priority , 2008, Expert Syst. Appl..

[27]  Zhengding Lu,et al.  A Chinese word segmentation based on language situation in processing ambiguous words , 2004, Inf. Sci..

[28]  Xindong Wu,et al.  Ontology-Based Business Process Customization for Composite Web Services , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[29]  Karl T. Ulrich,et al.  Valuing R&D Projects in a Portfolio: Evidence from the Pharmaceutical Industry , 2007, Manag. Sci..

[30]  Yong Liu,et al.  Modeling Complex Architectures Based on Granular Computing on Ontology , 2010, IEEE Transactions on Fuzzy Systems.

[31]  Adrien Presley,et al.  R&D project selection using the analytic network process , 2002, IEEE Trans. Engineering Management.

[32]  Chih-Ping Wei,et al.  A Clustering-Based Approach for Integrating Document-Category Hierarchies , 2008, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[33]  Narasimhaiah Gorla,et al.  Information system project selection using fuzzy logic , 1998, IEEE Trans. Syst. Man Cybern. Part A.

[34]  Jian Ma,et al.  A hybrid knowledge and model system for R&D project selection , 2002, Expert Syst. Appl..

[35]  Manu Konchady Text Mining Application Programming , 2006 .

[36]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[37]  Ronen Feldman,et al.  Book Reviews: The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data by Ronen Feldman and James Sanger , 2008, CL.

[38]  Xiaolong Wang,et al.  ConSOM: A conceptional self-organizing map model for text clustering , 2008, Neurocomputing.

[39]  Ron Chi-Wai Kwok,et al.  An organizational decision support system for effective R&D project selection , 2005, Decis. Support Syst..

[40]  Il Hong Suh,et al.  Ontology-Based Unified Robot Knowledge for Service Robots in Indoor Environments , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[41]  Maria Vargas-Vera,et al.  Multiagent Ontology Mapping Framework for the Semantic Web , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[42]  Anil Arya,et al.  Project Assignments When Budget Padding Taints Resource Allocation , 2006, Manag. Sci..

[43]  Nada Lavrac,et al.  An Ontology for Virtual Organization Breeding Environments , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[44]  A. D. Henriksen,et al.  A practical R&D project-selection scoring tool , 1999 .

[45]  Ivica Kostanic,et al.  Principles of Neurocomputing for Science and Engineering , 2000 .

[46]  Dongsong Zhang,et al.  An Ontology-Supported Misinformation Model: Toward a Digital Misinformation Library , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.