Ontology-Based Model and Procedure Creation for Topic Analysis in Chinese Language

This paper proposes a methodology for creating models and procedures for the functions and processes involved in topic analysis, with a focus on business documentation written in Chinese. Ontologies of different types are established and maintained to hold the annotated and evolved knowledge. Extraction, transformation, and loading (ETL) methods are adapted from approaches used to build data warehouses, yielding standardized data models that serve as the basis for a wide variety of analyses. Topic discovery is performed with Latent Dirichlet Allocation for different uses. An interactive tool is implemented to support the proposed design and to meet practical demands.
