Text Mining for Analysis of Interviews and Questionnaires

Interviews and questionnaires are the basis for collecting information about the opinions, concerns and needs of people. Analysis of those texts is crucial to understand the kansei of people. Text mining is an approach to discover useful and interesting patterns, knowledge and information from texts. This chapter contains two sections on text mining for beginners of it. The first section gives a brief survey of basic text mining techniques, such as keyword extraction, word graphs, clustering of texts and association rule mining. The second section demonstrates an example of text mining applied to interview analysis. Two text mining systems the concept graph system and the matrix search system are applied to analyze 2,409 remarks about products and services from 19 people. The analysis shows that text mining systems with a search function achieve interactive analysis of texts and an examination of various problems that we targeted. DOI: 10.4018/978-1-4666-2455-9.ch072

[1]  Xindong Wu,et al.  Keyword Extraction Based on Lexical Chains and Word Co-occurrence for Chinese News Web Pages , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[2]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[3]  M. Klemettinen,et al.  Applying Data Mining Techniques in Text Analysis , 1997 .

[4]  Erik G. Larsson,et al.  Optimizing Fault Tolerance for Multi-Processor System-on-Chip , 2010 .

[5]  Romaric Besançon,et al.  Text Mining, knowledge extraction from unstructured textual data , 1998 .

[6]  P. Sneath The application of computers to taxonomy. , 1957, Journal of general microbiology.

[7]  Yukio Ohsawa,et al.  KeyGraph: automatic indexing by co-occurrence graph based on building construction metaphor , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[8]  Robert R. Sokal,et al.  A statistical method for evaluating systematic relationships , 1958 .

[9]  Loïc Martínez,et al.  Blind User Interfacing: Requirements, Models and a Framework , 2012 .

[10]  Joanna Leng,et al.  Handbook of Research on Computational Science and Engineering: Theory and Practice , 2011 .

[11]  Chris D. Paice,et al.  Another stemmer , 1990, SIGF.

[12]  Makoto Iwayama,et al.  Topic Graph Generation for Query Navigation: Use of Frequency Classes for Topic Extraction , 1997, ArXiv.

[13]  Marcos Ruano-Mayoral,et al.  Partner Relationship Management: Semantic Extension of CRM Systems for the Partner Searching and Management in R&D Environments , 2012 .

[14]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[15]  Yukio Ohsawa,et al.  Modeling the Process of Chance Discovery by Chance Discovery on Double Helix , 2002 .

[16]  Hamid R. Nemati,et al.  Organizational Data Mining: Leveraging Enterprise Data Resources for Optimal Performance , 2003 .

[17]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[18]  Masao Utiyama,et al.  Selecting level-specific specialized vocabulary using statistical measures , 2006 .

[19]  Raymond J. Mooney,et al.  Mining Soft-Matching Rules from Textual Data , 2001, IJCAI.

[20]  Paul D. McNicholas,et al.  Standardising the lift of an association rule , 2008, Comput. Stat. Data Anal..

[21]  T. Sørensen,et al.  A method of establishing group of equal amplitude in plant sociobiology based on similarity of species content and its application to analyses of the vegetation on Danish commons , 1948 .

[22]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[23]  Yehuda Lindell,et al.  Text Mining at the Term Level , 1998, PKDD.

[24]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[25]  G. Andersson,et al.  Exploration of coping patterns in burned adults: cluster analysis of the coping with burns questionnaire (CBQ). , 2002, Burns : journal of the International Society for Burn Injuries.

[26]  Mitsuru Ishizuka,et al.  Keyword extraction from a single document using word co-occurrence statistical information , 2004, Int. J. Artif. Intell. Tools.

[27]  Heinrich Theodor Vierhaus,et al.  Design and Test Technology for Dependable Systems-on-Chip , 2010 .

[28]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[29]  Norihiro Hagita,et al.  Recognition of Degraded Machine-Printed Characters Using a Complementary Similarity Measure and Error-Correction Learning (Special Issue on Character Recognition and Document Understanding) , 1996 .

[30]  Sachio Hirokawa,et al.  Multiple Analysis of Remarks of Elderly and Disabled People by Text Mining , 2007 .

[31]  Sachio Hirokawa,et al.  Multiple Viewed Search Engine for an e-Journal - A Case Study on Zoological Science , 2007, HCI.

[32]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[33]  Karen Sparck Jones A statistical interpretation of term specificity and its application in retrieval , 1972 .

[34]  Toru Hisamitsu,et al.  Topic-Word Selection Based on Combinatorial Probability , 2001, NLPRS.

[35]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[36]  Sachio Hirokawa,et al.  Dynamic Thesaurus Construction from English-Japanese Dictionary , 2008, 2008 International Conference on Complex, Intelligent and Software Intensive Systems.

[37]  Hércules Antonio do Prado,et al.  Emerging Technologies of Text Mining: Techniques and Applications , 2007 .

[38]  Sharma Chakravarthy,et al.  A Single Pass Algorithm for Discovering Significant Intervals in Time-Series Data , 2007, Int. J. Data Warehous. Min..