Measuring Algebraic Complexity of Text Understanding Based on Human Concept Learning

This paper advocates for a novel approach to recommend texts at various levels of difficulties based on a proposed method, the algebraic complexity of texts (ACT). Different from traditional complexity measures that mainly focus on surface features like the numbers of syllables per word, characters per word, or words per sentence, ACT draws from the perspective of human concept learning, which can reflect the complex semantic relations inside texts. To cope with the high cost of measuring ACT, the Degree-2 Hypothesis of ACT is proposed to reduce the measurement from unrestricted dimensions to three dimensions. Based on the principle of “mental anchor,” an extension of ACT and its general edition [denoted as extension of text algebraic complexity (EACT) and general extension of text algebraic complexity (GEACT)] are developed, which take keywords' and association rules' complexities into account. Finally, using the scores given by humans as a benchmark, we compare our proposed methods with linguistic models. The experimental results show the order GEACT>EACT>ACT> Linguistic models, which means GEACT performs the best, while linguistic models perform the worst. Additionally, GEACT with lower convex functions has the best ability in measuring the algebraic complexities of text understanding. It may also indicate that the human complexity curve tends to be a curve like lower convex function rather than linear functions.

[1]  Jia Wang,et al.  Recommendation in Internet Forums and Blogs , 2010, ACL.

[2]  Rynson W. H. Lau,et al.  Guest Editors' Introduction: Emerging Internet Technologies for E-Learning , 2009, IEEE Internet Computing.

[3]  Peter Dolog,et al.  An environment for flexible advanced compensations of Web service transactions , 2008, TWEB.

[4]  Setsuo Tsuruta,et al.  Toward Reducing Human Involvement in Validation of Knowledge-Based Systems , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[5]  D. McNamara,et al.  Assessing Text Readability Using Cognitively Based Indices , 2008 .

[6]  R. P. Fishburne,et al.  Derivation of New Readability Formulas (Automated Readability Index, Fog Count and Flesch Reading Ease Formula) for Navy Enlisted Personnel , 1975 .

[7]  G. Harry McLaughlin,et al.  SMOG Grading - A New Readability Formula. , 1969 .

[8]  Xiangfeng Luo,et al.  Evaluating the capability of text understanding based on cognitive principle , 2008, 2008 International Conference on Audio, Language and Image Processing.

[9]  E. Rosch,et al.  Family resemblances: Studies in the internal structure of categories , 1975, Cognitive Psychology.

[10]  J. Feldman A catalog of Boolean concepts , 2003 .

[11]  R. Gunning The Technique of Clear Writing. , 1968 .

[12]  Yingxu Wang,et al.  Special Issue on Cybernetics and Cognitive Informatics , 2009, IEEE Trans. Syst. Man Cybern. Part B.

[13]  J. Feldman An algebra of human concept learning , 2006 .

[14]  Ryszard S. Michalski,et al.  Constraints and Preferences in Inductive Learning: An Experimental Study of Human and Machine Performance , 1987 .

[15]  Yingxu Wang,et al.  Contemporary Cybernetics and Its Facets of Cognitive Informatics and Computational Intelligence , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16]  Yingxu Wang,et al.  A new measure of software complexity based on cognitive weights , 2003 .

[17]  K. Galotti Cognitive psychology in and out of the laboratory , 1993 .

[18]  M. Coleman,et al.  A computer readability formula designed for machine scoring. , 1975 .

[19]  Matteo Gaeta,et al.  Ontology Extraction for Knowledge Reuse: The e-Learning Perspective , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[20]  Qing Li,et al.  Research of E-Learning Intelligent Affective Model Based on BDI Agent with Learning Materials , 2011, CSISE.

[21]  Chen Zhang,et al.  News Recommendation in Forum-Based Social Media , 2010, AAAI.

[22]  Arthur C. Graesser,et al.  Coh-Metrix: Analysis of text on cohesion and language , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[23]  Xue Chen,et al.  Building Association Link Network for Semantic Link on Web Resources , 2011, IEEE Transactions on Automation Science and Engineering.

[24]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[25]  Xiangfeng Luo,et al.  Evaluating text understanding based on context , 2008, 2008 7th IEEE International Conference on Cognitive Informatics.

[26]  Gang Chen,et al.  Continuous nearest-neighbor search in the presence of obstacles , 2011, TODS.

[27]  Ani Nenkova,et al.  Revisiting Readability: A Unified Framework for Predicting Text Quality , 2008, EMNLP.

[28]  Ryszard S. Michalski,et al.  Constraints and Preferences in Inductive Learning: An Experimental Study of Human and Machine Performance , 1987, Cogn. Sci..

[29]  F. Lamberti,et al.  A Relation-Based Page Rank Algorithm for Semantic Web Search Engines , 2009, IEEE Transactions on Knowledge and Data Engineering.

[30]  Ingo Wegener,et al.  The complexity of Boolean functions , 1987 .

[31]  Jun Zhang,et al.  Measurement of textual complexity based on categorical invariance in human concept learning , 2011, IEEE 10th International Conference on Cognitive Informatics and Cognitive Computing (ICCI-CC'11).

[32]  Christos D. Zaroliagis,et al.  A Collaborative Decentralized Approach to Web Search , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[33]  Jun Zhang,et al.  Power Series Representation Model of Text Knowledge Based on Human Concept Learning , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[34]  E. Leung,et al.  Emerging Internet Technologies for E-Learning , 2009 .

[35]  Xiangfeng Luo,et al.  Measuring Textual Context Based on Cognitive Principles , 2009, Int. J. Softw. Sci. Comput. Intell..

[36]  Rynson W. H. Lau,et al.  An Adaptive Multiresolution Method for Progressive Model Transmission , 2001, Presence: Teleoperators & Virtual Environments.

[37]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[38]  Chi-Yin Chow,et al.  Efficient Evaluation of k-NN Queries Using Spatial Mashups , 2011, SSTD.

[39]  R. Flesch A new readability yardstick. , 1948, The Journal of applied psychology.

[40]  Jacob Feldman,et al.  Minimization of Boolean complexity in human concept learning , 2000, Nature.

[41]  Donald D. Givone,et al.  Introduction to switching circuit theory , 1970 .

[42]  Luo Si,et al.  A statistical model for scientific readability , 2001, CIKM '01.

[43]  Susan R. Goldman,et al.  Memory for embedded and sequential story structures , 1986 .

[44]  Wolfgang Nejdl,et al.  A hybrid approach for efficient Web service composition with end-to-end QoS constraints , 2012, TWEB.