What Makes It Difficult to Understand a Scientific Literature?

In the artificial intelligence area, one of the ultimate goals is to make computers understand human language and offer assistance. In order to achieve this ideal, researchers of computer science have put forward a lot of models and algorithms attempting at enabling the machine to analyze and process human natural language on different levels of semantics. Although recent progress in this field offers much hope, we still have to ask whether current research can provide assistance that people really desire in reading and comprehension. To this end, we conducted a reading comprehension test on two scientific papers which are written in different styles. We use the semantic link models to analyze the understanding obstacles that people will face in the process of reading and figure out what makes it difficult for human to understand a scientific literature. Through such analysis, we summarized some characteristics and problems which are reflected by people with different levels of knowledge on the comprehension of difficult science and technology literature, which can be modelled in semantic link network. We believe that these characteristics and problems will help us re-examine the existing machine models and are helpful in the designing of new one.

[1]  Christopher Potts,et al.  Recursive Neural Networks Can Learn Logical Semantics , 2014, CVSC.

[2]  Hai Zhuge,et al.  Communities and Emerging Semantics in Semantic Link Network: Discovery and Learning , 2009, IEEE Transactions on Knowledge and Data Engineering.

[3]  Phil Blunsom,et al.  Teaching Machines to Read and Comprehend , 2015, NIPS.

[4]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[5]  Arthur C. Graesser,et al.  The psychology of science text comprehension , 2014 .

[6]  Yue Zhang,et al.  Word Segmentation for Chinese Novels , 2015, AAAI.

[7]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[8]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[9]  Renée J. Miller,et al.  A framework for semantic link discovery over relational data , 2009, CIKM.

[10]  Mario Cannataro,et al.  The knowledge grid , 2003, CACM.

[11]  Cícero Nogueira dos Santos,et al.  Semantic Role Labeling , 2012 .

[12]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[13]  John McCarthy,et al.  A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence, August 31, 1955 , 2006, AI Mag..

[14]  The Grammatical Analysis of Sentences , 2011 .

[15]  Sameer Singh,et al.  Injecting Logical Background Knowledge into Embeddings for Relation Extraction , 2015, NAACL.

[16]  Hai Zhuge,et al.  Semantic linking through spaces for cyber-physical-socio intelligence: A methodology , 2011, Artif. Intell..

[17]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[18]  Hai Zhuge,et al.  Interactive semantics , 2010, Artif. Intell..

[19]  John McCarthy,et al.  The Inversion of Functions Defined by Turing Machines , 1956 .

[20]  K. Nandhini,et al.  Improving readability through extractive summarization for learners with reading difficulties , 2013 .

[21]  Hinrich Schütze,et al.  FLORS: Fast and Simple Domain Adaptation for Part-of-Speech Tagging , 2014, TACL.