The Effects and Limitations of Automated Text Condensing on Reading Comprehension Performance

The optimal amount of information needed in a given decision-making situation lies somewhere along a continuum from "not enough" to "too much". Ackoff proposed that information systems often hinder the decision-making process by creating information overload. To deal with this problem, he called for systems that could filter and condense data so that only relevant information reached the decision maker. The potential for information overload is especially critical in text-based information. The purpose of this research is to investigate the effects and theoretical limitations of extract condensing as a text processing tool in terms of recipient performance. In the experiment described here, an environment is created in which the effects of text condensing are isolated from the effects of message and individual recipient differences. The data show no difference in reading comprehension performance between the condensed forms and the original document. This indicates that condensed forms can be produced that are equally as informative as the original document. These results suggest that it is possible to apply a relatively simple computer algorithm to text and produce extracts that capture enough of the information contained in the original document so that the recipient can perform as if he or she had read the original. These results also identify a methodology for assessing the effectiveness of text condensing schemes. The research presented here contributes to a small but growing body of work on text-based information systems and, specifically, text condensing.

[1]  Edward T. Cremmins The Art of Abstracting. , 1982 .

[2]  Starr Roxanne Hiltz,et al.  Structuring computer-mediated communication systems to avoid information overload , 1985, CACM.

[3]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[4]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[5]  Samuel S. Epstein,et al.  Transportable natural language processing through simplicity—the PRE system , 1985, TOIS.

[6]  Ben H. Weil,et al.  Standards for writing abstracts , 1970 .

[7]  Don R. Swanson,et al.  Information Retrieval as a Trial-And-Error Process , 1977, The Library Quarterly.

[8]  George M. Kasper,et al.  The Effect of Presentation Media on Recipient Performance in Text-based Information Systems , 1988, J. Manag. Inf. Syst..

[9]  D. Kahneman Attention and Effort , 1973 .

[10]  Sara Kiesler,et al.  Social psychological aspects of computer-mediated communication , 1984 .

[11]  E. Burton Swanson,et al.  Document-Based Systems for Management Planing and Control: A Classification, Survey, and Assessment , 1978, MIS Q..

[12]  Ronald Schwartz,et al.  AMANDA: a computerized document management system , 1980 .

[13]  Phyllis B. Baxendale,et al.  Machine-Made Index for Technical Literature - An Experiment , 1958, IBM J. Res. Dev..

[14]  H. P. Edmundson,et al.  Problems in automatic abstracting , 1964, CACM.

[15]  James E. Rush,et al.  Improvement of automatic abstracts by the use of structural analysis , 1973, J. Am. Soc. Inf. Sci..

[16]  Cyril H.P. Brookes Text processing as a tool for DSS design , 1983 .

[17]  Norbert Wiener,et al.  Cybernetics, or control and communication in the animal and the machine, 2nd ed. , 1961 .

[18]  Jacob Slonim,et al.  NDX-100: An Electronic Filing Machine for the Office of the Future , 1981, Computer.

[19]  David C. Blair The data-document distinction in information retrieval , 1984, CACM.

[20]  Andrew Hunter Morris,et al.  Supporting environmental scanning and organizational communication with the processing of text: the use of computer-generated abstracts , 1988 .

[21]  H. P. Edmundson,et al.  Automatic abstracting and indexing—survey and recommendations , 1961, CACM.

[22]  Lance A. Miller,et al.  Text-critiquing with the EPISTLE system: an author's aid to better syntax , 1981, AFIPS '81.

[23]  Martin Chodorow,et al.  The EPISTLE Text-Critiquing System , 1982, IBM Syst. J..

[24]  Norman L. Chervany,et al.  An Experimental Evaluation of Information Overload in a Production Environment , 1974 .

[25]  Diana C. Reep,et al.  A Survey of the Frequency, Types, and Importance of Writing Tasks in Four Career Areas , 1990 .

[26]  Charles H. Kriebel,et al.  A Survey of the MIS and Telecommunications Activities of Major Business Firms , 1984, MIS Q..

[27]  Dennis Tsichritzis Message addressing schemes , 1984, TOIS.

[28]  P. H. Lindsay Human Information Processing , 1977 .

[29]  James C. Wetherbe,et al.  Key issues in information systems management , 1987 .

[30]  Judi Kesselman-Turkel,et al.  The magazine writer's handbook , 1982 .

[31]  Martin Dillon,et al.  FASIT: A fully automatic syntactically based indexing system , 1983, J. Am. Soc. Inf. Sci..

[32]  R. Daft,et al.  Chief executive scanning, environmental characteristics, and company performance: An empirical study , 1988 .

[33]  Harry H. Goode Greenhouses of Science for Management , 1958 .

[34]  H. P. Edmundson,et al.  Mathematical models of text , 1984, Inf. Process. Manag..

[35]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[36]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[37]  Harold Borko,et al.  Abstracting Concepts and Methods , 1975 .

[38]  J. Licklider,et al.  Long-range constraints in the statistical structure of printed English. , 1955, The American journal of psychology.

[39]  Omar El Sawy,et al.  ICIS Paper: Personal Information Systems for Strategic Scanning in Turbulent Environments: Can the CEO Go On-Line? , 1985, MIS Q..

[40]  Alan F. Smeaton,et al.  Information retrieval in an office filing facility and future work in project minstrel , 1986, Inf. Process. Manag..

[41]  D. C. Blair,et al.  Language and Representation in Information Retrieval , 1990 .

[42]  Gordon B. Davis,et al.  Management information systems : conceptual foundations, structure, and development , 1985 .

[43]  George M. Kasper,et al.  A knowledge exchange architecture for collaborative human-computer communication , 1991, IEEE Trans. Syst. Man Cybern..

[44]  Chris D. Paice,et al.  The automatic generation of literature abstracts: an approach based on the identification of self-indicating phrases , 1980, SIGIR '80.

[45]  James E. Rush,et al.  Automatic abstracting and indexing. II. Production of indicative abstracts by application of contextual inference and syntactic coherence criteria , 1971 .

[46]  Hans Peter Luhn,et al.  A Business Intelligence System , 1958, IBM J. Res. Dev..

[47]  James H. Bair,et al.  Human communication needs and organizational productivity: The potential impact of office automation , 1983, J. Am. Soc. Inf. Sci..

[48]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[49]  R. Gunning The Technique of Clear Writing. , 1968 .

[50]  Dennis A. Adams,et al.  Organizational connectivity systems: is function being effectively managed? , 1989, DATB.

[51]  Kweku Ewusi-Mensah,et al.  The external organizational environment and its impact on management information systems , 1981 .

[52]  Russell L. Ackoff,et al.  Management misinformation systems , 1967 .

[53]  Lance A. Miller,et al.  Project EPISTLE: A System for the Automatic Analysis of Business Correspondence , 1980, AAAI.

[54]  Antonio Zamora,et al.  Automatic Abstracting Research at Chemical Abstracts Service , 1975, J. Chem. Inf. Comput. Sci..

[55]  Claude E. Shannon,et al.  Prediction and Entropy of Printed English , 1951 .

[56]  Lisa F. Rau,et al.  SCISOR: extracting information from on-line news , 1990, CACM.

[57]  Michael W. Kibby,et al.  Intersentential Processes in Reading Comprehension , 1980 .