From paragraph networks to document networks

We investigate hypertext repositories with automatically generated hyperlinks among paragraphs. The algorithms that generate the links yield results with significant common characteristics despite large differences among the algorithms. Furthermore, different repositories, generated by the same algorithms show significant common characteristics, although generated from the return lists of different search engines. Finally, repositories generated for different domains also have common characteristics. This suggests that these characteristics are pervasive properties of judiciously retrieved document sets. The common characteristics revolve around a tendency for some documents (the kernel) to be destinations of a disproportionately high fraction of the hyperlinks in the repository. An emergent property of such repositories with practical significance is that browsing activities will have a tendency to trap the user within the kernel, perhaps without their realizing it. To enable users to choose to avoid trapping, we propose a ring structure for hypertext repositories which is exposed to the user by annotations to the hyperlinks.

[1]  Loren G. Terveen,et al.  Constructing, organizing, and visualizing collections of topically related Web resources , 1999, TCHI.

[2]  D. Berleant,et al.  Customizing information. 1. Getting what we need, when we need it , 1994, Computer.

[3]  George Kingsley Zipf,et al.  The Repetition of Words, Time-Perspective, and Semantic Balance , 1945 .

[4]  James Mayfield,et al.  Indexing Using Both N-Grams and Words , 1998, TREC.

[5]  Daniel Berleant,et al.  Toward dialogues with documents: MultiBrowser , 2004, International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004..

[6]  M Damashek,et al.  Gauging Similarity with n-Grams: Language-Independent Categorization of Text , 1995, Science.

[7]  S. A. Bly,et al.  A comparison of tiled and overlapping windows , 1986, CHI '86.

[8]  P. David Stotts,et al.  Petri-net-based hypertext: document structure with browsing semantics , 1989, TOIS.

[9]  Eser Kandogan,et al.  Elastic Windows: evaluation of multi-window operations , 1997, CHI.

[10]  George Kingsley Zipf,et al.  The Psychobiology of Language , 2022 .

[11]  Ben Shneiderman,et al.  Sorting out searching: a user-interface framework for text searches , 1998, CACM.

[12]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[13]  Ben Shneiderman,et al.  Designing to Facilitate Browsing: A Look Back at the Hyperties Workstation Browser , 1991, Hypermedia.

[14]  George W. Furnas,et al.  Effective view navigation , 1997, CHI.

[15]  John Scott Social Network Analysis , 1988 .

[16]  Yoshinori Hara,et al.  Hypermedia research at C&C research labs, NEC USA , 1997, CHI Extended Abstracts.

[17]  Stuart K. Card,et al.  Information foraging in information access environments , 1995, CHI '95.

[18]  Frank G. Halasz,et al.  Reflections on NoteCards: seven issues for the next generation of hypermedia systems , 1987, Hypertext.

[19]  G. Halasz Frank,et al.  Reflections on NoteCards: seven issues for the next generation of hypermedia systems , 1987, CACM.

[20]  G. Zipf The meaning-frequency relationship of words. , 1945, The Journal of general psychology.

[21]  Saul Greenberg,et al.  Revisitation patterns in World Wide Web navigation , 1997, CHI.

[22]  D. Berleant,et al.  Customizing information. 2. How successful are we so far? , 1994, Computer.

[23]  Eser Kandogan,et al.  Elastic Windows: a hierarchical multi-window World-Wide Web browser , 1997, UIST '97.

[24]  R. Schiffer Psychobiology of Language , 1986 .

[25]  Mark A. Foltz Designing navigable information spaces , 1998 .