Genre Classification of Web Pages

Genre classification means to discriminate between documents bymeans of their form, their style, or their targeted audience. Put another way, genre classification is orthogonal to a classification based on the documents’ contents.

[1]  Kevin Crowston,et al.  Genre based navigation on the Web , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[2]  Aidan Finn,et al.  Learning to classify documents according to genre , 2006, J. Assoc. Inf. Sci. Technol..

[3]  Johanna D. Moore,et al.  Proceedings of the Conference on Human Factors in Computing Systems , 1989 .

[4]  Douglas Douglas,et al.  The multi-dimensional approach to linguistic analyses of genre variation: An overview of methodology and findings , 1992, Comput. Humanit..

[5]  Benno Stein,et al.  Genre classification of Web pages user study and feasibility analysis , 2004 .

[6]  Hinrich Schütze,et al.  Automatic Detection of Text Genre , 1997, ACL.

[7]  Carol Van Ess-Dykema,et al.  The Form is the Substance: Classification of Genres in Text , 2001, HTLKM@ACL.

[8]  Benno Stein,et al.  Distinguishing Topic from Genre , 2004 .

[9]  Georg Rehm Towards Automatic Web Genre Identification , 2002, HICSS.

[10]  Kevin Crowston,et al.  The effects of linking on genres of Web documents , 1999, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[11]  Efstathios Stamatatos,et al.  Text Genre Detection Using Common Word Frequencies , 2000, COLING.

[12]  Jussi Karlgren,et al.  Recognizing Text Genres With Simple Metrics Using Discriminant Analysis , 1994, COLING.

[13]  Adele E. Howe,et al.  Effects of web document evolution on genre classification , 2005, CIKM '05.

[14]  Jussi Karlgren,et al.  Web-Specific Genre Visualization , 1998, WebNet.

[15]  Sung-Hyon Myaeng,et al.  Text genre classification with genre-revealing and subject-revealing features , 2002, SIGIR '02.

[16]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .