Web media and the quantitative content analysis: Methodological challenges in measuring online news content

This article presents a method for quantitative content analysis of news online. The research design is based on a triangulation approach, using qualitative and quantitative measures combined with automated computer-assisted analysis. Used to perform a content analysis of the online news output of the Norwegian Broadcasting Corporation [NRK] from 2009, this approach revealed that methodologies designed for measuring broadcasting news content do not suffice in the online news environment. Online research methods need to be redesigned to account for the medium-specific news features on the internet. Computer-assisted coding methods can contribute depth and scale to such an analysis, as it can extract and assemble detailed data on large quantities of articles. Using a combination of automatic coding methods with established content analysis for television news, this article presents a new design for quantitative content analysis of news online.

[1]  Susan T. Dumais,et al.  Hierarchical classification of Web content , 2000, SIGIR '00.

[2]  Knut Helland Public service and commercial news: Contexts of production, genre conventions and textual claims in television. , 1993 .

[3]  G. Bettega,et al.  [Bad news!]. , 2008, Revue de stomatologie et de chirurgie maxillo-faciale.

[4]  Michael Karlsson,et al.  Freezing the Flow of Online News : Exploring Approaches to Study the Liquidity of Online News , 2009 .

[5]  Pablo J. Boczkowski,et al.  Digitizing the News: Innovation in Online Newspapers , 2004 .

[6]  Peter Fankhauser,et al.  Boilerplate detection using shallow text features , 2010, WSDM '10.

[7]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[8]  Anna van Raaphorst CMS (Content Management System) , 2007 .

[9]  A. Jönsson Samma nyheter eller likadana? : studier av mångfald i svenska TV-nyheter , 2004 .

[10]  James H. Cross,et al.  Reverse engineering and design recovery: a taxonomy , 1990, IEEE Software.

[11]  Cédrick Fairon,et al.  Building and Exploring Web Corpora. Proceedings of the 3rd web as corpus workshop, incorporating cleaneval , 2007 .

[12]  L. Manovich,et al.  The language of new media , 2001 .

[13]  R. Weber Basic Content Analysis , 1986 .

[14]  Michael Karlsson,et al.  Freezing the Flow of Online News : Exploring Approaches to Study the Liquidity of Online News , 2009 .

[15]  Kimberly A. Neuendorf,et al.  The Content Analysis Guidebook , 2001 .