The Logic of the Big Data Turn in Digital Literary Studies

The Digital Humanities, and especially the literary side of the Digital Humanities, i.e., Digital Literary Studies, propose systematic and technologically equipped methodologies in activities where, for centuries, intuition and intelligent handling had played a predominant role. The recent “big data” turn in the natural and social sciences has been particularly revealing of how these new approaches can be applied to traditional scholarly disciplines, such as literary studies. In so doing, big data can renew, with the use of computers, the Humanities, i.e., the disciplines rationally studying humanworks and cultural production. Digital Literary Studies are emblematic of these new approaches, certainly because they constitute the oldest subfield of the Digital Humanities, as some early projects like the Trésor de la Langue Française attest but also because they are the domain in which the intellectual stakes of mass digitization has already been extensively used and debated as demonstrated by Franco Moretti’s Graphs, Maps, Trees (Moretti, 2005), for instance. Some view this evolution enthusiastically as a shift toward the “hard” sciences. This is the case of Matthew Jockers who affirms in the chapter entitled “Revolution” of his book Macroanalysis (Jockers, 2013) that: “Now, slowly and surely, the same elements that have had such an impact on the sciences are revolutionizing the way that research in the humanities get done” (p. 10). Further on, he declares that literary methodology is “in essence no different from the scientific one” (p. 13). Others assert that some questions cannot be dealt with using the same methods in the humanities and the natural sciences, like physics or biology. That is the case of Stephen Ramsay, who, in Reading Machines (Ramsay, 2011), assures us that, even if some problems in the Humanities, like authorship identification, can clearly find comfort with themethods developed by the natural sciences, for most literary critical endeavors, such as characterizing the subjectivity of Virginia Wolf in her novel The Waves, for instance, it is not possible to clearly identify a set of “falsifiable” facts. Between these two extremes, many scholars provide convincing illustrations of what digitization allows and then discuss the nature and current evolution of the Humanities in general, and literary studies in particular. TheCompanion toDigital Humanities (Schreibman et al., 2004), theCompanion to Digital Literary Studies (Siemens and Schreibman, 2008), and more recently an excellent online MLACommons anthology dedicated to Literary Studies in the Digital Age (Price and Siemens, 2013) all provide various and enriching views on these topics. We attempt here to conciliate the two above-mentioned and apparently antagonistic views with the help of a philosophical approach. More precisely, our Grand Challenge is in the service of establishing solid epistemological foundations for the Digital Humanities, which is necessitated by the increasingly important role attributed to digital tools in humanistic research. We also claim that employing a conceptual apparatus originally built by German neo-Kantian philosophers at the beginning of the twentieth century, in particular by Heinrich Rickert and Ernst Cassirer, seems particularly relevant today with the emergence of “big data,” primarily because the logical nature of the possible inferences drawn from this sort of data needs to be clarified.

[1]  Louis Hay,et al.  La littérature des écrivains : questions de critique génétique , 2002 .

[2]  Jean-Gabriel Ganascia,et al.  Automatic detection of reuses and citations in literary texts , 2014, Lit. Linguistic Comput..

[3]  Alfred Jarry Gestes et opinions du docteur Faustroll pataphysicien : roman néo-scientifique , 1911 .

[4]  Katharine Armstrong,et al.  Big data: a revolution that will transform how we live, work, and think , 2014 .

[5]  Ray Siemens,et al.  Literary Studies in the Digital Age , 2013 .

[6]  T. Brown,et al.  Debates in the Digital Humanities , 2013 .

[7]  Matthew L. Jockers Macroanalysis: Digital Methods and Literary History , 2013 .

[8]  Erez Lieberman Aiden,et al.  Uncharted: Big Data as a Lens on Human Culture , 2013 .

[9]  Collectif Multitudes Digital Humanities Manifesto 2.0 , 2015 .

[10]  Alfred Jarry Gestes et opinions du docteur Faustroll , 1911 .

[11]  Jean-Louis Lebrave,et al.  Manuscrits, genèse et documents numérisés , 2004 .

[12]  Helle Porsdam,et al.  Too much ‘digital’, too little ‘humanities’? An attempt to explain why many humanities scholars are reluctant converts to Digital Humanities , 2011 .

[13]  Franco Moretti Graphs, Maps, Trees: Abstract Models for a Literary History , 2005 .

[14]  Frédéric Kaplan,et al.  A Map for Big Data Research in Digital Humanities , 2015, Front. Digit. Humanit..

[15]  Jean-Gabriel Ganascia,et al.  Manuscrits, genèse et documents numérisés. EDITE : une étude informatisée du travail de l'écrivain , 2004, Document Numérique.

[16]  K. Mcsweeney,et al.  The Anxiety of Influence: A Theory of Poetry , 1974 .

[17]  Heinrich Rickert,et al.  Kulturwissenschaft und Naturwissenschaft , 1921 .

[18]  Ray Siemens,et al.  A companion to digital literary studies , 2007 .

[19]  Gregory R. Crane,et al.  What Do You Do with a Million Books? , 2006, D Lib Mag..

[20]  Ernst Cassirer,et al.  From Substance and Function , 2008 .

[21]  Jean-Michel Adam,et al.  Intertextualité et interdiscours : filiations et contextualisation de concepts hétérogènes , 2006, Travaux neuchâtelois de linguistique.

[22]  Kathleen Fitzpatrick,et al.  The Humanities, Done Digitally , 2012 .

[23]  A. Compagnon La Seconde Main, ou le Travail de la Citation@@@Nous, Michel de Montaigne , 1982 .

[24]  Ernst Cassirer Zur Logik der Kulturwissenschaften : Fünf Studien / , 2011 .

[25]  Cameron D. Palmer,et al.  Association Testing of Previously Reported Variants in a Large Case-Control Meta-analysis of Diabetic Nephropathy , 2011, Diabetes.

[26]  Emilio Pastor Platero Eléments de critique génétique , 1995 .

[27]  Francesca Frontini,et al.  Linguistic Pattern Extraction and Analysis for Classic French Plays , 2015 .

[28]  Francesca Frontini,et al.  Computational Study of Stylistics: A Clustering-Based Interestingness Measure for Extracting Relevant Syntactic Patterns , 2015, Int. J. Comput. Linguistics Appl..

[29]  Olha Buchel,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2015 .

[30]  Pierre Saint-Amand,et al.  Palimpsestes. La litterature au second degre , 1983 .