Coding Sequences: A History of Sequence Comparison Algorithms as a Scientific Instrument

Sequence comparison algorithms are sophisticated pieces of software that compare and match identical or similar regions of DNA, RNA, or protein sequence. This paper examines the origins and development of these algorithms from the 1960s to the 1990s. By treating this software as a kind of scientific instrument used to examine sets of biological objects, the paper shows how algorithms have been used as different sorts of tools and appropriated for different sorts of uses according to the disciplinary context in which they were deployed. These particular uses have made sequences themselves into different kinds of objects.

[1]  J. Hagen Naturalists, Molecular Biologists, and the Challenges of Molecular Evolution , 1999, Journal of the History of Biology.

[2]  H. Cunningham Race to the Finish: Identity and Governance in an Age of Genomics , 2006 .

[3]  M. Fortun Projecting Speed Genomics , 1999 .

[4]  G. Mahairas,et al.  Sequencing the human genome. , 1997, Science.

[5]  David Sankoff,et al.  Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison , 1983 .

[6]  D. Lipman,et al.  Rapid and sensitive protein similarity searches. , 1985, Science.

[7]  H. McAdams,et al.  Circuit simulation of genetic networks. , 1995, Science.

[8]  Sahotra Sarkar,et al.  Decoding “coding”—information and DNA , 1996 .

[9]  J. Felsenstein,et al.  An evolutionary model for maximum likelihood alignment of DNA sequences , 1991, Journal of Molecular Evolution.

[10]  S M Ulam,et al.  Some ideas and prospects in biomathematics. , 1972, Annual review of biophysics and bioengineering.

[11]  E. Margoliash,et al.  EVOLUTION OF CYTOCHROME C. , 1964, Federation proceedings.

[12]  Ralph E. Hoffman,et al.  The Gene Wars: Science, Politics, and the Human Genome , 1996 .

[13]  M. Barbieri Biology with information and meaning. , 2003, History and philosophy of the life sciences.

[14]  Hannah Landecker,et al.  Culturing Life: How Cells Became Technologies , 2007 .

[15]  Joseph Felsenstein,et al.  The number of evolutionary trees , 1978 .

[16]  M. Dietrich,et al.  Paradox and Persuasion: Negotiating the Place of Molecular Evolution within Evolutionary Biology , 1998, Journal of the history of biology.

[17]  H. Rheinberger Living and Working with the New Medical Technologies: Beyond nature and culture: modes of reasoning in the age of molecular biology and medicine , 2000 .

[18]  J. Ségal,et al.  The use of information theory in biology: a historical perspective. , 2003, History and philosophy of the life sciences.

[19]  Russell F. Doolittle,et al.  On the trail of protein sequences , 2000, Bioinform..

[20]  Michael M. J. Fischer,et al.  Living and Working with the New Medical Technologies:Living and Working with the New Medical Technologies. , 2004 .

[21]  Rohit Parikh,et al.  States of Knowledge , 2002, WoLLIC.

[22]  Information Metaphors and the Human Genome Project , 2015, Perspectives in biology and medicine.

[23]  E. D. Hyman A new method of sequencing DNA. , 1988, Analytical biochemistry.

[24]  Telecommunications Board Funding a Revolution: Government Support for Computing Research , 1999 .

[25]  R. Kohler, Lords of the fly: Drosophila genetics and the experimental life. , 1995 .

[26]  V. Bryson,et al.  Evolving Genes and Proteins. , 1965, Science.

[27]  George E. Kimball,et al.  Punched Card Calculation of Resonance Energies , 1949 .

[28]  M. S. Lindee,et al.  Genetic disease since 1945 , 2000, Nature Reviews Genetics.

[29]  Paul Stroobant,et al.  Platelet-derived growth factor is structurally related to the putative transforming protein p28sis of simian sarcoma virus , 1983, Nature.

[30]  M. O. Dayhoff,et al.  Atlas of protein sequence and structure , 1965 .

[31]  M. I. Kanehisa,et al.  Pattern recognition in nucleic acid sequences. I. A general method for finding local homologies and symmetries , 1982, Nucleic Acids Res..

[32]  The Origin and Early Development of the Method of Minimum Evolution for the Reconstruction of Phylogenetic Trees , 1996 .

[33]  M. Goodman On the Emergence of Intraspecific Differences in the Protein Antigens of Human Beings , 1960, The American Naturalist.

[34]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[35]  W. Fitch THE PROBABLE SEQUENCE OF NUCLEOTIDES IN SOME CODONS. , 1964, Proceedings of the National Academy of Sciences of the United States of America.

[36]  T. Gingeras,et al.  Computer programs for the assembly of DNA sequences. , 1979, Nucleic acids research.

[37]  Swee Lay Thein,et al.  Hypervariable ‘minisatellite’ regions in human DNA , 1985, Nature.

[38]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[39]  Richard Bellman,et al.  Eye of the hurricane : an autobiography , 1984 .

[40]  D. Nelkin,et al.  The DNA Mystique: The Gene As a Cultural Icon , 1995 .

[41]  R F Doolittle Some reflections on the early days of sequence searching. , 1997, Journal of molecular medicine.

[42]  S. Jasanoff States of Knowledge: The Co-production of Science and the Social Order , 2004 .

[43]  F. Sanger,et al.  DNA sequencing with chain-terminating inhibitors. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[44]  W. Fitch,et al.  Construction of phylogenetic trees. , 1967, Science.

[45]  A. Blank Composite Substance, Common Notions, and Kenelm Digby's Theory of Animal Generation , 2007, Science in Context.

[46]  M O Dayhoff Computer aids to protein sequence determination. , 1965, Journal of theoretical biology.

[47]  Lily E. Kay,et al.  Who Wrote the Book of Life?: A History of the Genetic Code , 2000 .

[48]  Vincent M. Sarich,et al.  Immunological Time Scale for Hominid Evolution , 1967, Science.

[49]  W. A. Beyer,et al.  Additive evolutionary trees. , 1977, Journal of theoretical biology.

[50]  D. Lipman,et al.  Rapid similarity searches of nucleic acid and protein data banks. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[51]  C. Cranor Are genes us? : the social consequences of the new genetics , 1994 .

[52]  Margaret Oakley Dayhoff Margaret Oakley Dayhoff March 11, 1925–February 5, 1983 , 2005, Journal of Molecular Evolution.

[53]  Joel B. Hagen,et al.  1The introduction of computers into systematic research in the United States during the 1960s , 2001 .

[54]  P H Sellers Pattern recognition in genetic sequences. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[55]  Giovanni Boniolo,et al.  Biology without information. , 2003, History and philosophy of the life sciences.

[56]  G J Morgan,et al.  Emile Zuckerkandl, Linus Pauling, and the Molecular Evolutionary Clock, 1959–1965 , 1998, Journal of the history of biology.

[57]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[58]  Jean-Paul Gaudillière,et al.  Making Mice and Other Devices: The Dynamics of Instrumentation in American Biomedical Research (1930–1960) , 2001 .

[59]  Brian K. Hall,et al.  Homology: The hierarchical basis of comparative biology , 1994 .

[60]  M. O. Dayhoff,et al.  Viral src gene products are related to the catalytic chain of mammalian cAMP-dependent protein kinase. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[61]  Philippe Descola,et al.  Beyond nature and culture , 2012, HAU: Journal of Ethnographic Theory.

[62]  A. Kerr (Re)Constructing Genetic Disease , 2000 .

[63]  J. Craig Venter,et al.  A Life Decoded: My Genome, My Life , 2007 .

[64]  Jan A Witkowski,et al.  Picture control: the electron microscope and the transformation of biology in America, 1940–1960 , 2000, Medical History.

[65]  I. Tóth Non-Euclidean Geometry before Euclid , 1969 .

[66]  L. J. Korn,et al.  Computer analysis of nucleic acid regulatory sequences. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[67]  Leroy Hood,et al.  The Code of Codes Scientific and Social Issues in the Human Genome Project , 1992 .

[68]  A. Edwards,et al.  The reconstruction of evolution , 1963 .

[69]  M. Fortun,et al.  The practices of human genetics , 1999 .

[70]  Calvin R. Bernard,et al.  Life in science. Ichthyologists hooked on Facebook. , 2011, Science.

[71]  R. Casiday The DNA Mystique: The Gene as a Cultural Icon. Second Edition. By Dorothy Nelkin & M. Susan Lindee. Pp. 284. (University of Michigan Press, Cambridge, 2004.) US$22.95, ISBN 0-472-03004-3, paperback. , 2005, Journal of Biosocial Science.

[72]  C. Brandt Genetic Code, Text, and Scripture: Metaphors and Narration in German Molecular Biology , 2005, Science in Context.

[73]  M. Dietrich,et al.  The origins of the neutral theory of molecular evolution , 1994, Journal of the history of biology.

[74]  P. Sellers On the Theory and Computation of Evolutionary Distances , 1974 .

[75]  Joan H. Fujimura,et al.  The Practices of Producing Meaning in Bioinformatics , 1999 .

[76]  E. Díaz The Rhetoric of Informational Molecules: Authority and Promises in the Early Study of Molecular Evolution , 2007, Science in Context.

[77]  S. B. Needleman,et al.  Rabbit heart cytochrome c. , 1966, Journal of Biological Chemistry.

[78]  W. A. Beyer,et al.  Some Biological Sequence Metrics , 1976 .

[79]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[80]  M. O. Dayhoff Computer analysis of protein evolution. , 1969, Scientific American.

[81]  D. Morrison Why would phylogeneticists ignore computerized sequence alignment? , 2009, Systematic biology.

[82]  Boelie Elzen,et al.  Two Ultracentrifuges: A Comparative Study of the Social Construction of Artefacts , 1986 .

[83]  Gregory Radick,et al.  The Century of the Gene , 2001, Heredity.

[84]  Jenny Reardon Race to the Finish , 2009 .

[85]  R F Doolittle,et al.  Simian sarcoma virus onc gene, v-sis, is derived from the gene (or genes) encoding a platelet-derived growth factor. , 1983, Science.

[86]  David Sankoff,et al.  The early introduction of dynamic programming into computational biology , 2000, Bioinform..

[87]  L. E. Kay Life as Technology: Representing, Intervening, and Molecularizing , 1996 .

[88]  Thomas Haigh,et al.  Histories of Computing , 2011 .

[89]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[90]  Bruno J. Strasser,et al.  GenBank--Natural History in the 21st Century? , 2008, Science.

[91]  Stuart Dreyfus,et al.  Richard Bellman on the Birth of Dynamic Programming , 2002, Oper. Res..

[92]  Mark Bitensky Sequencing the human genome: summary report of the Santa Fe workshop March 3-4, 1986 , 1986 .

[93]  Paul E. Ceruzzi,et al.  A history of modern computing , 1999 .

[94]  H. Rheinberger Beyond nature and culture. Modes of reasoning in the age of molecular biology , 2000 .

[95]  W. A. Beyer,et al.  A molecular sequence metric and evolutionary trees , 1974 .

[96]  M. Waterman,et al.  Comparative biosequence metrics , 2005, Journal of Molecular Evolution.

[97]  R. Doolittle Similar amino acid sequences: chance or common ancestry? , 1981, Science.

[98]  Bruno J. Strasser,et al.  "Sickle Cell Anemia, a Molecular Disease" , 1999, Science.

[99]  Kay Le Laboratory technology and biological knowledge: the Tiselius electrophoresis apparatus, 1930-1945. , 1988 .

[100]  W. Fitch An improved method of testing for evolutionary homology. , 1966, Journal of molecular biology.

[101]  G. Wagner The Biological Homology Concept , 1989 .

[102]  P. Campbell,et al.  Mapping and Sequencing the Human Genome , 1989, Biotechnology and applied biochemistry.

[103]  Sahotra Sarkar,et al.  Biological Information: A Skeptical Look at Some Central Dogmas of Molecular Biology , 1996 .

[104]  M. Dietrich The problem of the gene. , 2000, Comptes rendus de l'Academie des sciences. Serie III, Sciences de la vie.

[105]  Marianne Sommer History in the Gene: Negotiations Between Molecular and Organismal Anthropology , 2008, Journal of the history of biology.