Compter les globules blancs, analyser les partitions

Comparer, et plus generalement traiter, analyser ou indexer les sequences de caracteres constitue le champ de recherche de l''algorithmique du texte. Chercheur CNRS en informatique depuis fin 2006 dans le laboratoire LIFL, maintenant CRIStAL (UMR 9189, Universite de Lille), Mathieu Giraud a eu son parcours academique rythme par les comparaisons de sequences. Ce manuscrit d'habilitation decrit les deux projets dans lesquels il s'est investi les cinq dernieres annees. Au sein de l'equipe Bonsai, commune avec le centre Inria Lille, Mathieu mene avec Mikael Salson un projet de bioinformatique applique a l'hematologie et l'immunologie sur l'analyse des populations de lymphocytes par leurs recombinaisons V(D)J (« Compter les globules blancs »). Debute par une collaboration avec des collegues bioinformaticiens et hematologues de l'hopital de Lille, ce projet combine algorithmique pour l'immunologie et l'hematologie, developpement logiciel et applications fondamentales et cliniques. Le logiciel Vidjil concu par Mathieu et ses collegues est utilise regulierement par plusieurs laboratoires en France et a l'etranger, dont, en situation de routine, a l'hopital de Lille. Mathieu dirige aussi un projet d'informatique musicale (« Analyser les partitions »). Les humanites numeriques lient les methodes informatiques au patrimoine culturel et a la recherche en sciences humaines et sociales. Est-ce qu'un ordinateur peut comprendre la musique? L'equipe emergente Algomus, repartie entre les laboratoires MIS (Amiens, Univ. Picardie Jules Verne) et CRIStAL, rassemble expertise musicologique et competences algorithmiques pour proposer des methodes analysant les partitions musicales -- comparaison de motifs et d'accords, detection de textures, analyse de formes. Algomus mene des collaborations pluridisciplinaires avec des musicologues, des professeurs de musique et des artistes et realise des projets combinant science et art. Ce manuscrit se conclut par la description d'actions de mediation scientifique et artistique.

[1]  J. Stephen Downie,et al.  The Music Information Retrieval Evaluation eXchange (MIREX) , 2006 .

[2]  W. Robinson Sequencing the functional antibody repertoire—diagnostic and therapeutic discovery , 2015, Nature Reviews Rheumatology.

[3]  Mikhail Shugay,et al.  MiXCR: software for comprehensive adaptive immunity profiling , 2015, Nature Methods.

[4]  O. Gotoh An improved algorithm for matching biological sequences. , 1982, Journal of molecular biology.

[5]  A. V. van Kampen,et al.  Discovery of Invariant T Cells by Next-Generation Sequencing of the Human TCR α-Chain Repertoire , 2014, The Journal of Immunology.

[6]  Shlomo Dubnov,et al.  Using Factor Oracles for Machine Improvisation , 2004, Soft Comput..

[7]  D. Temperley The Cognition of Basic Musical Structures , 2001 .

[9]  Mathieu Giraud,et al.  Subject and Counter-Subject Detection for Analysis of the Well-Tempered Clavier Fugues , 2012, CMMR.

[10]  Oriol Nieto,et al.  JAMS: A JSON Annotated Music Specification for Reproducible MIR Research , 2014, ISMIR.

[11]  Abraham Lempel,et al.  Compression of individual sequences via variable-rate coding , 1978, IEEE Trans. Inf. Theory.

[12]  R. Gadagkar Nothing in Biology Makes Sense Except in the Light of Evolution , 2005 .

[13]  Mathieu Giraud,et al.  RHYTHM EXTRACTION FROM POLYPHONIC SYMBOLIC MUSIC , 2011, ISMIR 2011.

[14]  Mathieu Giraud,et al.  Computational Analysis of Musical Form , 2016, Computational Music Analysis.

[15]  J. Stephen Downie,et al.  The music information retrieval evaluation exchange (2005-2007): A window into music information retrieval research , 2008 .

[16]  G. Gould,et al.  Conversations with Glenn Gould , 1984 .

[17]  Timothy F. Jones El disfrute de la música mediante la obtención de resultados: el sistema pedagógico de la "Associated Board of the Royal Schools of Music" , 1999 .

[18]  Pierre Hanna,et al.  Improvements of Key finding Methods , 2008 .

[19]  Robert Strandh,et al.  Dynamic Chord Analysis for Symbolic Music , 2009, ICMC.

[20]  Costas S. Iliopoulos,et al.  Toward a General Framework for Polyphonic Comparison , 2009, Fundam. Informaticae.

[21]  Donald E. Knuth,et al.  Fast Pattern Matching in Strings , 1977, SIAM J. Comput..

[22]  Jiang Xue-qiang,et al.  Clinical Significance of Minimal Residual Disease in Childhood Acute Lymphoblastic Leukemia , 2010 .

[23]  James Hepokoski,et al.  Elements of sonata theory : norms, types, and deformations in the late eighteenth-century sonata , 2006 .

[24]  George Georgiou,et al.  High-throughput sequencing of the paired human immunoglobulin heavy and light chain repertoire , 2013, Nature Biotechnology.

[25]  Patrice Duroux,et al.  IMGT/HIGHV-QUEST: THE IMGT® WEB PORTAL FOR IMMUNOGLOBULIN (IG) OR ANTIBODY AND T CELL RECEPTOR (TR) ANALYSIS FROM NGS HIGH THROUGHPUT AND DEEP SEQUENCING , 2012 .

[26]  Yannis Manolopoulos,et al.  Detection of Stream Segments in Symbolic Musical Data , 2008, ISMIR.

[27]  B. Song,et al.  Heterogeneous expansion of CD4+ tumor-infiltrating T-lymphocytes in clear cell renal cell carcinomas. , 2015, Biochemical and biophysical research communications.

[28]  Masataka Goto,et al.  AIST Annotation for the RWC Music Database , 2006, ISMIR.

[29]  R. Holt,et al.  Profiling the T-cell receptor beta-chain repertoire by massively parallel sequencing. , 2009, Genome research.

[30]  C. Krumhansl,et al.  Tracing the dynamic changes in perceived tonal organization in a spatial representation of musical keys. , 1982, Psychological review.

[31]  Hanna M. Lukashevich Towards Quantitative Measures of Evaluating Song Segmentation , 2008, ISMIR.

[32]  M. Giraud,et al.  Modélisation et visualisation de schémas d'analyse musicale avec music21 , 2015 .

[33]  Darrell Conklin,et al.  Segmental Pattern Discovery in Music , 2006, INFORMS J. Comput..

[35]  Frédéric Dufeu,et al.  TIAALS: A New Generic Set of Tools for the Interactive Aural Analysis of Electroacoustic Music , 2013 .

[36]  Mikhail Shugay,et al.  MiTCR: software for T-cell receptor sequencing data analysis , 2013, Nature Methods.

[37]  R. Emerson,et al.  High-throughput pairing of T cell receptor α and β sequences , 2015, Science Translational Medicine.

[38]  André Gédalge Traité de la Fugue , 1949 .

[39]  Esko Ukkonen,et al.  Geometric algorithms for transposition invariant content based music retrieval , 2003, ISMIR.

[40]  Daniela Latorre,et al.  Functional heterogeneity of human memory CD4+ T cell clones primed by pathogens or vaccines , 2015, Science.

[41]  Jean-Jacques Nattiez,et al.  Musicologie générale et sémiologie , 1990 .

[42]  M. Kneba,et al.  Has MRD monitoring superseded other prognostic factors in adult ALL? , 2012, Blood.

[43]  Robert O. Gjerdingen,et al.  The Psychology of Music , 1972 .

[44]  Dmitri Tymoczko Geometry of Music , 2016 .

[45]  Jordan B. L. Smith,et al.  Design and creation of a large-scale database of structural annotations , 2011, ISMIR.

[46]  François Pachet,et al.  The Continuator: Musical Interaction With Style , 2003, ICMC.

[47]  D. Campana,et al.  Concurrent detection of minimal residual disease (MRD) in childhood acute lymphoblastic leukaemia by flow cytometry and real‐time PCR , 2005, British journal of haematology.

[48]  Costas S. Iliopoulos,et al.  Approximate string matching for music analysis , 2004, Soft Comput..

[49]  Claude Preudhomme,et al.  Fast multiclonal clusterization of V(D)J recombinations from high-throughput sequencing , 2014, BMC Genomics.

[50]  David Huron,et al.  Music Information Processing Using the Humdrum Toolkit: Concepts, Examples, and Lessons , 2002, Computer Music Journal.

[51]  E. Mejstrikova,et al.  The predictive strength of next-generation sequencing MRD detection for relapse compared with current methods in childhood ALL. , 2015, Blood.

[52]  Marie-Paule Lefranc,et al.  IMGT/JunctionAnalysis: the first tool for the analysis of the immunoglobulin and T cell receptor complex V-J and V-D-J JUNCTIONs , 2004, ISMB/ECCB.

[53]  Yi Shi,et al.  TCRklass: A New K-String–Based Algorithm for Human and Mouse TCR Repertoire Characterization , 2015, The Journal of Immunology.

[54]  Marie-Paule Lefranc,et al.  IMGT/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis , 2008, Nucleic Acids Res..

[55]  Scott Boyd,et al.  Benchmarking the performance of human antibody gene alignment utilities using a 454 sequence dataset , 2010, Bioinform..

[56]  Rajat K De,et al.  Immunoinformatics: a brief review. , 2014, Methods in molecular biology.

[57]  Lawrence Gushee,et al.  De institutione musica , 1994 .

[58]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[59]  Y. Louzoun,et al.  Rep‐Seq: uncovering the immunological repertoire through next‐generation sequencing , 2012, Immunology.

[60]  David A. Hafler,et al.  pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires , 2014, Bioinform..

[61]  Frans Wiering,et al.  Unfolding the potential of computational musicology , 2011, ICISO 2011.

[62]  Alan Marsden 'What was the question?': music analysis and the computer. , 2009 .

[63]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[64]  William P. Birmingham,et al.  Automatic Thematic Extractor , 2003, Journal of Intelligent Information Systems.

[65]  Gérard Assayag,et al.  OpenMusic: visual programming environment for music composition, analysis and research , 2011, ACM Multimedia.

[66]  José Manuel Iñesta Quereda,et al.  Harmonic, Melodic, and Functional Automatic Analysis , 2007, ICMC.

[67]  Masaki Matsubara,et al.  Prioritized contig combining to segregate voices in polyphonic music , 2011 .

[68]  William A. Wood,et al.  Peptide/MHC Tetramer–Based Sorting of CD8+ T Cells to a Leukemia Antigen Yields Clonotypes Drawn Nonspecifically from an Underlying Restricted Repertoire , 2015, Cancer Immunology Research.

[69]  J. Bach,et al.  Forty-eight preludes and fugues , 1981 .

[70]  Heinrich Schenker,et al.  Der freie Satz , 1935 .

[71]  iAnalyse : un logiciel d'aide à l'analyse musicale , 2008 .

[72]  Abraham Lempel,et al.  A universal algorithm for sequential data compression , 1977, IEEE Trans. Inf. Theory.

[73]  Oliver Kohlbacher,et al.  Immunoinformatics and epitope prediction in the age of genomic medicine , 2015, Genome Medicine.

[74]  C. Nusbaum,et al.  High-Resolution Description of Antibody Heavy-Chain Repertoires in Humans , 2011, PloS one.

[75]  M. Neuwirth,et al.  What Is a Cadence?: Theoretical and Analytical Perspectives on Cadences in the Classical Repertoire , 2015 .

[76]  Carlos Pérez-Sancho,et al.  New framework for score segmentation and analysis in OpenMusic , 2012 .

[77]  Petri Toiviainen,et al.  MIR In Matlab: The MIDI Toolbox , 2004, ISMIR.

[78]  Mathieu Giraud,et al.  Detecting Episodes with Harmonic Sequences for Fugue Analysis , 2012, ISMIR.

[79]  Thomas B. Kepler,et al.  SoDA2: a Hidden Markov Model approach for identification of immunoglobulin rearrangements , 2010, Bioinform..

[80]  Holger H. Hoos,et al.  The GUIDO Notation Format: A Novel Approach for Adequately Representing Score-Level Music , 1998, ICMC.

[81]  Satoshi Tojo,et al.  Fatta: Full Automatic Time-Span Tree Analyzer , 2007, ICMC.

[82]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[83]  Rainer Typke,et al.  Music Retrieval based on Melodic Similarity , 2007 .

[84]  Elaine Chew,et al.  Separating Voices in Polyphonic Music: A Contig Mapping Approach , 2004, CMMR.

[85]  R. Gold,et al.  Thymocyte‐derived BDNF influences T‐cell maturation at the DN3/DN4 transition stage , 2015, European journal of immunology.

[86]  Emmanuel Vincent,et al.  Semiotic Description of Music Structure: An Introduction to the Quaero/Metiss Structural Annotations , 2014, Semantic Audio.

[87]  R. Jackendoff,et al.  A Generative Theory of Tonal Music , 1985 .

[88]  Quentin R. Nordgren A Measure of Textural Patterns and Strengths , 1960 .

[89]  S. Karlin,et al.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[90]  David Sankoff,et al.  Comparison of musical sequences , 1990, Comput. Humanit..

[91]  Arbee L. P. Chen,et al.  Efficient repeating pattern finding in music databases , 1998, CIKM '98.

[92]  Rajeev Raman,et al.  String-Matching techniques for musical similarity and melodic recognition , 1998 .

[93]  Claude Preudhomme,et al.  Multi‐loci diagnosis of acute lymphoblastic leukaemia with high‐throughput sequencing and bioinformatics analysis , 2016, British journal of haematology.

[94]  陳良弼,et al.  Automatic Musical Form Analysis , 2005 .

[95]  Jean-Louis Giavitto,et al.  Computation and Visualization of Musical Structures in Chord-Based Simplicial Complexes , 2013, MCM.

[96]  Chantal Buteau,et al.  Can computational music analysis be both musical and computational? , 2010 .

[97]  Emmanuel Vincent,et al.  Semiotic Structure Labeling of Music Pieces: Concepts, Methods and Annotation Conventions , 2012, ISMIR.

[98]  Ning Ma,et al.  IgBLAST: an immunoglobulin variable domain sequence analysis tool , 2013, Nucleic Acids Res..

[99]  Amar Mukherjee,et al.  The Burrows-Wheeler Transform:: Data Compression, Suffix Arrays, and Pattern Matching , 2008 .

[100]  T. Kalina,et al.  EuroFlow standardization of flow cytometer instrument settings and immunophenotyping protocols , 2012, Leukemia.

[101]  Patrick Wilson,et al.  iHMMune-align: hidden Markov model-based alignment and identification of germline genes in rearranged immunoglobulin gene sequences , 2007, Bioinform..

[102]  John Shawe-Taylor,et al.  Decombinator: a tool for fast, efficient gene assignment in T-cell receptor sequences using a finite state machine , 2013, Bioinform..

[103]  Thierry Lecroq,et al.  The exact online string matching problem: A review of the most recent results , 2013, CSUR.

[104]  M. Giraud,et al.  Vers une analyse automatique des formes sonates , 2014 .

[105]  David A. Fenstermacher,et al.  Introduction to bioinformatics , 2005, J. Assoc. Inf. Sci. Technol..

[106]  Alfred V. Aho,et al.  Efficient string matching , 1975, Commun. ACM.

[107]  Mathieu Giraud,et al.  Computational Fugue Analysis , 2015, Computer Music Journal.

[108]  Wojciech Rytter,et al.  Text Algorithms , 1994 .

[109]  Olivier Messiaen Technique de mon langage musical , 1944 .

[110]  J. Webster,et al.  Musical Form, Forms & Formenlehre: Three Methodological Reflections , 2009 .

[111]  R Bellman,et al.  On the Theory of Dynamic Programming. , 1952, Proceedings of the National Academy of Sciences of the United States of America.

[112]  D. Le Paslier,et al.  Human T cell gamma genes are frequently rearranged in B-lineage acute lymphoblastic leukemias but not in chronic B cell proliferations , 1987, The Journal of experimental medicine.

[113]  M. Nielsen,et al.  No evidence for the use of DIR, D–D fusions, chromosome 15 open reading frames or VHreplacement in the peripheral repertoire was found on application of an improved algorithm, JointML, to 6329 human immunoglobulin H rearrangements , 2006, Immunology.

[114]  S. Tonegawa Somatic generation of antibody diversity , 1983, Nature.

[115]  Marie-Paule Lefranc,et al.  IMGT , the international ImMunoGeneTics information system , 2003 .

[116]  F. Papavasiliou,et al.  V(D)J Recombination and the Evolution of the Adaptive Immune System , 2003, PLoS biology.

[117]  Jiajie Zhang,et al.  PEAR: a fast and accurate Illumina Paired-End reAd mergeR , 2013, Bioinform..

[118]  Sven Ahlbäck,et al.  Melodic similarity as a determinant of melody structure , 2007 .

[119]  W. L. Windsor Music and Probability , 2009 .

[120]  J. Dongen,et al.  Multiple clonal Ig/TCR products: implications for interpretation of clonality findings , 2012, Journal of Hematopathology.

[121]  David Temperley,et al.  What's Key for Key? The Krumhansl-Schmuckler Key-Finding Algorithm Reconsidered , 1999 .

[122]  Marcel Mesnage Morphoscope, a computer system for music analysis , 1993 .

[123]  Nicholas Cook,et al.  A guide to musical analysis , 1987 .

[124]  Alexander Zelikovsky,et al.  Bioinformatics Algorithms: Techniques and Applications , 2008 .

[125]  Christopher Ariza,et al.  Music21: A Toolkit for Computer-Aided Musicology and Symbolic Music Data , 2010, ISMIR.

[126]  Jérôme Lane,et al.  IMGT®, the international ImMunoGeneTics information system® , 2004, Nucleic Acids Res..

[127]  David Huron,et al.  Characterizing Musical Textures , 1989, ICMC.

[128]  Jon Louis Bentley,et al.  Quad trees a data structure for retrieval on composite keys , 1974, Acta Informatica.

[129]  Mathieu Giraud,et al.  Modeling Musical Structure with Parametric Grammars , 2015, MCM.

[130]  M Hummel,et al.  Design and standardization of PCR primers and protocols for detection of clonal immunoglobulin and T-cell receptor gene recombinations in suspect lymphoproliferations: Report of the BIOMED-2 Concerted Action BMH4-CT98-3936 , 2003, Leukemia.

[131]  Nicolas Donin,et al.  L'analyse musicale, une pratique et son histoire , 2009 .

[132]  Peter N. Robinson,et al.  IMSEQ - a fast and error aware approach to immunogenetic sequence analysis , 2015, Bioinform..

[133]  Gerhard Widmer,et al.  Key-Finding with Interval Profiles , 2007, ICMC.

[134]  O. Lartillot Motivic Pattern Extraction in Symbolic Domain , 2008 .

[135]  Mathieu Bergeron,et al.  Discovery of Contrapuntal Patterns , 2010, ISMIR.

[136]  R. White,et al.  High-Throughput Sequencing of the Zebrafish Antibody Repertoire , 2009, Science.

[137]  Mathieu Giraud,et al.  Fragmentations with Pitch, Rhythm and Parallelism Constraints for Variation Matching , 2013, CMMR.

[138]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[139]  Mathieu Giraud,et al.  Algorithmes pour l'analyse de la musique tonale , 2014, Tech. Sci. Informatiques.

[140]  S. R. Holtzman A program for key determination , 1977 .

[141]  Nicolas Guiomard-Kagan,et al.  Comparing Voice and Stream Segmentation Algorithms , 2015, ISMIR.

[143]  Pierre Couprie EAnalysis : aide à l'analyse de la musique électroacoustique , 2012 .

[144]  P. Lipsky,et al.  Characterization of the Human Ig Heavy Chain Antigen Binding Complementarity Determining Region 3 Using a Newly Developed Software Algorithm, JOINSOLVER , 2004, The Journal of Immunology.

[145]  Marie-Paule Lefranc,et al.  IMGT, the international ImMunoGeneTics information system® , 2004, Nucleic Acids Res..

[146]  William P. Birmingham,et al.  Algorithms for Chordal Analysis , 2002, Computer Music Journal.

[147]  T. Lingner,et al.  Modulation of CNS autoimmune responses by CD8+ T cells coincides with their oligoclonal expansion , 2016, Journal of Neuroimmunology.