L1000 Viewer: A Search Engine and Web Interface for the LINCS Data Repository

The LINCS L1000 data repository contains almost two million gene expression profiles for thousands of small molecules and drugs. However, due to the complexity and the size of the data repository and a lack of an interoperable interface, the creation of pharmacologically meaningful workflows utilizing these data is severely hampered. In order to overcome this limitation, we developed the L1000 Viewer, a search engine and graphical web interface for the LINCS data repository. The web interface serves as an interactive platform allowing the user to select different forms of perturbation profiles, e.g., for specific cell lines, drugs, dosages, time points and combinations thereof. At its core, our method has a database we created from inferring and utilizing the intricate dependency graph structure among the data files. The L1000 Viewer is accessible via http://L1000viewer.bio-complexity.com/.

[1]  Vasileios Stathias,et al.  Data Portal for the Library of Integrated Network-based Cellular Signatures (LINCS) program: integrated access to diverse large-scale cellular perturbation response data , 2017, Nucleic Acids Res..

[2]  Francis Jack Smith,et al.  Data science as an academic discipline , 2006, Data Sci. J..

[3]  Ravi Iyengar,et al.  The Library of Integrated Network-Based Cellular Signatures NIH Program: System-Level Cataloging of Human Cells Response to Perturbations. , 2017, Cell systems.

[4]  Yu Lin,et al.  Ontological representation, integration, and analysis of LINCS cell line cells and their cellular responses , 2017, BMC Bioinformatics.

[5]  Avi Ma'ayan,et al.  L1000FWD: fireworks visualization of drug‐induced transcriptomic signatures , 2018, Bioinform..

[6]  M. Dehmer,et al.  The human disease network , 2013 .

[7]  Darcy A. Davis,et al.  Exploring and Exploiting Disease Interactions from Multi-Relational Gene and Phenotype Networks , 2011, PloS one.

[8]  Angela N. Brooks,et al.  A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles , 2017, Cell.

[9]  Laleh Soltan Ghoraie,et al.  A review of connectivity map and computational approaches in pharmacogenomics , 2017, Briefings Bioinform..

[10]  A. Hopkins Network pharmacology: the next paradigm in drug discovery. , 2008, Nature chemical biology.

[11]  Matthias Dehmer,et al.  NetBioV: an R package for visualizing large network data in biology and medicine , 2014, Bioinform..

[12]  Joshua A. Bittker,et al.  The Carcinogenome Project: In Vitro Gene Expression Profiling of Chemical Perturbations to Predict Long-Term Carcinogenicity , 2018, bioRxiv.

[13]  Dexter Hadley,et al.  Systematic integration of biomedical knowledge prioritizes drugs for repurposing , 2017, bioRxiv.

[14]  Matthias Dehmer,et al.  Defining Data Science by a Data-Driven Quantification of the Community , 2018, Mach. Learn. Knowl. Extr..

[15]  Andrew H. Beck,et al.  PharmacoGx: an R package for analysis of large pharmacogenomic datasets , 2015, Bioinform..

[16]  Klaus Hinkelmann,et al.  Design and Analysis of Experiments: Introduction to Experimental Design , 1994 .

[17]  Benjamin W. Wah,et al.  Significance and Challenges of Big Data Research , 2015, Big Data Res..

[18]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[19]  Marc Hafner,et al.  L1000CDS2: LINCS L1000 characteristic direction signatures search engine , 2016, npj Systems Biology and Applications.

[20]  María Rodríguez Martínez,et al.  Elucidating Compound Mechanism of Action by Network Perturbation Analysis Graphical , 2015 .

[21]  Amar Koleti,et al.  Metadata Standard and Data Exchange Specifications to Describe, Model, and Integrate Complex and Diverse High-Throughput Screening Data from the Library of Integrated Network-based Cellular Signatures (LINCS) , 2014, Journal of biomolecular screening.

[22]  Avi Ma'ayan,et al.  Lean Big Data integration in systems biology and systems pharmacology. , 2014, Trends in pharmacological sciences.

[23]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[24]  Matthias Dehmer,et al.  Exploiting Genomic Relations in Big Data Repositories by Graph-Based Search Methods , 2018, Mach. Learn. Knowl. Extr..

[25]  Rajiv Narayan,et al.  The GCTx format and cmap{Py, R, M, J} packages: resources for optimized storage and integrated traversal of annotated dense matrices , 2018, Bioinform..

[26]  Matthias Dehmer,et al.  samExploreR: exploring reproducibility and robustness of RNA-seq results based on SAM files , 2016, Bioinform..

[27]  Steve Vinoski,et al.  Node.js: Using JavaScript to Build High-Performance Network Programs , 2010, IEEE Internet Comput..

[28]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[29]  Frank Emmert-Streib,et al.  GSAR: Bioconductor package for Gene Set analysis in R , 2017, BMC Bioinformatics.

[30]  Levi A Garraway,et al.  Adaptive resistance of melanoma cells to RAF inhibition via reversible induction of a slowly dividing de‐differentiated state , 2017, Molecular systems biology.