Machine Learning in Computational Biology

DEFINITION

Advances in high-throughput sequencing and “omics” technologies, and the resulting exponential growth in the amount of macromolecular sequence, structure, and gene expression data, have unleashed a transformation of biology from a data-poor science into an increasingly data-rich science. Despite these advances, biology today, much like physics before Newton and Leibniz, has remained a largely descriptive science. Machine learning [6] currently offers some of the most cost-effective tools for building predictive models from biological data, e.g., for annotating new genomic sequences, predicting macromolecular function, identifying functionally important sites in proteins, identifying genetic markers of diseases, and discovering the networks of genetic interactions that orchestrate important biological processes [3]. Advances in machine learning are critical to the transformation of biology from a descriptive science into a predictive science. Examples include improved methods for learning from highly unbalanced datasets; methods for learning complex structures of class labels (e.g., labels linked by directed acyclic graphs, as opposed to one of several mutually exclusive labels) from richly structured data such as macromolecular sequences and 3-dimensional molecular structures; and reliable methods for assessing the performance of the resulting models.
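To make the point about unbalanced datasets and performance assessment concrete, the following minimal sketch compares plain accuracy with the Matthews correlation coefficient for a trivial classifier. The data are hypothetical (5 positive examples out of 100, as might arise when only a few residues in a protein are functionally important), and the function names are illustrative; they do not come from the entry or from any particular library.

```python
import math

def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, tn, fp, fn

def accuracy(tp, tn, fp, fn):
    """Fraction of examples labeled correctly."""
    return (tp + tn) / (tp + tn + fp + fn)

def matthews_corrcoef(tp, tn, fp, fn):
    """Correlation between predicted and true labels; 0.0 when undefined."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom

# Hypothetical unbalanced dataset: 5 positives, 95 negatives.
y_true = [1] * 5 + [0] * 95

# A classifier that never predicts the positive class.
always_negative = [0] * 100
counts = confusion_counts(y_true, always_negative)

print("accuracy:", accuracy(*counts))
print("MCC:", matthews_corrcoef(*counts))
```

On this hypothetical data the trivial classifier scores a high accuracy while its correlation with the true labels is zero, which is one reason accuracy alone is a poor guide when classes are highly unbalanced.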

[1]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[2]  Agathoniki Trigoni,et al.  A drift-tolerant model for data management in ocean sensor networks , 2007, MobiDE '07.

[3]  Clement H. C. Leung,et al.  Benchmarking for Content-Based Visual Information Search , 2000, VISUAL.

[4]  F. Bruggeman,et al.  The nature of systems biology. , 2007, Trends in microbiology.

[5]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[6]  Christos Faloutsos,et al.  Fast Nearest Neighbor Search in Medical Image Databases , 1996, VLDB.

[7]  Arie Shoshani,et al.  OLAP and statistical databases: similarities and differences , 1997, PODS '97.

[8]  John Wilkes,et al.  An introduction to disk drive modeling , 1994, Computer.

[9]  Sang Lyul Min,et al.  On the existence of a spectrum of policies that subsumes the least recently used (LRU) and least frequently used (LFU) policies , 1999, SIGMETRICS '99.

[10]  Gottfried Vossen,et al.  Transactional Information Systems: Theory, Algorithms, and the Practice of Concurrency Control and Recovery , 2002 .

[11]  Erhard Rahm,et al.  Rondo: a programming platform for generic model management , 2003, SIGMOD '03.

[12]  Shahram Ghandeharizadeh,et al.  Trading memory for disk bandwidth in video-on-demand servers , 1998, SAC '98.

[13]  Alexander H. Waibel,et al.  Multimodal interfaces , 1996, Artificial Intelligence Review.

[14]  Angelo Chianese,et al.  Managing Uncertainties in Image Databases: A Fuzzy Approach , 2004, Multimedia Tools and Applications.

[15]  Ralf Hartmut Güting,et al.  BerlinMOD: a benchmark for moving object databases , 2009, The VLDB Journal.

[16]  Carlo Meghini,et al.  A model of multimedia information retrieval , 2001, JACM.

[17]  Thierry Pun,et al.  Performance evaluation in content-based image retrieval: overview and proposals , 2001, Pattern Recognit. Lett..

[18]  Adele E. Howe,et al.  Experiences with selecting search engines using metasearch , 1997, TOIS.

[19]  Dirk Grunwald,et al.  The Case for Massive Arrays of Idle Disks (MAID) , 2002 .

[20]  Shih-Fu Chang,et al.  Transform features for texture classification and discrimination in large image databases , 1994, Proceedings of 1st International Conference on Image Processing.

[21]  Yueting Zhuang,et al.  Content-based retrieval of Flash™ movies: research issues, generic framework, and future directions , 2007, Multimedia Tools and Applications.

[22]  Euripides G. M. Petrakis,et al.  Similarity Searching in Medical Image Databases , 1997, IEEE Trans. Knowl. Data Eng..

[23]  Ilya Shmulevich,et al.  On Learning Gene Regulatory Networks Under the Boolean Network Model , 2003, Machine Learning.

[24]  Kamesh Munagala,et al.  A Sampling-Based Approach to Optimizing Top-k Queries in Sensor Networks , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[25]  Arnon Rosenthal,et al.  Anatomy of a Modular Multiple Query Optimizer , 1988, VLDB.

[26]  Nicu Sebe,et al.  Multimodal Human Computer Interaction: A Survey , 2005, ICCV-HCI.

[27]  Gerhard Weikum,et al.  Principles and realization strategies of multilevel transaction management , 1991, TODS.

[28]  Pierangela Samarati,et al.  Protecting Respondents' Identities in Microdata Release , 2001, IEEE Trans. Knowl. Data Eng..

[29]  Ron Rymon,et al.  Search through Systematic Set Enumeration , 1992, KR.

[30]  Alberto Abelló,et al.  Research in data warehouse modeling and design: dead or alive? , 2006, DOLAP '06.

[31]  Panos K. Chrysanthis,et al.  MINT Views: Materialized In-Network Top-k Views in Sensor Networks , 2007, 2007 International Conference on Mobile Data Management.

[32]  Hans-Jörg Schek,et al.  Concepts and Applications of Multilevel Transactions and Open Nested Transactions , 1992, Database Transaction Models for Advanced Applications.

[33]  C. Mohan,et al.  Caching Technologies for Web Applications , 2001, VLDB.

[34]  George Buchanan,et al.  Improving Web Interaction on Small Displays , 1999, Comput. Networks.

[35]  V. S. Subrahmanian,et al.  A multimedia presentation algebra , 1999, SIGMOD '99.

[36]  James J. Little,et al.  Automatic extraction of Irregular Network digital terrain models , 1979, SIGGRAPH.

[37]  Torben Bach Pedersen,et al.  Extending Practical Pre-Aggregation in On-Line Analytical Processing , 1999, VLDB.

[38]  Hugues Hoppe,et al.  Progressive meshes , 1996, SIGGRAPH.

[39]  Matthias Jarke,et al.  DWQ: ESPRIT Long Term Research Project, No 22469. Data Warehouse Quality: A Review of the DWQ Project , 2022 .

[40]  Tran Cao Son,et al.  Design and implementation of display specification for multimedia answers , 1998, Proceedings 14th International Conference on Data Engineering.

[41]  Kamran Mohseni,et al.  SensorFlock: an airborne wireless sensor network of micro-air vehicles , 2007, SenSys '07.

[42]  Gerhard Weikum,et al.  The LRU-K page replacement algorithm for database disk buffering , 1993, SIGMOD Conference.

[43]  Gerhard Weikum,et al.  Multi-level recovery , 1990, PODS.

[44]  K. Selçuk Candan,et al.  On Similarity Measures for Multimedia Database Applications , 2001, Knowledge and Information Systems.

[45]  Sridhar Ramaswamy,et al.  Join synopses for approximate query answering , 1999, SIGMOD '99.

[46]  Donald F. Towsley,et al.  Providing VCR capabilities in large-scale video servers , 1994, MULTIMEDIA '94.

[47]  Dean Kuo,et al.  Model and verification of a data manager based on ARIES , 1992, TODS.

[48]  Torben Bach Pedersen,et al.  Multidimensional Database Technology , 2001, Computer.

[49]  Timos K. Sellis,et al.  Multiple-query optimization , 1988, TODS.

[50]  Shahram Ghandeharizadeh,et al.  Minimizing start-up latency in scalable continuous media servers , 1997, Electronic Imaging.

[51]  Nacéra Bennacer,et al.  Semantic Mappings in Description Logics for Spatio-temporal Database Schema Integration , 2005, J. Data Semant..

[52]  Ellen M. Voorhees,et al.  TREC: Experiment and Evaluation in Information Retrieval (Digital Libraries and Electronic Publishing) , 2005 .

[53]  Joon Ho Lee,et al.  Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.

[54]  Philip A. Bernstein,et al.  Categories and Subject Descriptors: H.2.4 [Database Management]: Systems. , 2022 .

[55]  Ronald Fagin,et al.  Composing schema mappings: second-order dependencies to the rescue , 2004, PODS 2004.

[56]  Banu Özden,et al.  Demand paging for video-on-demand servers , 1995, Proceedings of the International Conference on Multimedia Computing and Systems.

[57]  Ramesh C Agarwal,et al.  Depth first generation of long patterns , 2000, KDD '00.

[58]  Christos Faloutsos,et al.  Efficient and effective Querying by Image Content , 1994, Journal of Intelligent Information Systems.

[59]  King-Lup Liu,et al.  Evaluation of Result Merging Strategies for Metasearch Engines , 2005, WISE.

[60]  Alan H. Barr,et al.  Accurate triangulations of deformed, intersecting surfaces , 1987, SIGGRAPH.

[61]  Georg Lausen Formal aspects of optimistic concurrency control in a multiple version database system , 1983, Inf. Syst..

[62]  P. Venkat Rangan,et al.  Efficient Storage Techniques for Digital Continuous Multimedia , 1993, IEEE Trans. Knowl. Data Eng..

[63]  K. Selçuk Candan,et al.  View management in multimedia databases , 2000, The VLDB Journal.

[64]  Harrick M. Vin,et al.  Design and performance tradeoffs in clustered video servers , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[65]  Ilaria Bartolini,et al.  WARP: accurate retrieval of shapes using phase of Fourier descriptors and time warping distance , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[66]  Kristofer S. J. Pister,et al.  CotsBots: an off-the-shelf platform for distributed robotics , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[67]  Shahram Ghandeharizadeh,et al.  Staggered striping in multimedia information systems , 1994, SIGMOD '94.

[68]  Jung-Hwan Oh,et al.  STRG-Index: spatio-temporal region graph indexing for large video databases , 2005, SIGMOD '05.

[69]  Philip A. Bernstein,et al.  Model management 2.0: manipulating richer mappings , 2007, SIGMOD '07.

[70]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[71]  Christian S. Jensen,et al.  A foundation for capturing and querying complex multidimensional data , 2001, Inf. Syst..

[72]  Shivakumar Venkataraman,et al.  Cost-based optimization of decision support queries using transient-views , 1998, SIGMOD '98.

[73]  David J. DeWitt,et al.  The SPIFFI scalable video-on-demand system , 1995, SIGMOD '95.

[74]  David B. Lomet,et al.  MLR: a recovery method for multi-level systems , 1992, SIGMOD '92.

[75]  Christos H. Papadimitriou,et al.  On concurrency control by multiple versions , 1982 .

[76]  Josep Domingo-Ferrer,et al.  A polynomial-time approximation to optimal multivariate microaggregation , 2008, Comput. Math. Appl..

[77]  Tony DeRose,et al.  Multiresolution analysis for surfaces of arbitrary topological type , 1997, TOGS.

[78]  Shahram Ghandeharizadeh,et al.  Object Placement in Parallel Hypermedia Systems , 1991, VLDB.

[79]  Oren Etzioni,et al.  The MetaCrawler architecture for resource aggregation on the Web , 1997 .

[80]  Mohamed A. Sharaf,et al.  Balancing energy efficiency and quality of aggregate data in sensor networks , 2004, The VLDB Journal.

[81]  Thomas Gerstner Multiresolution Compression and Visualization of Global Topographic Data , 2003, GeoInformatica.

[82]  Ramesh Govindan,et al.  Localized edge detection in sensor fields , 2003, Ad Hoc Networks.

[83]  Josep Domingo-Ferrer,et al.  Practical Data-Oriented Microaggregation for Statistical Disclosure Control , 2002, IEEE Trans. Knowl. Data Eng..

[84]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[85]  Tomás Skopal,et al.  On Fast Non-metric Similarity Search by Metric Access Methods , 2006, EDBT.

[86]  Philip S. Yu,et al.  Rotation invariant indexing of shapes and line drawings , 2005, CIKM '05.

[87]  Simone Santini,et al.  Similarity Measures , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[88]  Fabio Paternò,et al.  Tool support for designing nomadic applications , 2003, IUI '03.

[89]  J. Hellerstein,et al.  Data gathering tours in sensor networks , 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[90]  Sunil Prabhakar,et al.  Evaluating probabilistic queries over imprecise data , 2003, SIGMOD '03.

[91]  Arie Shoshani,et al.  An efficient compression scheme for bitmap indices , 2004 .

[92]  Ralph C. Merkle,et al.  Secrecy, authentication, and public key systems , 1979 .

[93]  Eamonn Keogh Exact Indexing of Dynamic Time Warping , 2002, VLDB.

[94]  K. Selçuk Candan,et al.  Similarity-based ranking and query processing in multimedia databases , 2000, Data Knowl. Eng..

[95]  Michel Scholl,et al.  Building a constraint-based spatial database system: model, languages, and implementation , 2003, Inf. Syst..

[96]  V. Bryant Metric Spaces: Iteration and Application , 1985 .

[97]  Kenneth P. Birman,et al.  Reliable Distributed Systems: Technologies, Web Services, and Applications , 2005 .

[98]  Margaret Martonosi,et al.  Hardware design experiences in ZebraNet , 2004, SenSys '04.

[99]  Albrecht Schmidt,et al.  In-car interaction using search-based user interfaces , 2008, CHI.

[100]  Deborah Estrin,et al.  Data-centric storage in sensornets , 2003, CCRV.

[101]  Yufei Tao,et al.  Spatial queries in dynamic environments , 2003, TODS.

[102]  George T. Duncan,et al.  Enhancing Access to Microdata while Protecting Confidentiality: Prospects for the Future , 1991 .

[103]  Richard A. Crus Data Recovery in IBM Database 2 , 1984, IBM Syst. J..

[104]  Ritei Shibata,et al.  High-dimensional data visualisation: The textile plot , 2008, Comput. Stat. Data Anal..

[105]  Giri Kumar Tayi,et al.  Examining data quality , 1998, CACM.

[106]  H. V. Jagadish,et al.  A retrieval technique for similar shapes , 1991, SIGMOD '91.

[107]  Cyril Cleverdon,et al.  The Cranfield tests on index language devices , 1997 .

[108]  Wei Hong,et al.  Model-Driven Data Acquisition in Sensor Networks , 2004, VLDB.

[109]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[110]  Ralf Hartmut Güting,et al.  Spatio-Temporal Data Types: An Approach to Modeling and Querying Moving Objects in Databases , 1999, GeoInformatica.

[111]  Christiaan J. J. Paredis,et al.  Millibots: The Development of a Framework and Algorithms for a Distributed Heterogeneous Robot Team , 2002 .

[112]  Vasant G Honavar,et al.  Predicting linear B‐cell epitopes using string kernels , 2008, Journal of molecular recognition : JMR.

[113]  Mohammed J. Zaki,et al.  Efficiently mining maximal frequent itemsets , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[114]  Thanasis Hadzilacos,et al.  Algorithmic aspects of multiversion concurrency control , 1985, PODS '85.

[115]  Alberto Del Bimbo,et al.  Visual information retrieval , 1999 .

[116]  David A. Hull Using statistical testing in the evaluation of retrieval experiments , 1993, SIGIR.

[117]  T.H.W. Westerveld,et al.  TRECVID as a Re-Usable Test-Collection for Video Retrieval , 2003 .

[118]  Matthias Jarke,et al.  GeRoMe: A Generic Role Based Metamodel for Model Management , 2005, J. Data Semant..

[119]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[120]  Gerard Salton,et al.  Automatic indexing , 1980, ACM '80.

[121]  Richard A. Bolt,et al.  “Put-that-there”: Voice and gesture at the graphics interface , 1980, SIGGRAPH '80.

[122]  Ralf Hartmut Güting,et al.  Querying Moving Objects in SECONDO , 2006, 7th International Conference on Mobile Data Management (MDM'06).

[123]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[124]  Seon Ho Kim,et al.  Striping in Multi-Disk Video Servers , 1995 .

[125]  Amarnath Gupta,et al.  Virage image search engine: an open framework for image management , 1996, Electronic Imaging.

[126]  William Ribarsky,et al.  Real-time, continuous level of detail rendering of height fields , 1996, SIGGRAPH.

[127]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[128]  D. Davidson Truth and meaning , 2004, Synthese.

[129]  L. De Floriani A pyramidal data structure for triangle-based surface description , 1989, IEEE Computer Graphics and Applications.

[130]  Timos K. Sellis,et al.  A survey of logical models for OLAP databases , 1999, SGMD.

[131]  Asit Dan,et al.  Session Scheduling and Resource Sharing in Multimedia Systems , 1996 .

[132]  Hamid Pirahesh,et al.  ARIES: a transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging , 1998 .

[133]  Yuan Yan Tang,et al.  Multimodal interface for human-machine communication , 2002 .

[134]  K. Selçuk Candan,et al.  CHIMP: a framework for supporting distributed multimedia document authoring and presentation , 1997, MULTIMEDIA '96.

[135]  Banu Özden,et al.  Buffer replacement algorithms for multimedia storage systems , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[136]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[137]  Guizhen Yang,et al.  The complexity of mining maximal frequent itemsets and maximal frequent patterns , 2004, KDD.

[138]  Ramesh C. Jain Experiential computing , 2003, CACM.

[139]  Gustavo Alonso,et al.  Atomicity and isolation for transactional processes , 2002, TODS.

[140]  Paolo Cignoni,et al.  Planet-sized batched dynamic adaptive meshes (P-BDAM) , 2003, IEEE Visualization, 2003. VIS 2003..

[141]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[142]  Ryan Newton,et al.  The pothole patrol: using a mobile sensor network for road surface monitoring , 2008, MobiSys '08.

[143]  Rüdiger Buck-Emden,et al.  SAP R/3 System: A Client/Server Technology , 1996 .

[144]  Goetz Graefe,et al.  The Volcano optimizer generator: extensibility and efficient search , 1993, Proceedings of IEEE 9th International Conference on Data Engineering.

[145]  Kevin C. Almeroth,et al.  Long Term Channel Allocation Strategies for Video Applications , 1995 .

[146]  R. Manmatha,et al.  Image retrieval by appearance , 1997, SIGIR '97.

[147]  John G. Stell,et al.  Stratified Map Spaces: A Formal Basis for Multi-resolution Spatial Databases , 2001 .

[148]  Jung Hong Chuang Level of Detail for 3D Graphics , 2002 .

[149]  Heiko Schuldt,et al.  Setting the Foundations of Digital Libraries: The DELOS Manifesto , 2007, D Lib Mag..

[150]  Neil J. Gunther,et al.  Benchmark for image retrieval using distributed systems over the Internet: BIRDS-I , 2000, IS&T/SPIE Electronic Imaging.

[151]  James H. Clark,et al.  Hierarchical geometric models for visible surface algorithms , 1976, CACM.

[152]  Vasant Honavar,et al.  Predicting DNA-binding sites of proteins from amino acid sequence , 2006, BMC Bioinformatics.

[153]  Multimodal interaction, communication and navigation guidelines , 2022 .

[154]  Tomás Skopal,et al.  Improving the Performance of M-Tree Family by Nearest-Neighbor Graphs , 2007, ADBIS.

[155]  Philip A. Bernstein,et al.  Concurrency Control in Distributed Database Systems , 1986, CSUR.

[156]  Alberto H. F. Laender,et al.  OMT-G: An Object-Oriented Data Model for Geographic Applications , 2001, GeoInformatica.

[157]  Lynda Hardman,et al.  That Obscure Object of Desire: Multimedia Metadata on the Web, Part 1 , 2004, IEEE Multim..

[158]  Arie Segev,et al.  Using common subexpressions to optimize multiple queries , 1988, Proceedings. Fourth International Conference on Data Engineering.

[159]  Marc H. Graham,et al.  Abstraction in recovery management , 1986, SIGMOD '86.

[160]  August-Wilhelm Scheer,et al.  ARIS — Architecture of Integrated Information Systems , 1992 .

[161]  Torben Bach Pedersen,et al.  Incomplete Information in Multidimensional Databases , 2003, Multidimensional Databases.

[162]  Rajeev Rastogi,et al.  Independence is good: dependency-based histogram synopses for high-dimensional data , 2001, SIGMOD '01.

[163]  D. Burago,et al.  A Course in Metric Geometry , 2001 .

[164]  Amol Deshpande,et al.  Online Filtering, Smoothing and Probabilistic Modeling of Streaming data , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[165]  Ben Taskar,et al.  Selectivity estimation using probabilistic models , 2001, SIGMOD '01.

[166]  Wei Hong,et al.  The design of an acquisitional query processor for sensor networks , 2003, SIGMOD '03.

[167]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[168]  Thomas Rist,et al.  A standard reference model for intelligent multimedia presentation systems , 1997, Comput. Stand. Interfaces.

[169]  Vasant Honavar,et al.  Exploring inconsistencies in genome-wide protein function annotations: a machine learning approach , 2007, BMC Bioinformatics.

[170]  Dirk Grunwald,et al.  Massive Arrays of Idle Disks For Storage Archives , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[171]  Jean-Daniel Zucker,et al.  How to Integrate Heterogeneous Spatial Databases in a Consistent Way? , 2004, ADBIS.

[172]  Randall H. Trigg,et al.  Design issues for a Dexter-based hypermedia system , 1994, CACM.

[173]  Andrea J. Borr Robustness to Crash in a Distributed Database: A Non Shared-memory Multi-Processor Approach , 1984, VLDB.

[174]  Kenneth A. Ross,et al.  Reusing invariants: a new strategy for correlated queries , 1998, SIGMOD '98.

[175]  Justin Zobel,et al.  How reliable are the results of large-scale information retrieval experiments? , 1998, SIGIR '98.

[176]  C. Lee Giles,et al.  Inquirus, the NECI Meta Search Engine , 1998, Comput. Networks.

[177]  Pavel Zezula,et al.  Similarity Search - The Metric Space Approach , 2005, Advances in Database Systems.

[178]  Stefan Wirag Modeling of Adaptive Multimedia Documents , 1997, IDMS.

[179]  Erhard Rahm,et al.  Schema and ontology matching with COMA++ , 2005, SIGMOD '05.

[180]  Hannu Toivonen,et al.  Finding Frequent Substructures in Chemical Compounds , 1998, KDD.

[181]  Ronald Fagin,et al.  Fuzzy queries in multimedia database systems , 1998, PODS '98.

[182]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[183]  Andreas Reuter,et al.  Transaction Processing: Concepts and Techniques , 1992 .

[184]  Philip A. Bernstein,et al.  A Sophisticate's Introduction to Distributed Database Concurrency Control , 1982 .

[185]  King-Lup Liu,et al.  Building efficient and effective metasearch engines , 2002, CSUR.

[186]  Colin Potts,et al.  Design of Everyday Things , 1988 .

[187]  Shahram Ghandeharizadeh,et al.  On Minimizing Startup Latency in Scalable Continuous Media Servers , 1996 .

[188]  Asit Dan,et al.  Buffering and caching in large-scale video servers , 1995, Digest of Papers. COMPCON'95. Technologies for the Information Superhighway.

[189]  Thomas G. Dietterich Machine Learning for Sequential Data: A Review , 2002, SSPR/SPR.

[190]  Subhash Suri,et al.  Surface approximation and geometric partitions , 1994, SODA '94.

[191]  Sébastien Mustière,et al.  Database Requirements for Generalisation and Multiple Representations , 2007 .

[192]  Fouad A. Tobagi,et al.  Streaming RAID: a disk array management system for video files , 1993, MULTIMEDIA '93.

[193]  Zvi M. Kedem,et al.  Pincer-Search: A New Algorithm for Discovering the Maximum Frequent Set , 1998, EDBT.

[194]  Vasant Honavar,et al.  Assessing the Performance of Macromolecular Sequence Classifiers , 2007, 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering.

[195]  Chitta Baral,et al.  SQL+D: extended display capabilities for multimedia database queries , 1998, MULTIMEDIA '98.

[196]  Yong Li,et al.  GeRoMeSuite: A System for Holistic Generic Model Management , 2007, VLDB.

[197]  Julian Jang,et al.  Isolation Support for Service-based Applications: A Position Paper , 2007, CIDR.

[198]  J. A. Hartigan,et al.  Mosaics for Contingency Tables , 1981 .

[199]  Stavros Christodoulakis,et al.  An object oriented architecture for multimedia information systems , 1991 .

[200]  Shahram Ghandeharizadeh,et al.  Design of Multi-User Editing Servers for Continuous Media , 2004, Multimedia Tools and Applications.

[201]  Hans-Peter Kriegel,et al.  Optimal multi-step k-nearest neighbor search , 1998, SIGMOD '98.

[202]  Mohan S. Kankanhalli,et al.  Benchmarking Multimedia Databases , 1997, Multimedia Tools and Applications.

[203]  Alan L. Cox,et al.  A comparative evaluation of transparent scaling techniques for dynamic content servers , 2005, 21st International Conference on Data Engineering (ICDE'05).

[204]  Markus Schneider,et al.  A foundation for representing and querying moving objects , 2000, TODS.

[205]  Juha-Pekka Tolvanen,et al.  MetaEdit+: integrated modeling and metamodeling environment for domain-specific languages , 2006, OOPSLA '06.

[206]  Michael Y. Galperin The Molecular Biology Database Collection: 2008 update , 2007, Nucleic Acids Res..

[207]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[208]  Joshua R. Smith,et al.  Image retrieval evaluation , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[209]  Brian P. Bailey,et al.  Nsync—a toolkit for building interactive multimedia presentations , 1998, MULTIMEDIA '98.

[210]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[211]  Donna K. Harman,et al.  Overview of the Eighth Text REtrieval Conference (TREC-8) , 1999, TREC.

[212]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[213]  Clement T. Yu,et al.  A highly scalable and effective method for metasearch , 2001, TOIS.

[214]  Srinivasan Parthasarathy,et al.  New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[215]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[216]  Yizhong Fan,et al.  Adaptive Agents for Information Gathering from Multiple, Distributed Information Sources , 1999 .

[217]  Paul Over,et al.  The TREC2001 Video Track: Information Retrieval on Digital Video Information , 2002, ECDL.

[218]  Stefano Spaccapietra,et al.  Conceptual modeling for traditional and spatio-temporal applications - the MADS approach , 2006 .

[219]  William Stallings,et al.  Cryptography and Network Security: Principles and Practice , 1998 .

[220]  Ramesh C. Jain,et al.  ACM SIGMM retreat report on future directions in multimedia research , 2005, TOMCCAP.

[221]  Erhard Rahm,et al.  Data Warehouse Scenarios for Model Management , 2000, ER.

[222]  Gaurav S. Sukhatme,et al.  Robomote: enabling mobility in sensor networks , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[223]  Randall H. Trigg,et al.  Design issues for a Dexter-based hypermedia system , 1992, ECHT '92.

[224]  Marinette Savonnet,et al.  Do we need metamodels AND ontologies for engineering platforms? , 2006, GaMMa '06.

[225]  J. Leon Zhao,et al.  Buffer management for video database systems , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[226]  Abraham Silberschatz,et al.  Kernel Support for Recoverable-Persistent Virtual Memory , 1993, USENIX MACH Symposium.

[227]  Diego Calvanese,et al.  The Description Logic Handbook , 2007 .

[228]  Polle Zellweger,et al.  Automatically generating consistent schedules for multimedia documents , 1993, Multimedia Systems.

[229]  Moni Naor,et al.  Optimal aggregation algorithms for middleware , 2001, PODS '01.

[230]  Victor R. Basili,et al.  A meta-model for software development resource expenditures , 1981, ICSE '81.

[231]  Yang Zhang,et al.  CarTel: a distributed mobile sensor computing system , 2006, SenSys '06.

[232]  Timos K. Sellis,et al.  Improvements on a Heuristic Algorithm for Multiple-Query Optimization , 1994, Data Knowl. Eng..

[233]  Enrico Puppo Variable Resolution Terrain Surfaces , 1996, CCCG.

[234]  Leland Wilkinson The Grammar of Graphics , 1999 .

[235]  Hans-Jörg Schek,et al.  Architectural Issues of Transaction Management in Multi-Layered Systems , 1984, VLDB.

[236]  Yong Yao,et al.  The cougar approach to in-network query processing in sensor networks , 2002, SGMD.

[237]  Paolo Ciaccia,et al.  Searching in metric spaces with user-defined and approximate distances , 2002 .

[238]  Vijay V. Raghavan,et al.  Fully automatic wrapper generation for search engines , 2005, WWW '05.

[239]  Chung Laung Liu,et al.  Scheduling Algorithms for Multiprogramming in a Hard-Real-Time Environment , 1973, JACM.

[240]  Richard A. Becker,et al.  The Visual Design and Control of Trellis Display , 1996 .

[241]  L. Sarjakoski Conceptual Models of Generalisation and Multiple Representation , 2007 .

[242]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[243]  George Karypis,et al.  Frequent Substructure-Based Approaches for Classifying Chemical Compounds , 2005, IEEE Trans. Knowl. Data Eng..

[244]  K. Selçuk Candan,et al.  SEMCOG: A Hybrid Object-based Image and Video Database System and Its Modeling, Language, and Query Processing , 1999, Theory Pract. Object Syst..

[245]  Gultekin Özsoyoglu,et al.  Querying Multimedia Presentations Based on Content , 1999, IEEE Trans. Knowl. Data Eng..

[246]  Gerhard Weikum,et al.  Multi-level transaction management for complex objects: Implementation, performance, parallelism , 1993, VLDB J..

[247]  Peter Thanisch,et al.  Logical Multidimensional Database Design for Ragged and Unbalanced Aggregation , 2001, DMDW.

[248]  Deborah F. Swayne,et al.  Interactive and Dynamic Graphics for Data Analysis - With R and GGobi , 2007, Use R.

[249]  John Anderson,et al.  An analysis of a large scale habitat monitoring application , 2004, SenSys '04.

[250]  Erik Duval,et al.  Metadata Principles and Practicalities , 2002, D Lib Mag..
