Semi-Supervised Learning for Improving Prediction of HIV Drug Resistance

Abstract: Resistance testing is an important tool in today’s anti-HIV therapy management for improving the success of antiretroviral therapy. Routinely, the genetic sequence of viral target proteins is obtained. These sequences are then inspected for mutations that might confer resistance to antiretroviral drugs. However, interpretation of the genomic data is challenging. In recent years, approaches that employ supervised statistical learning methods were made available to assist the interpretation of the complex genetic information (e.g. geno2pheno and VircoTYPE). However, these methods rely on large amounts of labeled training data, which are expensive and labor-intensive to obtain. This work evaluates the application of semi-supervised learning (SSL) for improving the prediction of resistance from the viral genome.

[1]  B. Schmidt,et al.  Rapid, phenotypic HIV-1 drug sensitivity assay for protease and reverse transcriptase inhibitors. , 1999, Journal of clinical virology : the official publication of the Pan American Society for Clinical Virology.

[2]  Joachim Selbig,et al.  HIV-1 Drug Resistance Prediction and Therapy Optimization: A Case Study for the Application of Classification and Clustering Methods , 2009, Similarity-Based Clustering.

[3]  Joachim Büch,et al.  Arevir: A Secure Platform for Designing Personalized Antiretroviral Therapies Against HIV , 2006, DILS.

[4]  Thomas Lengauer,et al.  Geno2pheno: estimating phenotypic drug resistance from HIV-1 genotypes , 2003, Nucleic Acids Res..

[5]  Ulf Brefeld,et al.  Co-EM support vector learning , 2004, ICML.

[6]  Alexander Gammerman,et al.  Ridge Regression Learning Algorithm in Dual Variables , 1998, ICML.

[7]  Thomas Lengauer,et al.  Selecting anti-HIV therapies based on a variety of genomic and clinical factors , 2008, ISMB.

[8]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[9]  Alexander Zien,et al.  Semi-Supervised Classification by Low Density Separation , 2005, AISTATS.

[10]  Bryan Chan,et al.  Human immunodeficiency virus reverse transcriptase and protease sequence database , 2003, Nucleic Acids Res..

[11]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[12]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[13]  L. Bacheler,et al.  Prediction of HIV-1 drug susceptibility phenotype from the viral genotype using linear regression modeling. , 2007, Journal of virological methods.