Baseline Regularization for Computational Drug Repositioning with Longitudinal Observational Data

Computational Drug Repositioning (CDR) is the knowledge discovery process of finding new indications for existing drugs leveraging heterogeneous drug-related data. Longitudinal observational data such as Electronic Health Records (EHRs) have become an emerging data source for CDR. To address the high-dimensional, irregular, subject and time-heterogeneous nature of EHRs, we propose Baseline Regularization (BR) and a variant that extend the one-way fixed effect model, which is a standard approach to analyze small-scale longitudinal data. For evaluation, we use the proposed methods to search for drugs that can lower Fasting Blood Glucose (FBG) level in the Marshfield Clinic EHR. Experimental results suggest that the proposed methods are capable of rediscovering drugs that can lower FBG level as well as identifying some potential blood sugar lowering drugs in the literature.

[1]  JAMA Internal Medicine. , 2017, JAMA internal medicine.

[2]  David Page,et al.  Computational Drug Repositioning Using Continuous Self-Controlled Case Series , 2016, KDD.

[3]  Grey Giddins,et al.  Statistics , 2016, The Journal of hand surgery, European volume.

[4]  Sandra Lowe,et al.  Longitudinal And Panel Data Analysis And Applications In The Social Sciences , 2016 .

[5]  J. Fournier,et al.  Tramadol use and the risk of hospitalization for hypoglycemia in patients with noncancer pain. , 2015, JAMA internal medicine.

[6]  Ying Li,et al.  Validating drug repurposing signals using electronic health records: a case study of metformin associated with reduced cancer mortality , 2014, J. Am. Medical Informatics Assoc..

[7]  J. Mitri,et al.  Vitamin D and diabetes. , 2014, Endocrinology and metabolism clinics of North America.

[8]  David Madigan,et al.  Multiple Self‐Controlled Case Series for Large‐Scale Longitudinal Observational Databases , 2013, Biometrics.

[9]  Zhiyong Lu,et al.  Pathway-based drug repositioning using causal inference , 2013, BMC Bioinformatics.

[10]  Louiqa Raschid,et al.  Drug-target interaction prediction for drug repurposing with probabilistic similarity logic , 2013, BioKDD '13.

[11]  A. Martelli,et al.  A Case Report on Escitalopram-Induced Hyperglycaemia in a Diabetic Patient , 2013, International journal of psychiatry in medicine.

[12]  M. Persson,et al.  Using Lasso-Type Penalties to Model Time-Varying Covariate Effects in Panel Data Regressions – A Novel Approach Illustrated by the 'Death of Distance' In International Trade , 2013 .

[13]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[14]  A. Tiryaki,et al.  The effects of sertraline on blood lipids, glucose, insulin and HBA1C levels: A prospective clinical trial on depressive patients , 2011, Journal of research in medical sciences : the official journal of Isfahan University of Medical Sciences.

[15]  George Karypis,et al.  Proceedings of the 12th International Workshop on Data Mining in Bioinformatics , 2011, KDD 2013.

[16]  F. Hu,et al.  Effects of vitamin D and calcium supplementation on pancreatic β cell function, insulin sensitivity, and glycemia in adults at high risk of diabetes: the Calcium and Vitamin D for Diabetes Mellitus (CaDDM) randomized controlled trial. , 2011, The American journal of clinical nutrition.

[17]  Vassilis Virvilis,et al.  Literature mining, ontologies and information visualization for drug repurposing , 2011, Briefings Bioinform..

[18]  R. Tibshirani,et al.  The solution path of the generalized lasso , 2010, 1005.1971.

[19]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[20]  Jan de Leeuw,et al.  Journal of Statistical Software , 2009 .

[21]  B. Kestenbaum,et al.  Calcium plus vitamin D supplementation and the risk of incident diabetes mellitus in the Women ’ s Health Initiative , 2008 .

[22]  Greet van den Berghe Md Endocrinology and Metabolism Clinics of North America , 2007 .

[23]  G. Saridis,et al.  Journal of Optimization Theory and Applications Approximate Solutions to the Time-invariant Hamilton-jacobi-bellman Equation 1 , 1998 .

[24]  K. Chow,et al.  Risk factors of vitamin B(12) deficiency in patients receiving metformin. , 2006, Archives of internal medicine.

[25]  G. Ko,et al.  Effects of age on plasma glucose levels in non-diabetic Hong Kong Chinese. , 2006, Croatian medical journal.

[26]  P. Shah,et al.  Propoxyphene-induced hypoglycemia in renal failure. , 2006, Endocrine practice : official journal of the American College of Endocrinology and the American Association of Clinical Endocrinologists.

[27]  BMC Bioinformatics , 2005 .

[28]  R. Tibshirani,et al.  On the “degrees of freedom” of the lasso , 2007, 0712.0881.

[29]  Edward W. Frees,et al.  Longitudinal and Panel Data , 2004 .

[30]  Z. Oşar,et al.  Fenofibrate treatment is associated with better glycemic control and lower serum leptin and insulin levels in type 2 diabetic patients with hypertriglyceridemia. , 2003, European journal of internal medicine.

[31]  P. Tseng Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization , 2001 .

[32]  James W. Anderson,et al.  Effects of psyllium on glucose and serum lipid responses in men with type 2 diabetes and hypercholesterolemia. , 1999, The American journal of clinical nutrition.

[33]  A. Marušić,et al.  Croatian Medical Journal and the war. , 1998, The National medical journal of India.

[34]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[35]  J. O'sullivan Age Gradient in Blood Glucose Levels: Magnitude and Clinical Implications , 1974, Diabetes.

[36]  B F CHOW,et al.  The relationship of vitamin B12 to carbohydrate metabolism and diabetes mellitus. , 1957, The American journal of clinical nutrition.

[37]  R. Pearl Biometrics , 1914, The American Naturalist.