Systematic identification of correlates of HIV infection: an X-wide association study

Objective: Better identification of at-risk groups could benefit HIV-1 care programmes. We systematically identified HIV-1 risk factors in two nationally representative cohorts of women in the Demographic and Health Surveys. Methods: We identified and replicated the association of 1415 social, economic, environmental, and behavioral factors with HIV-1 status. We used the 2007 and 2013–2014 surveys conducted among 5715 and 15 433 Zambian women, respectively (688 shared factors). We used false discovery rate criteria to identify factors that are strongly associated with HIV-1 in univariate and multivariate models of the entire population, as well as in subgroups stratified by wealth, residence, age, and past HIV-1 testing. Results: In the univariate analysis, we identified 102 and 182 variables that are associated with HIV-1 in the two surveys, respectively (79 factors were associated in both). Factors that were associated with HIV-1 status in full-sample analyses and in subgroups include being formerly married (adjusted OR 2007, 2.8, P < 10−16; 2013–2014 2.8, P < 10−29), widowhood (aOR 3.7, P < 10−12; and 4.2, P < 10−30), genital ulcers within 12 months (aOR 2.4, P < 10−5; and 2.2, P < 10−6), and having a woman head of the household (aOR 1.7, P < 10−7; and 2.1, P < 10−26), while owning a bicycle (aOR 0.6, P < 10−6; and 0.6, P < 10−8) and currently breastfeeding (aOR 0.5, P < 10−9; and 0.4, P < 10−26) were associated with decreased risk. Area under the curve for HIV-1 positivity was 0.76–0.82. Conclusion: Our X-wide association study identifies under-recognized factors related to HIV-1 infection, including widowhood, breastfeeding, and gender of head of the household. These features could be used to improve HIV-1 identification programs.

[1]  Atul J. Butte,et al.  A Nutrient-Wide Association Study on Blood Pressure , 2012, Circulation.

[2]  T. Hallett,et al.  Maximising the effect of combination HIV prevention through prioritisation of the people and places in greatest need: a modelling study , 2014, The Lancet.

[3]  J. Ioannidis,et al.  The False-positive to False-negative Ratio in Epidemiologic Studies , 2011, Epidemiology.

[4]  Atul J. Butte,et al.  An Environment-Wide Association Study (EWAS) on Type 2 Diabetes Mellitus , 2010, PloS one.

[5]  John P A Ioannidis,et al.  Studying the elusive environment in large scale. , 2014, JAMA.

[6]  S. Lewin,et al.  The end of AIDS: HIV infection as a chronic disease , 2013, The Lancet.

[7]  S. Kalichman,et al.  Risk Factors for HIV-AIDS Among Youth in Cape Town, South Africa , 2005, AIDS and Behavior.

[8]  Atul J Butte,et al.  Systematic evaluation of environmental factors: persistent pollutants and nutrients correlated with serum lipid levels , 2012, International journal of epidemiology.

[9]  C. Y. Peng,et al.  Principled missing data methods for researchers , 2013, SpringerPlus.

[10]  P. Visscher,et al.  Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits , 2012, Nature Genetics.

[11]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[12]  H. White A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[13]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[14]  Chirag J Patel,et al.  Systematic evaluation of environmental and behavioural factors associated with all-cause mortality in the United States national health and nutrition examination survey. , 2013, International journal of epidemiology.

[15]  Kiersten B. Johnson,et al.  The DHS wealth index , 2004 .

[16]  John P A Ioannidis,et al.  Placing epidemiological results in the context of multiplicity and typical correlations of exposures , 2014, Journal of Epidemiology & Community Health.

[17]  John P A Ioannidis,et al.  Systematic assessment of the correlations of household income with infectious, biochemical, physiological, and environmental factors in the United States, 1999-2006. , 2015, American journal of epidemiology.

[18]  John P A Ioannidis,et al.  Exposure‐wide epidemiology: revisiting Bradford Hill , 2016, Statistics in medicine.

[19]  S. Staveteig,et al.  Demographic patterns of HIV testing uptake in sub-Saharan Africa , 2013 .

[20]  M. Wawer,et al.  HIV risk factors in three geographic strata of rural Rakai District, Uganda , 1992, AIDS.

[21]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[22]  G. Rutherford,et al.  Kenya AIDS Indicator Survey 2012. , 2014, Journal of acquired immune deficiency syndromes.