Optimized Feature Subset Selection Using Genetic Algorithm for Preterm Labor Prediction Based on Electrohysterography

Electrohysterography (EHG) has emerged as an alternative technique to predict preterm labor, which still remains a challenge for the scientific-technical community. Based on EHG parameters, complex classification algorithms involving non-linear transformation of the input features, which clinicians found difficult to interpret, were generally used to predict preterm labor. We proposed to use genetic algorithm to identify the optimum feature subset to predict preterm labor using simple classification algorithms. A total of 203 parameters from 326 multichannel EHG recordings and obstetric data were used as input features. We designed and validated 3 base classifiers based on k-nearest neighbors, linear discriminant analysis and logistic regression, achieving F1-score of 84.63 ± 2.76%, 89.34 ± 3.5% and 86.87 ± 4.53%, respectively, for incoming new data. The results reveal that temporal, spectral and non-linear EHG parameters computed in different bandwidths from multichannel recordings provide complementary information on preterm labor prediction. We also developed an ensemble classifier that not only outperformed base classifiers but also reduced their variability, achieving an F1-score of 92.04 ± 2.97%, which is comparable with those obtained using complex classifiers. Our results suggest the feasibility of developing a preterm labor prediction system with high generalization capacity using simple easy-to-interpret classification algorithms to assist in transferring the EHG technique to clinical practice.

[1]  Javier Mas-Cabo,et al.  Uterine electromyography for discrimination of labor imminence in women with threatened preterm labor under tocolytic treatment , 2018, Medical & Biological Engineering & Computing.

[2]  N. Lackey,et al.  Making Sense of Factor Analysis: The Use of Factor Analysis for Instrument Development in Health Care Research , 2003 .

[3]  Yiyao Ye-Lin,et al.  Prediction of labor onset type: Spontaneous vs induced; role of electrohysterography? , 2017, Comput. Methods Programs Biomed..

[4]  Marimuthu Palaniswami,et al.  Do existing measures of Poincare plot geometry reflect nonlinear features of heart rate variability? , 2001, IEEE Transactions on Biomedical Engineering.

[5]  Michel Verleysen,et al.  A Comparison of Multivariate Mutual Information Estimators for Feature Selection , 2012, ICPRAM.

[6]  C Marque,et al.  Uterine electromyography: a critical review. , 1993, American journal of obstetrics and gynecology.

[7]  Javier Mas-Cabo,et al.  A Comparative Study of Vaginal Labor and Caesarean Section Postpartum Uterine Myoelectrical Activity , 2020, Sensors.

[8]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[9]  G. Fele-Zorz,et al.  A comparison of various linear and non-linear signal processing techniques to separate uterine EMG records of term and pre-term delivery groups , 2008, Medical & Biological Engineering & Computing.

[10]  Catherine Marque,et al.  Surveillance des grossesses à risque par électromyographie utérine , 1995 .

[11]  Marta Borowska,et al.  Early diagnosis of threatened premature labor by electrohysterographic recordings – The use of digital signal processing , 2016 .

[12]  Oleg Okun Feature Selection and Ensemble Methods for Bioinformatics: Algorithmic Classification and Implementations , 2011 .

[13]  Tito G. Amaral,et al.  Genetic algorithm based optimization for EMG pattern recognition system , 2009 .

[14]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[15]  Habibollah Haron,et al.  Performance comparison of Genetic Algorithm, Differential Evolution and Particle Swarm Optimization towards benchmark functions , 2013, 2013 IEEE Conference on Open Systems (ICOS).

[16]  J Garcia-Casado,et al.  Electrohysterogram for ANN-Based Prediction of Imminent Labor in Women with Threatened Preterm Labor Undergoing Tocolytic Therapy , 2020, Sensors.

[17]  S. Awasthi,et al.  Interplay of cytokines in preterm birth , 2017, The Indian journal of medical research.

[18]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[19]  Mohamad Khalil,et al.  Comparison of Feature selection for Monopolar and Bipolar EHG signal , 2015 .

[20]  Javier Mas-Cabo,et al.  Robust Characterization of the Uterine Myoelectrical Activity in Different Obstetric Scenarios , 2020, Entropy.

[21]  David Beasley,et al.  An overview of genetic algorithms: Part 1 , 1993 .

[22]  Catherine Marque,et al.  Comparison of Different EHG Feature Selection Methods for the Detection of Preterm Labor , 2013, Comput. Math. Methods Medicine.

[23]  Ian H. Witten,et al.  Issues in Stacked Generalization , 2011, J. Artif. Intell. Res..

[24]  M. Klebanoff,et al.  The epidemiology, etiology, and costs of preterm birth. , 2016, Seminars in fetal & neonatal medicine.

[25]  Robert J. Trotter Born Too Soon , 1980 .

[26]  V. Berghella,et al.  Fetal fibronectin testing for reducing the risk of preterm birth. , 2008, The Cochrane database of systematic reviews.

[27]  Javier Garcia-Casado,et al.  Feasibility and Analysis of Bipolar Concentric Recording of Electrohysterogram with Flexible Active Electrode , 2014, Annals of Biomedical Engineering.

[28]  N. Lackey,et al.  Making Sense of Factor Analysis , 2003 .

[29]  M. J. Katz,et al.  Fractals and the analysis of waveforms. , 1988, Computers in biology and medicine.

[30]  Jaime S. Cardoso,et al.  Machine Learning Interpretability: A Survey on Methods and Metrics , 2019, Electronics.

[31]  Daoqiang Zhang,et al.  Ensemble sparse classification of Alzheimer's disease , 2012, NeuroImage.

[32]  William L. Maner,et al.  Characterization of abdominally acquired uterine electrical signals in humans, using a non-linear analytic method , 2006, Medical and Biological Engineering and Computing.

[33]  David B. Beasley,et al.  An overview of genetic algorithms: Part 1 , 1993 .

[34]  W. Maner,et al.  Physiology and electrical activity of uterine contractions. , 2007, Seminars in cell & developmental biology.

[35]  E. Miller,et al.  Predicting preterm birth: Cervical length and fetal fibronectin. , 2017, Seminars in perinatology.

[36]  G. Prats-Boluda,et al.  Electrohysterography in the diagnosis of preterm birth: a review , 2018, Physiological measurement.

[37]  Brian Litt,et al.  A comparison of waveform fractal dimension algorithms , 2001 .

[38]  Danyang Yuan,et al.  Genetic algorithm for the optimization of features and neural networks in ECG signals classification , 2017, Scientific Reports.

[39]  Yiyao Ye-Lin,et al.  Prediction of Labor Induction Success from the Uterine Electrohysterogram , 2019, J. Sensors.

[40]  Chelsea Dobbins,et al.  Prediction of Preterm Deliveries from EHG Signals Using Machine Learning , 2013, PloS one.

[41]  Javier Mas-Cabo,et al.  Design and Assessment of a Robust and Generalizable ANN-Based Classifier for the Prediction of Premature Birth by means of Multichannel Electrohysterographic Records , 2019, J. Sensors.

[42]  Bouaguel Waad Proceedings in Adaptation, Learning and Optimization , 2015, IES.

[43]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[44]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[45]  Jinsong Leng,et al.  A genetic Algorithm-Based feature selection , 2014 .

[46]  Eric Chicken,et al.  Nonparametric Statistical Methods: Hollander/Nonparametric Statistical Methods , 1973 .

[47]  C. Gorman Born too soon. , 2004, Time.

[48]  F. Jager,et al.  Characterization and automatic classification of preterm and term uterine records , 2018, bioRxiv.