Sampling Bias and Class Imbalance in Maximum-likelihood Logistic Regression

[1]  Jean-Paul Chilès,et al.  Wiley Series in Probability and Statistics , 2012 .

[2]  Alan Agresti,et al.  Categorical Data Analysis , 2003 .

[3]  Thomas Oommen,et al.  Validation and Application of Empirical Liquefaction Models , 2010 .

[4]  L. Correia,et al.  HDL-cholesterol level provides additional prognosis in acute coronary syndromes. , 2009, International journal of cardiology.

[5]  L. López,et al.  Discriminant methods for radar detection of hail , 2009 .

[6]  David P. Williams,et al.  Mine Classification With Imbalanced Data , 2009, IEEE Geoscience and Remote Sensing Letters.

[7]  Peng Li,et al.  Fingerprint Matching Based on Neighboring Information and Penalized Logistic Regression , 2009, ICB.

[8]  Andrew K. C. Wong,et al.  Classification of Imbalanced Data: a Review , 2009, Int. J. Pattern Recognit. Artif. Intell..

[9]  Ömer Kaan Baykan,et al.  Predicting bank financial failures using neural networks, support vector machines and multivariate statistical methods: A comparative analysis in the sample of savings deposit insurance fund (SDIF) transferred banks in Turkey , 2009, Expert Syst. Appl..

[10]  Yanqing Zhang,et al.  SVMs Modeling for Highly Imbalanced Classification , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[11]  C. Ellison,et al.  Does Religiosity Affect Health Risk Behaviors in Pregnant and Postpartum Women? , 2009, Maternal and Child Health Journal.

[12]  Li Zhu,et al.  Data Mining on Imbalanced Data Sets , 2008, 2008 International Conference on Advanced Computer Theory and Engineering.

[13]  Taghi M. Khoshgoftaar,et al.  Hybrid sampling for imbalanced data , 2008, 2008 IEEE International Conference on Information Reuse and Integration.

[14]  Dirk Van den Poel,et al.  Separating financial from commercial customer churn: A modeling step towards resolving the conflict between the sales and credit department , 2008, Expert Syst. Appl..

[15]  H. A. Nefeslioglu,et al.  Landslide susceptibility mapping for a part of tectonic Kelkit Valley (Eastern Black Sea region of Turkey) , 2008 .

[16]  José Salvador Sánchez,et al.  On the k-NN performance in a challenging scenario of imbalance and overlapping , 2008, Pattern Analysis and Applications.

[17]  Edgar Berrezueta,et al.  Landslides in the Central Coalfield (Cantabrian Mountains, NW Spain): Geomorphological features, conditioning factors and methodological implications in susceptibility assessment , 2007 .

[18]  Faculteit Economie,et al.  Separating Financial From Commercial Customer Churn: A Modeling Step Towards Resolving The Conflict Between The Sales And Credit Department , 2007 .

[19]  Zhi-Hua Zhou,et al.  Exploratory Under-Sampling for Class-Imbalance Learning , 2006, Sixth International Conference on Data Mining (ICDM'06).

[20]  Jonathan P. Stewart,et al.  CPT-Based Probabilistic and Deterministic Assessment of In Situ Seismic Soil Liquefaction Potential - eScholarship , 2006 .

[21]  M. Eeckhaut,et al.  Prediction of landslide susceptibility using rare events logistic regression: A case-study in the Flemish Ardennes (Belgium) , 2006 .

[22]  Sheng Yao Lai,et al.  Logistic Regression Model for Evaluating Soil Liquefaction Probability Using CPT Data , 2006 .

[23]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[24]  Saro Lee,et al.  Landslide susceptibility mapping in the Damrei Romel area, Cambodia using frequency ratio and logistic regression models , 2006 .

[25]  Scott A. Olson,et al.  A logistic regression equation for estimating the probability of a stream in Vermont having intermittent flow , 2006 .

[26]  Gardner C. Bent,et al.  A revised logistic regression equation and an automated procedure for mapping the probability of a stream flowing perennially in Massachusetts , 2006 .

[27]  H. A. Nefeslioglu,et al.  Susceptibility assessments of shallow earthflows triggered by heavy rainfall at three catchments by logistic regression analyses , 2005 .

[28]  K. T. Chau,et al.  Regional bias of landslide data in generating susceptibility maps using logistic regression: Case of Hong Kong Island , 2005 .

[29]  L. Ayalew,et al.  Landslides in Sado Island of Japan: Part II. GIS-based susceptibility mapping with comparisons of results from two methods and verifications , 2005 .

[30]  H. Wang,et al.  Comparative evaluation of landslide susceptibility in Minamata area, Japan , 2005 .

[31]  L. Ayalew,et al.  The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan , 2005 .

[32]  David R. Brillinger,et al.  Probability based models for estimation of wildfire risk , 2004 .

[33]  Saro Lee Application of Likelihood Ratio and Logistic Regression Models to Landslide Susceptibility Mapping Using GIS , 2004, Environmental management.

[34]  Andrea G. Fabbri,et al.  Validation of Spatial Prediction Models for Landslide Hazard Mapping , 2003 .

[35]  Foster J. Provost,et al.  Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction , 2003, J. Artif. Intell. Res..

[36]  John C. Davis,et al.  Using multiple logistic regression and GIS technology to predict landslide hazard in northeast Kansas, USA , 2003 .

[37]  C. F. Lee,et al.  A spatiotemporal probabilistic modelling of storm‐induced shallow landsliding using aerial photographs and logistic regression , 2003 .

[38]  Ronald D. Andrus,et al.  Assessing probability-based methods for liquefaction potential evaluation , 2002 .

[39]  C. Hsein Juang,et al.  Probabilistic Framework for Liquefaction Potential by Shear Wave Velocity , 2001 .

[40]  Gary King,et al.  Explaining Rare Events in International Relations , 2001, International Organization.

[41]  P. Atkinson,et al.  Generalised linear modelling of susceptibility to landsliding in the Central Apennines, Italy , 1998 .

[42]  Maureen M. Toner,et al.  RIVER HYDROLOGY AND RIPARIAN WETLANDS: A PREDICTIVE MODEL FOR ECOLOGICAL ASSEMBLY , 1997 .

[43]  Nitin R. Patel,et al.  Exact logistic regression: theory and examples. , 1995, Statistics in medicine.

[44]  Guido W. Imbens,et al.  An efficient method of moments estimator for discrete choice models with choice-based sampling , 1992 .

[45]  M. Kavvas New directions for surface water modeling , 1989 .

[46]  Nitin R. Patel,et al.  Computing Distributions for Exact Logistic Regression , 1987 .

[47]  Alberto Carrara,et al.  Multivariate models for landslide hazard evaluation , 1983 .

[48]  G. F. Bonham-Carter,et al.  Integration of mineral resource data for Kasmere Lake area, Northwest Manitoba, with emphasis on uranium , 1983 .

[49]  S. Cosslett,et al.  Maximum likelihood estimator for choice-based samples , 1981 .

[50]  F. Agterberg,et al.  Automatic contouring of geological maps to detect target areas for mineral exploration , 1974 .

[51]  D. Cox,et al.  The analysis of binary data , 1971 .