Discovering knowledge of medical quality in total hip arthroplasty (THA).

As the incidence of THA is expected to rise with an aging population and improvements in surgery, a satisfactory outcome in health care can effectively increase medical quality. This paper uses a serious data screening function by THA physician to reduce data dimension after data collected from the NHI database, then 8576 cases are obtained from the original cases of 10,388 after screening procedure. The proposed model adopts an imbalanced sampling method to solve class imbalance problem, and utilizes rough set to locate core attributes. Based on the core attributes, the extracted rules can be comprehensive for the rules of medical quality. In verification, THA dataset is taken as case study; the performance of the proposed model is verified and compared with other data-mining methods under some criteria. And the generated decision rules and core attributes could find more managerial implication. Moreover, the result can provide stakeholders with useful THA information to help to make decision.

[1]  Wojciech Ziarko,et al.  Discovering attribute relationships, dependencies and rules by using rough sets , 1995, Proceedings of the Twenty-Eighth Annual Hawaii International Conference on System Sciences.

[2]  Stan Matwin,et al.  Machine Learning for the Detection of Oil Spills in Satellite Radar Images , 1998, Machine Learning.

[3]  P. Katz,et al.  Hospital resource utilization for primary and revision total hip arthroplasty. , 2005, The Journal of bone and joint surgery. American volume.

[4]  Larry R. Medsker,et al.  Hybrid Intelligent Systems , 1995, Springer US.

[5]  R. Słowiński Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory , 1992 .

[6]  Nitesh V. Chawla,et al.  C4.5 and Imbalanced Data sets: Investigating the eect of sampling method, probabilistic estimate, and decision tree structure , 2003 .

[7]  Kevin Ong,et al.  Patient, hospital, and procedure characteristics influencing total hip and knee arthroplasty procedure duration. , 2009, The Journal of arthroplasty.

[8]  Pang-Ning Tan,et al.  Instructor's Solution Manual , 2006 .

[9]  Gustavo E. A. P. A. Batista,et al.  A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.

[10]  Chun-Chin Hsu,et al.  An information granulation based data mining approach for classifying imbalanced data , 2008, Inf. Sci..

[11]  David J Zukor,et al.  In-hospital cost of total hip arthroplasty in Canada and the United States. , 2004, The Journal of bone and joint surgery. American volume.

[12]  H. Rubash,et al.  Predictors of discharge to an inpatient extended care facility after total hip or knee arthroplasty. , 2006, The Journal of arthroplasty.

[13]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[14]  S. Kurtz,et al.  Projections of primary and revision hip and knee arthroplasty in the United States from 2005 to 2030. , 2007, The Journal of bone and joint surgery. American volume.

[15]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[16]  Jerzy W. Grzymala-Busse,et al.  Melanoma prediction using data mining system LERS , 2001, 25th Annual International Computer Software and Applications Conference. COMPSAC 2001.

[17]  Nitesh V. Chawla,et al.  Editorial: special issue on learning from imbalanced data sets , 2004, SKDD.

[18]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[19]  Jerzy W. Grzymala-Busse,et al.  A Comparison of Two Approaches to Data Mining from Imbalanced Data , 2004, J. Intell. Manuf..

[20]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[21]  Susan Jaglal,et al.  Provider volume and other predictors of outcome after total knee arthroplasty: a population study in Ontario. , 2003, Canadian journal of surgery. Journal canadien de chirurgie.

[22]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[23]  Jerzy W. Grzymala-Busse,et al.  A New Version of the Rule Induction System LERS , 1997, Fundam. Informaticae.

[24]  J. Bernstein,et al.  Orthopaedic Scientific Update: A special report on the scientific sessions of the 51st annual meeting of the American Academy of Orthopaedic Surgeons , 1983, Orthopedic nursing.

[25]  Jerzy W. Grzymala-Busse,et al.  Global discretization of continuous attributes as preprocessing for machine learning , 1996, Int. J. Approx. Reason..

[26]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[27]  L. Dorr,et al.  Psychologic Reasons for Patients Preferring Minimally Invasive Total Hip Arthroplasty , 2007, Clinical orthopaedics and related research.

[28]  James G. Wright,et al.  Understanding the Meaning of Satisfaction With Treatment Outcome , 2004, Medical care.

[29]  Z. Pawlak Rough set approach to knowledge-based decision support , 1997 .

[30]  Xindong Wu,et al.  10 Challenging Problems in Data Mining Research , 2006, Int. J. Inf. Technol. Decis. Mak..

[31]  Robert C. Holte,et al.  C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling beats Over-Sampling , 2003 .

[32]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[33]  Andrew W. Moore,et al.  Fast Robust Logistic Regression for Large Sparse Datasets with Binary Outputs , 2003, AISTATS.

[34]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[35]  María José del Jesús,et al.  A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data-sets , 2008, Fuzzy Sets Syst..

[36]  Salvatore Greco,et al.  Rough sets theory for multicriteria decision analysis , 2001, Eur. J. Oper. Res..

[37]  Harry E Rubash,et al.  Differences in patient and procedure characteristics and hospital resource use in primary and revision total joint arthroplasty: a multicenter study. , 2005, The Journal of arthroplasty.

[38]  Fabrice Rossi,et al.  Multi-layer Perceptrons for Functional Data Analysis: A Projection Based Approach , 2002, ICANN.

[39]  Foster J. Provost,et al.  Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction , 2003, J. Artif. Intell. Res..

[40]  J. R. Lieberman,et al.  Impact of declining reimbursement and rising hospital costs on the feasibility of total hip arthroplasty. , 1997, The Journal of arthroplasty.

[41]  M. Maloof Learning When Data Sets are Imbalanced and When Costs are Unequal and Unknown , 2003 .

[42]  P A Dieppe,et al.  Analysis of regional variation in hip and knee joint replacement rates in England using Hospital Episodes Statistics. , 2006, Public health.

[43]  Taeho Jo,et al.  A Multiple Resampling Method for Learning from Imbalanced Data Sets , 2004, Comput. Intell..

[44]  Taeho Jo,et al.  Class imbalances versus small disjuncts , 2004, SKDD.

[45]  Yuehwern Yih,et al.  Knowledge acquisition through information granulation for imbalanced data , 2006, Expert Syst. Appl..

[46]  Roman Słowiński,et al.  Intelligent Decision Support , 1992, Theory and Decision Library.

[47]  Lech Polkowski,et al.  Rough Sets in Knowledge Discovery 2 , 1998 .