Automatic classification of medical reports, the CIREA project

Choosing a patient's reasons for staying in hospital amongst the 52, 000 pathology codes listed in the ICD-10 (International Classification of Diseases) requires that the practitioner spends a large amount of time keyboarding and searching, which may discourage him. However these codes are mandatory in many countries when the patient leaves the hospital, for biostatistical and administrative studies. The aim of the CIREA project is to propose an automatic ICD coding approach by mining textual medical reports. For that purpose we have proposed new algorithms such the EDA desuffixer, the CLO3 classification algorithm and the K-measure indicator.