Natural language processing and machine learning to enable automatic extraction and classification of patients’ smoking status from electronic medical records