Exploring Diseases and Syndromes in Neurology Case Reports from 1955 to 2017 with Text Mining

BACKGROUND A large number of neurology case reports have been published, and it is impractical for human medical experts to review them all. Text mining offers a computational approach to investigating the neurology literature and capturing meaningful patterns. The overarching goal of this study is to use text mining to provide a new perspective on diseases and syndromes in neurology case reports over the last six decades.

METHODS We extracted diseases and syndromes (DsSs) from more than 65,000 neurology case reports published in 66 journals indexed in PubMed from 1955 to 2017. Text mining was applied to the detected DsSs to identify high-frequency DsSs, categorize them, and explore linear trends over the 63-year time frame.

RESULTS The text mining methods identified high-frequency neurologic DsSs and the relationships between them from 1955 to 2017. We detected more than 18,000 unique DsSs and found 10 categories of neurologic DsSs. The trend analysis showed increasing trends in case reports for the top 10 high-frequency DsSs, whereas the categories showed mixed trends.

CONCLUSION Our study provides new insights into applying text mining methods to investigate DsSs in a large body of medical case reports spanning several decades. The proposed approach can provide a macro-level analysis of the medical literature by discovering patterns and tracking them over time, helping physicians explore these case reports more efficiently.
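The frequency and linear-trend steps described in METHODS can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the abstract does not say which extraction tool or regression procedure was used, so the input format (a list of `(year, set_of_DsS_terms)` pairs), the function names `yearly_counts` and `linear_trend`, and the toy data are all assumptions made for illustration. The trend is taken to be an ordinary least-squares line fitted to the number of reports per year that mention a given DsS.

```python
from collections import Counter, defaultdict


def yearly_counts(reports, term):
    """Count, per year, how many reports mention `term`.

    `reports` is a hypothetical list of (year, set_of_terms) pairs.
    """
    counts = defaultdict(int)
    for year, terms in reports:
        if term in terms:
            counts[year] += 1
    return dict(counts)


def linear_trend(counts, start, end):
    """Fit count = slope * year + intercept by ordinary least squares.

    Years in [start, end] with no reports are treated as zero counts.
    """
    if end <= start:
        raise ValueError("need at least two distinct years")
    years = list(range(start, end + 1))
    values = [counts.get(y, 0) for y in years]
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Toy data (invented for illustration, not taken from the study).
reports = [
    (1955, {"epilepsy"}),
    (1956, {"epilepsy", "stroke"}),
    (1956, {"epilepsy"}),
    (1957, {"epilepsy"}),
    (1957, {"epilepsy"}),
    (1957, {"epilepsy", "stroke"}),
]

# High-frequency DsSs: number of reports mentioning each term.
frequency = Counter(term for _, terms in reports for term in terms)

epilepsy_slope, _ = linear_trend(yearly_counts(reports, "epilepsy"), 1955, 1957)
stroke_slope, _ = linear_trend(yearly_counts(reports, "stroke"), 1955, 1957)
```

A positive slope indicates that a DsS appears in more reports per year over time, and a negative slope indicates a decline. In a real analysis the counts would come from the extracted DsSs for each year from 1955 to 2017, and the fit would normally be accompanied by a significance test.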
