Technological Innovations in Disease Management: Text Mining US Patent Data From 1995 to 2017

Background Patents are important intellectual property protecting technological innovations that inspire efficient research and development in biomedicine. The number of awarded patents serves as an important indicator of economic growth and technological innovation. Researchers have mined patents to characterize the focuses and trends of technological innovations in many fields. Objective To expand patent mining to biomedicine and facilitate future resource allocation in biomedical research for the United States, we analyzed US patent documents to determine the focuses and trends of protected technological innovations across the entire disease landscape. Methods We analyzed more than 5 million US patent documents between 1995 and 2017, using summary statistics and dynamic topic modeling. More specifically, we investigated the disease coverage and latent topics in patent documents over time. We also incorporated the patent data into the calculation of our recently developed Research Opportunity Index (ROI) and Public Health Index (PHI), to recalibrate the resource allocation in biomedical research. Results Our analysis showed that protected technological innovations have been primarily focused on socioeconomically critical diseases such as “other cancers” (malignant neoplasm of head, face, neck, abdomen, pelvis, or limb; disseminated malignant neoplasm; Merkel cell carcinoma; and malignant neoplasm, malignant carcinoid tumors, neuroendocrine tumor, and carcinoma in situ of an unspecified site), diabetes mellitus, and obesity. The United States has significantly improved resource allocation to biomedical research and development over the past 17 years, as illustrated by the decreasing PHI. Diseases with positive ROI, such as ankle and foot fracture, indicate potential research opportunities for the future. Development of novel chemical or biological drugs and electrical devices for diagnosis and disease management is the dominating topic in patented inventions. Conclusions This multifaceted analysis of patent documents provides a deep understanding of the focuses and trends of technological innovations in disease management in patents. Our findings offer insights into future research and innovation opportunities and provide actionable information to facilitate policy makers, payers, and investors to make better evidence-based decisions regarding resource allocation in biomedicine.

[1]  Markus Bundschus,et al.  Text mining patents for biomedical knowledge. , 2016, Drug discovery today.

[2]  J. Murray,et al.  Monoclonal antibody therapy for solid tumors. , 2000, Cancer treatment reviews.

[3]  Medicaid Services,et al.  International Classification of Diseases, Ninth Revision, Clinical Modification , 2011 .

[4]  M. Pirrung,et al.  Modular synthesis of candidate indole-based insulin mimics by Claisen rearrangement. , 2008, Organic letters.

[5]  Michael Röder,et al.  Exploring the Space of Topic Coherence Measures , 2015, WSDM.

[6]  M. Ladjemi Anti-idiotypic antibodies as cancer vaccines: achievements and future improvements , 2012, Front. Oncol..

[7]  Maryam Zolnoori,et al.  Public Opinions Toward Diseases: Infodemiological Study on News Media Data , 2018, Journal of medical Internet research.

[8]  Ivo M Foppa,et al.  Annual estimates of the burden of seasonal influenza in the United States: A tool for strengthening influenza surveillance and preparedness , 2018, Influenza and other respiratory viruses.

[9]  C. Elger,et al.  Expression of mRNAs encoding for 17β-hydroxisteroid dehydrogenase isozymes 1, 2, 3 and 4 in epileptic human hippocampus , 2000, Epilepsy Research.

[10]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[11]  Ruili Huang,et al.  Application of dynamic topic models to toxicogenomics data , 2016, BMC Bioinformatics.

[12]  W. Raghupathi,et al.  Innovation at country-level: association between economic development and patents , 2017, Journal of Innovation and Entrepreneurship.

[13]  Nada Ahmed,et al.  Breast Cancer: Conventional Diagnosis and Treatment Modalities and Recent Patents and Technologies , 2015, Breast cancer : basic and clinical research.

[14]  Mohammed K. Abdelhameid,et al.  Synthesis of some novel thieno[3,2-d]pyrimidines as potential cytotoxic small molecules against breast cancer. , 2013, Chemical & pharmaceutical bulletin.

[15]  Doron Lancet,et al.  Mapping of molecular pathways, biomarkers and drug targets for diabetic nephropathy , 2011, Proteomics. Clinical applications.

[16]  Matthew E. Crawford,et al.  Diabetes area patent participation analysis – part II: years 2011-2016 , 2018, Expert opinion on therapeutic patents.

[17]  Petr Sojka,et al.  Software Framework for Topic Modelling with Large Corpora , 2010 .

[18]  1995 Liu,et al.  United States Patent , 2011 .

[19]  Nilay D Shah,et al.  Optum Labs: building a novel node in the learning health care system. , 2014, Health affairs.

[20]  Daniela M. Witten,et al.  An Introduction to Statistical Learning: with Applications in R , 2013 .

[21]  Gang Feng,et al.  Disease Ontology: a backbone for disease semantic integration , 2011, Nucleic Acids Res..

[22]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[23]  Fang-Mei Tseng,et al.  Using patent data to analyze trends and the technological strategies of the amorphous silicon thin-film solar cell industry , 2011 .

[24]  H. Boulaiz,et al.  What’s new in treatment of pancreatic cancer: a patent review (2010–2017) , 2017, Expert opinion on therapeutic patents.

[25]  Kevin R. Harwell United States Patent and Trademark Office , 2002 .

[26]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[27]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[28]  Zhiyong Lu,et al.  Mining chemical patents with an ensemble of open systems , 2016, Database J. Biol. Databases Curation.

[29]  M. Goldenberg,et al.  Overview of drugs used for epilepsy and seizures: etiology, diagnosis, and treatment. , 2010, P & T : a peer-reviewed journal for formulary management.

[30]  S. Bernell,et al.  Use Your Words Carefully: What Is a Chronic Disease? , 2016, Front. Public Health.

[31]  M. Simonato Gene therapy for epilepsy , 2014, Epilepsy & Behavior.

[32]  Y. Nakano,et al.  Discovery of novel 5-oxa-2,6-diazaspiro[3.4]oct-6-ene derivatives as potent, selective, and orally available somatostatin receptor subtype 5 (SSTR5) antagonists for treatment of type 2 diabetes mellitus. , 2017, Bioorganic & medicinal chemistry.

[33]  Qiaohong Yang,et al.  Synthesis and Biological Evaluation of Novel Gigantol Derivatives as Potential Agents in Prevention of Diabetic Cataract , 2015, PloS one.

[34]  N. Cox,et al.  Evaluating phecodes, clinical classification software, and ICD-9-CM codes for phenome-wide association studies in the electronic health record , 2017, PloS one.

[35]  P. Álvarez,et al.  New gene therapy strategies for cancer treatment: a review of recent patents. , 2012, Recent patents on anti-cancer drug discovery.

[36]  B. Wörmann Breast cancer: basics, screening, diagnostics and treatment. , 2017, Medizinische Monatsschrift fur Pharmazeuten.

[37]  Cyril Grouin Biomedical entity extraction using machine-learning based approaches , 2014, LREC.

[38]  W. Dietz The response of the US Centers for Disease Control and Prevention to the obesity epidemic. , 2015, Annual review of public health.

[39]  Vikram Singh,et al.  Patent data mining in fisheries sector: An analysis using Questel-Orbit and Espacenet , 2017 .

[40]  Jiang Bian,et al.  Using Social Media Data to Understand the Impact of Promotional Information on Laypeople’s Discussions: A Case Study of Lynch Syndrome , 2017, Journal of medical Internet research.

[41]  Annie T. Chen,et al.  What Online Communities Can Tell Us About Electronic Cigarettes and Hookah Use: A Study Using Text Mining and Visualization Techniques , 2015, Journal of medical Internet research.

[42]  R. D. Connell Glucagon antagonists for the treatment of Type 2 diabetes , 1999 .

[43]  Tao Li,et al.  Patent Mining: A Survey , 2015, SKDD.

[44]  A. Chawla,et al.  3-Aryl-2-{4-[4-(2,4-dioxothiazolidin-5-ylmethyl)phenoxy]-phenyl}-acrylic acid alkyl ester: synthesis and antihyperglycemic evaluation , 2011, Medicinal Chemistry Research.

[45]  I. Cockburn,et al.  The importance of patents to innovation: updated cross-industry comparisons with biopharmaceuticals , 2015, Expert opinion on therapeutic patents.

[46]  S. W. Kim,et al.  GLP-1 gene delivery for the treatment of type 2 diabetes. , 2003, Molecular therapy : the journal of the American Society of Gene Therapy.

[47]  M. J. Ahsan Anticonvulsant activity and neuroprotection assay of 3-substituted-N-aryl-6,7-dimethoxy-3a,4-dihydro-3H-indeno[1,2-c]pyrazole-2-carboxamide analogues , 2017 .

[48]  Wolfgang H. B. Sauer,et al.  Design, synthesis, and biological activity of novel, potent, and selective (benzoylaminomethyl)thiophene sulfonamide inhibitors of c-Jun-N-terminal kinase. , 2004, Journal of medicinal chemistry.

[49]  N. Harbeck,et al.  Antibody therapy for breast cancer. , 2005, Anticancer research.

[50]  Yi-Ping Phoebe Chen,et al.  Image based computer aided diagnosis system for cancer detection , 2015, Expert Syst. Appl..

[51]  Jae Ha Gwak,et al.  Identifying the trends in wound-healing patents for successful investment strategies , 2017, PloS one.

[52]  Manuel C Peitsch,et al.  Competitive intelligence and patent analysis in drug discovery. , 2005, Drug discovery today. Technologies.