Using a Machine Learning Logistic Regression Algorithm to Classify Nanomedicine Clinical Trials in a Known Repository

Today, the nanotechnology is the most critical technology that helps in many scientific advances, because this science allows us to work with molecular structures and their atoms, obtaining material that acts chemical and biologically different to those manifesting in bigger longitudes. Many sciences join nanotechnology to improve their researches, and one of them is medicine. In nanomedicine, many researchers are looking for a way to obtain information about these nanometric materials to enhance their studies that lead in many occasions to prove these methods or to create a new compound that helps modern medicine against dominant diseases. Years after years the world increase trials with these nanomaterials and in this work the authors are going to demonstrate this issue, using the clinical trials repository in one of the most famous trials web pages. Many scientists that work with these trials, wish to obtain only the ones those that need, but in these repositories, they have to search and read each one to make sure that the trial is about what they are researching. The authors implement an application for build a train model that involved a new method of pre-processing text to classify through logistic regression in these trials. For this classification, the authors downloaded an entire database (www.clinicaltrials.gov) and used nanoinformatic with an artificial intelligence machine learning supervised algorithm to classify them. The authors classified trials that are about nanomedicine and trials that not. And finally present the results of the number of clinical trials that are about nanomedicine.

[1]  Miguel García-Remesal,et al.  Building an Index of Nanomedical Resources: An Automatic Approach Based on Text Mining , 2008, KES.

[2]  Alberto Anguita,et al.  A Machine Learning Approach to Identify Clinical Trials Involving Nanodrugs and Nanodevices from ClinicalTrials.gov , 2014, PloS one.

[3]  Miguel García-Remesal,et al.  Using Nanoinformatics Methods for Automatically Identifying Relevant Nanotoxicology Entities from the Literature , 2012, BioMed research international.

[4]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[5]  Sholom M. Weiss,et al.  Automated learning of decision rules for text categorization , 1994, TOIS.

[6]  Arthur G Erdman,et al.  The big picture on nanomedicine: the state of investigational and approved nanomedicine products. , 2013, Nanomedicine : nanotechnology, biology, and medicine.

[7]  V Maojo,et al.  Nanoinformatics knowledge infrastructures: bringing efficient information management to nanomedical research. , 2013, Computational science & discovery.

[8]  V Maojo,et al.  International Efforts in Nanoinformatics Research Applied to Nanomedicine , 2010, Methods of Information in Medicine.

[9]  David R. Liu,et al.  Small-molecule discovery from DNA-encoded chemical libraries. , 2011, Chemical Society reviews.

[10]  Oded Maimon,et al.  Predictive Toxicology of cobalt ferrite nanoparticles: comparative in-vitro study of different cellular models using methods of knowledge discovery from data , 2013, Particle and Fibre Toxicology.

[11]  Victor Maojo,et al.  Nanoinformatics: New Challenges for Biomedical Informatics at the Nano Level , 2009, MIE.

[12]  D. Buxton,et al.  Current status of nanotechnology approaches for cardiovascular disease: a personal perspective. , 2009, Wiley interdisciplinary reviews. Nanomedicine and nanobiotechnology.

[13]  Panos Macheras,et al.  Current regulatory approaches of bioequivalence testing , 2012, Expert opinion on drug metabolism & toxicology.

[14]  David Madigan,et al.  Large-Scale Bayesian Logistic Regression for Text Categorization , 2007, Technometrics.

[15]  Jon D. Patrick,et al.  Research and applications: Supervised machine learning and active learning in classification of radiology reports , 2014, J. Am. Medical Informatics Assoc..

[16]  K.J. Cios,et al.  From the guest editor medical data mining and knowledge discovery , 2000, IEEE Engineering in Medicine and Biology Magazine.

[17]  V. Quirke,et al.  The Era of Biomedicine: Science, Medicine, and Public Health in Britain and France after the Second World War , 2008, Medical History.

[18]  Daniel A. Vallero,et al.  Toward a Blended Ontology: Applying Knowledge Systems to Compare Therapeutic and Toxicological Nanoscale Domains , 2012, Journal of biomedicine & biotechnology.

[19]  C. Apte,et al.  Data mining with decision trees and decision rules , 1997, Future Gener. Comput. Syst..

[20]  Khairullah Khan,et al.  A Review of Machine Learning Algorithms for Text-Documents Classification , 2010 .