Deep Learning Based Biomedical Named Entity Recognition Systems

In this chapter, we are proposing a really crucial downside known as medicine Named Entity Recognition system. Named entity recognition could be a vital mission in linguistic communication process referring to artificial intelligence, information Retrieval and data Extraction. Linguistic communication process could be a subfield of engineering, computer science and data engineering that deals that the interaction between the pc and human language. It deals with the method and analyse the language information. It’s a pc activity during which computers square measure subjected to know, alter and analyse which has automation of activities, strategies of communication. One amongst the vital elements of linguistic communication process (NLP) is called Entity Recognition (NER), which is employed to search out and classify the expressions of specific which means in texts, written in linguistic communication. The various varieties of named entities includes person name, association name, place name, numbers etc. During this book chapter we tend to area unit solely handling medicine named entity recognition (Bio-NER) that could be a basic assignment within the conducting of medicine text terms, like ribonucleic acid, cell type, cell line, protein, and DNA. Biomedical NER be one amongst the foremost core and crucial task in medicine data extraction from documents. Recognizing or characteristic medicine named entities looks to be tougher than characteristic traditional named entities. During this book chapter we tend to area unit victimization Deep learning formula that is additionally called deep structural learning or gradable learning. It’s a division of a broader unit of machine learning ways supported learning knowledge representation conflicting such task algorithms. This kind of learning is supervised, semi supervised or unsupervised. Deep learning model area units are largely inspired by IP and communication pattern in biological nervous systems nonetheless with various variations from structural and purposeful functions of biological brains. For experiment and analysis, we’ve used GENIA Corpus that was created by a gaggle of researchers to develop the analysis of knowledge and text mining system in biological science. It consists of one, 999 MEDLINE abstracts. The GENIA Corpus has been loosely employed by linguistic communication process community for improvement of linguistics search system and institution Bio human language technology tasks. During this analysis, we tend to propose a multi-tasking learning arrangement for Bio-NER that supports NN models to avoid wasting human effort. Deep neural spec that has several layers and every layer abstract options primarily based on the standard generated by the lower layers. After comparing with the results of various experiments like Saha et al.’s (Pattern Recogn. Lett 3:1591–1597, 2010) with a Precision of 68.12, Recall 67.66 and F-Score 67.89; Liao et al.’s (Biomedical Named Entity Recognition Based on Skip-Chain Crfs. pp. 1495–1498, 2012) with a Precision of 72.8, Recall 73.6 and F-Score73.2; ABNER (A Biomedical Named Entity Recognizer, pp. 46–51, 2013) with a Precision of 69.1, Recall 72.0 and F-Score 70.5; Sasaki et al. (How to Make the Most of Ne Dictionaries in Statistical Ner. pp. 63–70, 2008) with a Precision of 68.58, Recall 79.85 and F-Score 73.78; Sun et al.’s (Comput. Biol. Med 37:1327–1333, 2007) with a Precision of 70.2, Recall 72.3 and F-Score 71.2; Our system has achieved a Precision of 66.54, Recall 76.13 and F-score 71.01% on GENIA normal take a look at corpus, that is near to the progressive performance using simply Part-of-speech feature and shows that deep learning will efficiently be performed upon medical specialty Named Entity Recognition. This book chapter deals with the following section: Introduction, Literature review, Architecture, Experiment, Results and analysis, conclusion and future work and References.

[1]  Martijn J. Schuemie,et al.  A dictionary to identify small molecules and drugs in free text , 2009, Bioinform..

[2]  L. Bottou Stochastic Gradient Learning in Neural Networks , 1991 .

[3]  Zhiyong Lu,et al.  NCBI disease corpus: A resource for disease name recognition and concept normalization , 2014, J. Biomed. Informatics.

[4]  Wen-Lian Hsu,et al.  BIOSMILE web search: a web application for annotating biomedical entities and relations , 2008, Nucleic Acids Res..

[5]  Wen-Lian Hsu,et al.  New Challenges for Biological Text-Mining in the Next Decade , 2010, Journal of Computer Science and Technology.

[6]  Yee Whye Teh,et al.  A fast and simple algorithm for training neural probabilistic language models , 2012, ICML.

[7]  Guillaume Lample,et al.  Neural Architectures for Named Entity Recognition , 2016, NAACL.

[8]  Rich Caruana,et al.  Multitask Learning , 1997, Machine Learning.

[9]  Tapio Salakoski,et al.  Distributional Semantics Resources for Biomedical Text Processing , 2013 .

[10]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[11]  Zhihua Liao,et al.  Biomedical Named Entity Recognition Based on Skip-Chain CRFS , 2012, 2012 International Conference on Industrial Control and Electronics Engineering.

[12]  Sampo Pyysalo,et al.  A neural network multi-task learning approach to biomedical named entity recognition , 2017, BMC Bioinformatics.

[13]  Heng Ji,et al.  Entity Linking for Biomedical Literature , 2014, DTMBIO '14.

[14]  Wen-Lian Hsu,et al.  NERBio: using selected word conjunctions, term normalization, and global patterns to improve biomedical named entity recognition , 2006, BMC Bioinformatics.

[15]  Jaehoon Choi,et al.  BEST: Next-Generation Biomedical Entity Search Tool for Knowledge Discovery from Biomedical Literature , 2016, PloS one.

[16]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[17]  Lishuang Li,et al.  Two-phase biomedical named entity recognition using CRFs , 2009, Comput. Biol. Chem..

[18]  Yoshua Bengio,et al.  Word Representations: A Simple and General Method for Semi-Supervised Learning , 2010, ACL.

[19]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[20]  Geoffrey Zweig,et al.  Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.

[21]  A. Valencia,et al.  Evaluation of text-mining systems for biology: overview of the Second BioCreative community challenge , 2008, Genome Biology.

[22]  Sophia Ananiadou,et al.  Developing a Robust Part-of-Speech Tagger for Biomedical Text , 2005, Panhellenic Conference on Informatics.

[23]  Sophia Ananiadou,et al.  A Neural Layered Model for Nested Named Entity Recognition , 2018, NAACL.

[24]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[25]  Min Song,et al.  Developing a hybrid dictionary-based bio-entity recognition technique , 2015, BMC Medical Informatics and Decision Making.

[26]  Pabitra Mitra,et al.  A composite kernel for named entity recognition , 2010, Pattern Recognit. Lett..

[27]  Dietrich Rebholz-Schuhmann,et al.  Text processing through Web services: calling Whatizit , 2008, Bioinform..

[28]  Yi Guan,et al.  Rich features based Conditional Random Fields for biological named entities recognition , 2007, Comput. Biol. Medicine.

[29]  Holger Schwenk,et al.  Continuous space language models , 2007, Comput. Speech Lang..

[30]  Jaewoo Kang,et al.  Drug drug interaction extraction from the literature using a recursive neural network , 2018, PloS one.

[31]  Hae-Chang Rim,et al.  Biomedical named entity recognition using two-phase model based on SVMs , 2004, J. Biomed. Informatics.

[32]  Sophia Ananiadou,et al.  How to make the most of NE dictionaries in statistical NER , 2008, BMC Bioinformatics.

[33]  Ronan Collobert,et al.  Deep Learning for Efficient Discriminative Parsing , 2011, AISTATS.

[34]  Proux,et al.  Detecting Gene Symbols and Names in Biological Texts: A First Step toward Pertinent Information Extraction. , 1998, Genome informatics. Workshop on Genome Informatics.

[35]  Luo Si,et al.  Boosting performance of bio-entity recognition by combining results from multiple systems , 2005, BIOKDD.

[36]  Lishuang Li,et al.  A Two-Phase Bio-NER System Based on Integrated Classifiers and Multiagent Strategy , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[37]  Nigel Collier,et al.  Introduction to the Bio-entity Recognition Task at JNLPBA , 2004, NLPBA/BioNLP.

[38]  Andreas Vlachos Evaluating and combining and biomedical named entity recognition systems , 2007, BioNLP@ACL.