Master clinical medical knowledge at certificated-doctor-level with deep learning model

Mastering of medical knowledge to human is a lengthy process that typically involves several years of school study and residency training. Recently, deep learning algorithms have shown potential in solving medical problems. Here we demonstrate mastering clinical medical knowledge at certificated-doctor-level via a deep learning framework Med3R, which utilizes a human-like learning and reasoning process. Med3R becomes the first AI system that has successfully passed the written test of National Medical Licensing Examination in China 2017 with 456 scores, surpassing 96.3% human examinees. Med3R is further applied for providing aided clinical diagnosis service based on real electronic medical records. Compared to human experts and competitive baselines, our system can provide more accurate and consistent clinical diagnosis results. Med3R provides a potential possibility to alleviate the severe shortage of qualified doctors in countries and small cities of China by providing computer-aided medical care and health services for patients.AI is used increasingly in medical diagnostics. Here, the authors present a deep learning model that masters medical knowledge, demonstrated by it having passed the written test of the 2017 National Medical Licensing Examination in China, and can provide help with clinical diagnosis based on electronic health care records.

[1]  Jennifer Chu-Carroll,et al.  Special Questions and techniques , 2012, IBM J. Res. Dev..

[2]  Siddharth Patwardhan,et al.  Question analysis: How Watson reads a clue , 2012, IBM J. Res. Dev..

[3]  James Fan,et al.  Textual evidence gathering and analysis , 2012, IBM J. Res. Dev..

[4]  Chang Wang,et al.  Relation extraction and scoring in DeepQA , 2012, IBM J. Res. Dev..

[5]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[6]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[7]  Gerald Tesauro,et al.  Statistical Approaches to Question Answering in Watson , 2012 .

[8]  Zhenchao Jiang,et al.  Training word embeddings for deep learning in biomedical text mining tasks , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[9]  Chris H. Q. Ding,et al.  Robust nonnegative matrix factorization using L21-norm , 2011, CIKM '11.

[10]  Philip Bachman,et al.  Iterative Alternating Neural Attention for Machine Reading , 2016, ArXiv.

[11]  Aditya Kalyanpur,et al.  A framework for merging and ranking of answers in DeepQA , 2012, IBM J. Res. Dev..

[12]  Zuzana Pelikánová,et al.  Google Knowledge Graph , 2014 .

[13]  Jennifer Chu-Carroll,et al.  Textual resource acquisition and engineering , 2012, IBM J. Res. Dev..

[14]  Dmitry Zelenko,et al.  Kernel Methods for Relation Extraction , 2002, J. Mach. Learn. Res..

[15]  Danqi Chen,et al.  Reasoning With Neural Tensor Networks for Knowledge Base Completion , 2013, NIPS.

[16]  Aditya Kalyanpur,et al.  Automatic knowledge extraction from documents , 2012, IBM J. Res. Dev..

[17]  Kam-Fai Wong,et al.  Towards Neural Network-based Reasoning , 2015, ArXiv.

[18]  Zhiyuan Liu,et al.  Learning Entity and Relation Embeddings for Knowledge Graph Completion , 2015, AAAI.

[19]  Omer Levy,et al.  Linguistic Regularities in Sparse and Explicit Word Representations , 2014, CoNLL.

[20]  Fei Li,et al.  A neural joint model for entity and relation extraction from biomedical text , 2017, BMC Bioinformatics.

[21]  Siddharth Patwardhan,et al.  When Did that Happen? - Linking Events and Relations to Timestamps , 2012, EACL.

[22]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[23]  Siddharth Patwardhan,et al.  Labeling by landscaping: classifying tokens in context by pruning and decorating trees , 2012, CIKM '12.

[24]  Jennifer Chu-Carroll,et al.  Finding needles in the haystack: Search and candidate generation , 2012, IBM J. Res. Dev..

[25]  Eric W. Brown,et al.  Making Watson fast , 2012, IBM J. Res. Dev..

[26]  Haixun Wang,et al.  Probase: a probabilistic taxonomy for text understanding , 2012, SIGMOD Conference.

[27]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[28]  Jennifer Chu-Carroll,et al.  Identifying implicit relationships , 2012, IBM J. Res. Dev..

[29]  Aditya Kalyanpur,et al.  A Comparison of Hard Filters and Soft Evidence for Answer Typing in Watson , 2012, International Semantic Web Conference.

[30]  Michael C. McCord,et al.  Deep parsing in Watson , 2012, IBM J. Res. Dev..

[31]  Ming Zhou,et al.  Gated Self-Matching Networks for Reading Comprehension and Question Answering , 2017, ACL.

[32]  Aditya Kalyanpur,et al.  Typing candidate answers using type coercion , 2012, IBM J. Res. Dev..

[33]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[34]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[35]  Doug Downey,et al.  Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..

[36]  Siddharth Patwardhan,et al.  Fact-based question decomposition in DeepQA , 2012, IBM J. Res. Dev..

[37]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.