A Validation Study of a Deep Learning-Based Doping Drug Text Recognition System to Ensure Safe Drug Use among Athletes

This study aimed to develop an English version of a doping drug-recognition system using deep learning-based optical character recognition (OCR) technology. A database of 336 banned substances was built based on the World Anti-Doping Agency’s International Standard Prohibited List and the Korean Pharmaceutical Information Center’s Drug Substance Information. For accuracy and validity analysis, 886 drug substance images, including 152 images of prescriptions and drug substance labels collected using data augmentation, were used. The developed hybrid system, based on the Tesseract OCR model, can be accessed by both a smartphone and website. A total of 5379 words were extracted, and the system showed character recognition errors regarding 91 words, showing high accuracy (98.3%). The system correctly classified all 624 images for acceptable substances, 218 images for banned substances, and incorrectly recognized 44 of the banned substances as acceptable. The validity analysis showed a high level of accuracy (0.95), sensitivity (1.00), and specificity (0.93), suggesting system validity. The system has the potential of allowing athletes who lack knowledge about doping to quickly and accurately check whether they are taking banned substances. It may also serve as an efficient option to support the development of a fair and healthy sports culture.

[1]  Ji-yong Lee,et al.  Developement of Doping Drug Recognition System: Application of Deep Learning-Based OCR Technology , 2022, The Korean Journal of Physical Education.

[2]  David Pavot A Gap or Lacuna in the World Anti-Doping Code? Remarks on the CAS Interpretation in IOC, WADA, and ISU v. RUSADA, Kamila Valieva and Russian Olympic Committee (CAS OG 22-08, CAS OG 22-09, and CAS OG 22-10) , 2022, Frontiers in Sports and Active Living.

[3]  Meenu Gupta,et al.  E-Challan Automation for RTO using OCR , 2021, 2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA).

[4]  S. Rhie,et al.  Sports Pharmacy: New Specialty of Pharmacists and Pharmaceutical Care Services , 2021 .

[5]  Lobna Shaheen,et al.  Medical Prescription Recognition using Machine Learning , 2021, 2021 IEEE 11th Annual Computing and Communication Workshop and Conference (CCWC).

[6]  G. Peterson,et al.  Pharmacists as a Source of Advice on Medication Use for Athletes , 2020, Pharmacy.

[7]  Monirul Islam Pavel,et al.  IoT Enabled Prescription Reading Smart Medicine Dispenser Implementing Maximally Stable Extremal Regions and OCR , 2019, 2019 Third International conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC).

[8]  Zheng Huang,et al.  ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[9]  Graham W. Taylor,et al.  Learning Confidence for Out-of-Distribution Detection in Neural Networks , 2018, ArXiv.

[10]  Awais Ahmad,et al.  Deep learning in big data Analytics: A comparative study , 2017, Comput. Electr. Eng..

[11]  Taegyu Kim,et al.  Korean national athletes’ knowledge, practices, and attitudes of doping: a cross-sectional study , 2017, Substance Abuse Treatment, Prevention, and Policy.

[12]  Soumya K. Ghosh,et al.  Optical Character Recognition Systems for Different Languages with Soft Computing , 2016, Studies in Fuzziness and Soft Computing.

[13]  Ariel Linden,et al.  Using data mining techniques to characterize participation in observational studies. , 2016, Journal of evaluation in clinical practice.

[14]  Mark D. McDonnell,et al.  Understanding Data Augmentation for Classification: When to Warp? , 2016, 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[15]  T. Krosshaug,et al.  Doping prevention through anti-doping education and practical strength training: The Hercules program , 2016 .

[16]  Peter Bell,et al.  A case study analysis of a sophisticated sports doping network: Lance Armstrong and the USPS Team , 2016 .

[17]  A. Awaisu,et al.  Perspective of pharmacists in Qatar regarding doping and anti-doping in sports. , 2016, The Journal of sports medicine and physical fitness.

[18]  Marie Overbye Doping control in sport: An investigation of how elite athletes perceive and trust the functioning of the doping testing system in their sport , 2016 .

[19]  Younghan Cho Sport celebrity in South Korea: Park, Tae-Hwan from new generation to fallen angel , 2015 .

[20]  E. Rintaugu,et al.  Influence of sports disciplines and demographics of Kenya colleges athletes on their awareness of doping in sports in Kenya : A cas e of the University of Nairobi , 2015 .

[21]  D. Baron,et al.  Clinical sports psychiatry : an international perspective , 2013 .

[22]  K. Sapna,et al.  An Android based Medication Reminder System based on OCR using ANN , 2013 .

[23]  Ray W. Smith,et al.  History of the Tesseract OCR engine: what worked and what didn't , 2013, Electronic Imaging.

[24]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[25]  J. McKenna,et al.  Doping in sport: a review of medical practitioners' knowledge, attitudes and beliefs. , 2011, The International journal on drug policy.

[26]  Shane J. Schvaneveldt,et al.  Using Statistical Process Control Charts to Identify the Steroids Era in Major League Baseball: An Educational Exercise , 2011 .

[27]  Raymond Smith,et al.  Adapting the Tesseract open source OCR engine for multilingual OCR , 2009, MOCR '09.

[28]  Dionne L. Koller From Medals to Morality: Sportive Nationalism and the Problem of Doping in Sports , 2008 .

[29]  G. Lippi,et al.  Doping in competition or doping in sport? , 2008, British medical bulletin.

[30]  R. Smith,et al.  An Overview of the Tesseract OCR Engine , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[31]  D. Baron,et al.  Doping in sports and its spread to at-risk populations: an international review. , 2007, World psychiatry : official journal of the World Psychiatric Association.

[32]  Mark Fainaru-Wada,et al.  Game of Shadows: Barry Bonds, BALCO, and the Steroids Scandal that Rocked Professional Sports , 2006 .

[33]  D. Mackinnon,et al.  Effects of a multidimensional anabolic steroid prevention intervention. The Adolescents Training and Learning to Avoid Steroids (ATLAS) Program. , 1996, JAMA.

[34]  Bipin Kumar Rai,et al.  OCR based medical prescription and report analyzer , 2022, AIP Conference Proceedings.

[35]  B. K. Tripathy,et al.  A Survey on Deep Learning Methodologies of Recent Applications , 2021, Studies in Big Data.

[36]  Soumya K. Ghosh,et al.  Optical Character Recognition Systems , 2017 .

[37]  Yafang Xue,et al.  Optical Character Recognition , 2022 .

[38]  Shaikh Abdul Hannan,et al.  AN OVERVIEW AND APPLICATIONS OF OPTICAL CHARACTER RECOGNITION , 2014 .

[39]  Kerry B. Bernes,et al.  Life After Sport: Athletic Career Transition and Transferable Skills , 2009 .

[40]  A. Nathan The Possible Effect of Steroids on Home-Run Production , 2009 .

[41]  H. Alaranta,et al.  Use of Prescription Drugs in Athletes , 2008, Sports medicine.

[42]  N. Robinson,et al.  Detection window of Darbepoetin-alpha following one single subcutaneous injection. , 2007, Clinica chimica acta; international journal of clinical chemistry.

[43]  R. Casaburi,et al.  The effects of supraphysiologic doses of testosterone on muscle size and strength in normal men. , 1996, The New England journal of medicine.