Towards toxic and narcotic medication detection with rotated object detectors

Correspondence: Feifan Wang Email: woodywff@aliyun.com or feifan.wang@nubomed.com Abstract: Recent years have witnessed the advancement of deep learning vision technologies and applications in the medical industry. Intelligent devices for special medication management are in great need of, which requires more precise detection algorithms to identify the specifications and locations. In this work, YOLO (You only look once) based object detectors are tailored for toxic and narcotic medications detection tasks. Specifically, a more flexible annotation with rotated degree ranging from 0◦ to 90◦ and a mask-mapping-based non-maximum suppression method are proposed to achieve a feasible and efficient medication detector aiming at arbitrarily oriented bounding boxes. Extensive experiments demonstrate that the rotated YOLO detectors are more suitable for identifying densely arranged drugs. The best shot mean average precision of the proposed network reaches 0.811 while the inference time is less than 300ms.

[1]  Xue Yang,et al.  Learning Modulated Loss for Rotated Object Detection , 2019, ArXiv.

[2]  John See,et al.  PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments , 2020, ECCV.

[3]  Ting Chen,et al.  Pix2seq: A Language Modeling Framework for Object Detection , 2021, ArXiv.

[4]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[5]  Junchi Yan,et al.  Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence , 2021, ArXiv.

[6]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[7]  Zhaohui Zheng,et al.  Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression , 2019, AAAI.

[8]  Yongchao Gong,et al.  Mask Scoring R-CNN , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[10]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[11]  Zeming Li,et al.  YOLOX: Exceeding YOLO Series in 2021 , 2021, ArXiv.

[12]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Yue Zhang,et al.  SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[14]  Junchi Yan,et al.  Arbitrary-Oriented Object Detection with Circular Smooth Label , 2020, ECCV.

[15]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Junchi Yan,et al.  R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object , 2019, AAAI.

[18]  Bo Liu,et al.  Oriented Object Detection in Aerial Images with Box Boundary-Aware Vectors , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[19]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Zhiqiang Zhou,et al.  CFC-Net: A Critical Feature Capturing Network for Arbitrary-Oriented Object Detection in Remote-Sensing Images , 2021, IEEE Transactions on Geoscience and Remote Sensing.

[21]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22]  Gui-Song Xia,et al.  ReDet: A Rotation-equivariant Detector for Aerial Object Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Stephen Lin,et al.  Swin Transformer: Hierarchical Vision Transformer using Shifted Windows , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[24]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[25]  Gui-Song Xia,et al.  Align Deep Features for Oriented Object Detection , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[26]  Lingjuan Miao,et al.  Sparse Label Assignment for Oriented Object Detection in Aerial Images , 2021, Remote. Sens..

[27]  W. Hall The future of the international drug control system and national drug prohibitions , 2018, Addiction.

[28]  Lingjuan Miao,et al.  Dynamic Anchor Learning for Arbitrary-Oriented Object Detection , 2020, AAAI.