Artificial Data Generation with Language Models for Imbalanced Classification in Maintenance

[1]  Ioannis Korkontzelos,et al.  Bug Severity Prediction Using a Hierarchical One-vs.-Remainder Approach , 2019, NLDB.

[2]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[3]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[4]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[5]  Yejin Choi,et al.  The Curious Case of Neural Text Degeneration , 2019, ICLR.

[6]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[7]  Bin Wu,et al.  Using Improved Conditional Generative Adversarial Networks to Detect Social Bots on Twitter , 2020, IEEE Access.

[8]  Samir Lamouri,et al.  Estimation of Production Inhibition Time Using Data Mining to Improve Production Planning and Control , 2019, 2019 International Conference on Industrial Engineering and Systems Management (IESM).

[9]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[10]  Kazuhiko Tsuda,et al.  A Management Method of the Corporate Brand Image Based on Customers' Perception , 2018, KES.

[11]  Taghi M. Khoshgoftaar,et al.  Survey on deep learning with class imbalance , 2019, J. Big Data.

[12]  Andrew Kusiak,et al.  Data-driven smart manufacturing , 2018, Journal of Manufacturing Systems.

[13]  Samir Lamouri,et al.  Machine learning applied in production planning and control: a state-of-the-art in the era of industry 4.0 , 2020, Journal of Intelligent Manufacturing.

[14]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[15]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[16]  Xavier-Andoni Tibau,et al.  Why Cohen’s Kappa should be avoided as performance measure in classification , 2019, PloS one.

[17]  Laurent Romary,et al.  CamemBERT: a Tasty French Language Model , 2019, ACL.

[18]  Peter Christen,et al.  A note on using the F-measure for evaluating record linkage algorithms , 2017, Statistics and Computing.

[19]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[20]  Rico Sennrich,et al.  Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.

[21]  David Masko,et al.  The Impact of Imbalanced Training Data for Convolutional Neural Networks , 2015 .

[22]  Longbing Cao,et al.  Training deep neural networks on imbalanced data sets , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[23]  T. V. Geetha,et al.  Cross-Corpus Training with CNN to Classify Imbalanced Biomedical Relation Data , 2019, NLDB.

[24]  Pingyu Jiang,et al.  Manifold learning based rescheduling decision mechanism for recessive disturbances in RFID-driven job shops , 2016, Journal of Intelligent Manufacturing.

[25]  J. Benach,et al.  Evaluation of the effectiveness and equity of the maternity protection reform in Chile from 2000 to 2015 , 2019, PloS one.

[26]  Thomas Wolf,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.