A Targeted Attack on Black-Box Neural Machine Translation with Parallel Data Poisoning
暂无分享,去创建一个
Francisco Guzmán | Benjamin I. P. Rubinstein | Trevor Cohn | Yuqing Tang | Jun Wang | Chang Xu | Trevor Cohn | Francisco Guzmán | Jun Wang | Yuqing Tang | Chang Xu
[1] Tudor Dumitras,et al. Poison Frogs! Targeted Clean-Label Poisoning Attacks on Neural Networks , 2018, NeurIPS.
[2] Blaine Nelson,et al. Can machine learning be secure? , 2006, ASIACCS '06.
[3] Kenneth Heafield,et al. ParaCrawl: Web-Scale Acquisition of Parallel Corpora , 2020, ACL.
[4] Ben Y. Zhao,et al. Neural Cleanse: Identifying and Mitigating Backdoor Attacks in Neural Networks , 2019, 2019 IEEE Symposium on Security and Privacy (SP).
[5] Myle Ott,et al. fairseq: A Fast, Extensible Toolkit for Sequence Modeling , 2019, NAACL.
[6] Marjan Ghazvininejad,et al. Multilingual Denoising Pre-training for Neural Machine Translation , 2020, Transactions of the Association for Computational Linguistics.
[7] Brendan Dolan-Gavitt,et al. BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain , 2017, ArXiv.
[8] Patrick D. McDaniel,et al. Transferability in Machine Learning: from Phenomena to Black-Box Attacks using Adversarial Samples , 2016, ArXiv.
[9] Yann Dauphin,et al. Convolutional Sequence to Sequence Learning , 2017, ICML.
[10] Philipp Koehn,et al. Zipporah: a Fast and Scalable Data Cleaning System for Noisy Web-Crawled Parallel Corpora , 2017, EMNLP.
[11] Sameer Singh,et al. Generating Natural Adversarial Examples , 2017, ICLR.
[12] Graham Neubig,et al. Weight Poisoning Attacks on Pretrained Models , 2020, ACL.
[13] Ahmed Abdelali,et al. The AMARA Corpus: Building Parallel Language Resources for the Educational Domain , 2014, LREC.
[14] Myle Ott,et al. Understanding Back-Translation at Scale , 2018, EMNLP.
[15] Shi Feng,et al. Customizing Triggers with Concealed Data Poisoning , 2020, ArXiv.
[16] Xu Tan,et al. MASS: Masked Sequence to Sequence Pre-training for Language Generation , 2019, ICML.
[17] Ling Huang,et al. ANTIDOTE: understanding and defending against poisoning of anomaly detectors , 2009, IMC '09.
[18] Timothy Baldwin,et al. langid.py: An Off-the-shelf Language Identification Tool , 2012, ACL.
[19] Myle Ott,et al. Facebook FAIR’s WMT19 News Translation Task Submission , 2019, WMT.
[20] Dawn Xiaodong Song,et al. Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning , 2017, ArXiv.
[21] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[22] Holger Schwenk,et al. CCMatrix: Mining Billions of High-Quality Parallel Sentences on the WEB , 2019, ArXiv.
[23] Jure Leskovec,et al. Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes , 2016, WWW.
[24] Jörg Tiedemann,et al. OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles , 2016, LREC.
[25] Lawrence K. Saul,et al. Beyond blacklists: learning to detect malicious web sites from suspicious URLs , 2009, KDD.
[26] D. Song,et al. Imitation Attacks and Defenses for Black-box Machine Translation Systems , 2020, EMNLP.
[27] Mauro Cettolo,et al. The IWSLT 2016 Evaluation Campaign , 2016, IWSLT.
[28] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.
[29] Graham Neubig,et al. On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models , 2019, NAACL.
[30] Fabio Roli,et al. Towards Poisoning of Deep Learning Algorithms with Back-gradient Optimization , 2017, AISec@CCS.
[31] Philipp Koehn,et al. A Massive Collection of Cross-Lingual Web-Document Pairs , 2019, EMNLP.
[32] Yong Cheng,et al. Robust Neural Machine Translation with Doubly Adversarial Inputs , 2019, ACL.
[33] András Kornai,et al. Parallel corpora for medium density languages , 2007 .
[34] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[35] Damien Deville,et al. SpyProxy: Execution-based Detection of Malicious Web Content , 2007, USENIX Security Symposium.
[36] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[37] Víctor M. Sánchez-Cartagena,et al. Prompsit’s submission to WMT 2018 Parallel Corpus Filtering shared task , 2018, WMT.
[38] Dejing Dou,et al. On Adversarial Examples for Character-Level Neural Machine Translation , 2018, COLING.
[39] Gabriel Bernier-Colborne,et al. NRC Parallel Corpus Filtering System for WMT 2019 , 2019, WMT.
[40] Holger Schwenk,et al. WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia , 2019, EACL.
[41] Yonatan Belinkov,et al. Synthetic and Natural Noise Both Break Neural Machine Translation , 2017, ICLR.
[42] Matt Post,et al. A Call for Clarity in Reporting BLEU Scores , 2018, WMT.
[43] Percy Liang,et al. Certified Defenses for Data Poisoning Attacks , 2017, NIPS.
[44] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.
[45] Jörg Tiedemann,et al. Parallel Data, Tools and Interfaces in OPUS , 2012, LREC.
[46] Jacob Eisenstein,et al. AdvAug: Robust Adversarial Augmentation for Neural Machine Translation , 2020, ACL.
[47] Giovanni Vigna,et al. Prophiler: a fast filter for the large-scale detection of malicious web pages , 2011, WWW.
[48] Philipp Koehn,et al. Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions , 2019, WMT.
[49] Paolo Rosso,et al. SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter , 2019, *SEMEVAL.
[50] Sameer Singh,et al. Universal Adversarial Triggers for Attacking and Analyzing NLP , 2019, EMNLP.
[51] Philipp Koehn,et al. Low-Resource Corpus Filtering Using Multilingual Sentence Embeddings , 2019, WMT.
[52] Rico Sennrich,et al. Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.
[53] Jinfeng Yi,et al. Seq2Sick: Evaluating the Robustness of Sequence-to-Sequence Models with Adversarial Examples , 2018, AAAI.