论文信息 - Detection of ChatGPT Fake Science with the xFakeBibs Learning Algorithm

Detection of ChatGPT Fake Science with the xFakeBibs Learning Algorithm

ChatGPT is becoming a new reality. In this paper, we demonstrate a method for distinguishing ChatGPT-generated publications from those produced by scientists. The objective of this work is to introduce a newly designed supervised network-driven algorithm that illustrates how to predict machine-generated content. The premise is that ChatGPT content exhibits behavior that is distinctive and can be set apart from scientific articles. The algorithm was trained and tested on three disease-specific publications, with each model constructed from 100 abstracts. Additionally, the algorithm underwent k-Folds calibration (depending on the availability of the data) to establish a lower-upper bound range of acceptance. The network training model of ChatGPT showed a lower number of nodes and a higher number of edges when compared with models of real article abstracts. The algorithm was executed in single-mode to predict the class of one type of dataset at a time and achieved >94%. It was also executed in multi-mode on mixed documents of ChatGPT and PubMed abstracts. The algorithm remarkably predicted real articles with a precision of 100% and, on rare occasions, 96%-98%. However, ChatGPT content was often misclassified as real publications with up to 88% accuracy in all datasets of the three diseases. Our results also showed that the year of publications mixed with ChatGPT-generated content may play a factor in detecting the correct class, where the older the publication, the better the prediction.

Xin Wu | A. Hamed

[1] A. Hamed,et al. Safeguarding authenticity for mitigating the harms of generative AI: Issues, research agenda, and policies for detection, fact-checking, and ethical AI , 2024, iScience.

[2] Gemma Conroy. How ChatGPT and other AI tools could disrupt scientific publishing , 2023, Nature.

[3] H. Rashidi,et al. The ChatGPT conundrum: Human-generated scientific manuscripts misidentified as AI creations by AI text detection tool , 2023, Journal of pathology informatics.

[4] Ahmed M. Elkhatat,et al. Evaluating the efficacy of AI content detection tools in differentiating between human and AI-generated text , 2023, International Journal for Educational Integrity.

[5] Detecting AI content in responses generated by ChatGPT, YouChat, and Chatsonic: The case of five AI content detection tools , 2023, Journal of Applied Learning & Teaching.

[6] Ilker Cingillioglu. Detecting AI-generated essays: the ChatGPT challenge , 2023, The International Journal of Information and Learning Technology.

[7] B. Bozkurt,et al. JACC Journals' Pathway Forward With AI Tools: The Future Is Now. , 2023, Journal of the American College of Cardiology.

[8] G. Eysenbach. The Role of ChatGPT, Generative Language Models, and Artificial Intelligence in Medical Education: A Conversation With ChatGPT and a Call for Papers , 2023, JMIR medical education.

[9] E. Verhagen,et al. AI did not write this manuscript, or did it? Can we trick the AI text detector into generated texts? The potential future of ChatGPT and AI in Sports & Exercise Medicine manuscript generation , 2023, BMJ Open Sport & Exercise Medicine.

[10] A. Flanagin,et al. Nonhuman "Authors" and Implications for the Integrity of Scientific Publication and Medical Knowledge. , 2023, JAMA.

[11] R. Frederickson,et al. Addressing the big business of fake science. , 2022, Molecular therapy : the journal of the American Society of Gene Therapy.

[12] Larissa Deadame de Figueiredo Nicolete,et al. The impact of fake news on social media and its influence on health during the COVID-19 pandemic: a systematic review , 2021, Journal of Public Health.

[13] Xindong Wu,et al. Fighting the COVID-19 Infodemic in News articles and False Publications: The NeoNet Text Classifier, a Supervised Machine Learning Algorithm , 2021, Applied Sciences.

[14] H. Larson,et al. Measuring the impact of COVID-19 vaccine misinformation on vaccination intent in the UK and USA , 2021, Nature Human Behaviour.

[15] Nathan Walter,et al. Evaluating the Impact of Attempts to Correct Health Misinformation on Social Media: A Meta-Analysis , 2020, Health communication.

[16] Francesco Scotognella,et al. Link and Node Removal in Real Social Networks: A Review , 2020, Frontiers in Physics.

[17] S. Ho,et al. Let’s nab fake science news: Predicting scientists’ support for interventions using the influence of presumed media influence model , 2020, Journalism.

[18] Stephen A. Matlin,et al. Fake science and the knowledge crisis: ignorance can be fatal , 2019, Royal Society Open Science.

[19] Shahzad Qaiser,et al. Text Mining: Use of TF-IDF to Examine the Relevance of Words to Documents , 2018, International Journal of Computer Applications.

[20] Monika Mital,et al. Knowledge discovery out of text data: a systematic review via text mining , 2018, J. Knowl. Manag..

[21] Igor Linkov,et al. Stability of a giant connected component in a complex network. , 2017, Physical review. E.

[22] D. Shiffman,et al. Fish tales: Combating fake science in popular media , 2015 .

[23] André Calero Valdez,et al. On Graph Entropy Measures for Knowledge Discovery from Publication Network Data , 2013, CD-ARES.

[24] Kam-Fai Wong,et al. Interpreting TF-IDF term weights as making relevance decisions , 2008, TOIS.

[25] Graeme Hirst,et al. Bigrams of Syntactic Labels for Authorship Discrimination of Short Texts , 2007, Lit. Linguistic Comput..

[26] R. Linsker,et al. Improving network robustness by edge modification , 2005 .

[27] M. Sacks,et al. Special issue , 2003, Journal of biomechanical engineering.

[28] Yuan-Fang Wang,et al. The use of bigrams to enhance text categorization , 2002, Inf. Process. Manag..

[29] S. N. Dorogovtsev,et al. Giant strongly connected component of directed networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30] Vladimir Vapnik,et al. An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[31] D. Moir. Medicine , 1894, The Indian medical gazette.

[32] Hongbo Duan,et al. Network stability, connectivity and innovation output , 2017 .

[33] Bruno Trstenjak,et al. on Intelligent Manufacturing and Automation , 2013 KNN with TF-IDF Based Framework for Text Categorization , 2014 .

[34] Ullrich K. H. Ecker,et al. Psychological Science in the Public Interest , in press Misinformation and its Correction : Continued Influence and Successful Debiasing , 2012 .

[35] M. Myers,et al. Misinformation about Vaccines , 2009 .

[36] Chaomei Chen,et al. CiteSpace II: Visualization and Knowledge Discovery in Bibliographic Databases , 2005, AMIA.

[37] Juan Enrique Ramos,et al. Using TF-IDF to Determine Word Relevance in Document Queries , 2003 .

[38] Akiko Aizawa,et al. An information-theoretic perspective of tf-idf measures , 2003, Inf. Process. Manag..