Detection of ChatGPT Fake Science with the xFakeBibs Learning Algorithm

ChatGPT is becoming a new reality. In this paper, we demonstrate a method for distinguishing ChatGPT-generated publications from those produced by scientists. The objective of this work is to introduce a newly designed, supervised, network-driven algorithm that predicts machine-generated content. The premise is that ChatGPT content exhibits distinctive behavior that sets it apart from scientific articles. The algorithm was trained and tested on publications covering three diseases, with each model constructed from 100 abstracts. Additionally, the algorithm underwent k-fold calibration (depending on the availability of the data) to establish a lower-upper bound range of acceptance. The network training model of ChatGPT showed fewer nodes and more edges than the models of real article abstracts. The algorithm was executed in single-mode to predict the class of one type of dataset at a time and achieved >94%. It was also executed in multi-mode on mixed documents of ChatGPT and PubMed abstracts. The algorithm predicted real articles with a precision of 100% and, on rare occasions, 96%-98%. However, ChatGPT content was often misclassified as real publications with up to 88% accuracy in all datasets of the three diseases. Our results also showed that the publication year of the articles mixed with ChatGPT-generated content may be a factor in detecting the correct class: the older the publication, the better the prediction.
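The node and edge comparison and the lower-upper bound calibration described above can be illustrated with a minimal Python sketch. The abstract does not say how the network is built or which statistic is calibrated, so everything here is an assumption made for illustration: words are treated as nodes, adjacent word pairs (bigrams) as undirected edges, and the calibrated statistic is the edge-to-node ratio. The function names `bigram_network`, `edge_node_ratio`, `acceptance_range`, and `in_range` are hypothetical and are not taken from xFakeBibs.

```python
import re


def bigram_network(abstracts):
    """Build a word network from a list of texts (assumed construction).

    Nodes are distinct lowercase words; edges are distinct undirected
    pairs of adjacent words. Self-loops (a word followed by itself)
    are skipped.
    """
    nodes, edges = set(), set()
    for text in abstracts:
        words = re.findall(r"[a-z]+", text.lower())
        nodes.update(words)
        for left, right in zip(words, words[1:]):
            if left != right:
                edges.add(frozenset((left, right)))
    return nodes, edges


def edge_node_ratio(abstracts):
    """Edges per node; an assumed summary statistic, not the paper's."""
    nodes, edges = bigram_network(abstracts)
    return len(edges) / len(nodes) if nodes else 0.0


def acceptance_range(folds):
    """Lower and upper bound of the ratio over the calibration folds."""
    ratios = [edge_node_ratio(fold) for fold in folds]
    return min(ratios), max(ratios)


def in_range(abstracts, bounds):
    """True if the ratio of the given texts falls inside the bounds."""
    low, high = bounds
    return low <= edge_node_ratio(abstracts) <= high


if __name__ == "__main__":
    # Toy calibration folds standing in for groups of real abstracts.
    folds = [["a b c"], ["a b c a"]]
    bounds = acceptance_range(folds)
    print(bounds, in_range(["b c d b"], bounds))
```

Under these assumptions, a corpus with fewer distinct words but more distinct word pairs yields a higher ratio, which matches the direction of the difference reported for the ChatGPT training model. How the actual algorithm turns that difference into a class prediction is not stated in the abstract.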
