A practical study of CITES wood species identification by untargeted DART/QTOF, GC/QTOF and LC/QTOF together with machine learning processes and statistical analysis

Abstract Illegal logging and trafficking of endangered timber species has attracted the world's major organized crime groups, with associated deforestation and serious social damage. The inability of traditional methodologies and DNA analysis to readily perform wood identification to the species level for monitoring has stimulated research on chemotyping techniques. In this study, simple wood extraction of endangered rosewoods (Dalbergia spp), amenable to use in the field, produced colorful hues that were suggestive of wood species. A more definitive study was conducted to develop wood species identification procedures using high-resolution quadrupole time-of-flight (QTOF) mass spectrometers interfaced with liquid chromatography (LC), gas chromatography (GC), and Direct Analysis in Real Time (DART). The time consuming process of extracting “identifying” mass spectral ions for species identification, contentious due to their ubiquitous nature, was supplanted by application of machine learning processes. The unbiased software mining of raw data from multiple analytical batches, followed by statistical Random Forest analysis, enabled discrimination between both anatomically and chemotypically similar Dalbergia species. Statistical Principal Component Analysis (PCA) scatterplots with 95% confidence ellipses were visually compelling in showing a differential clustering of Dalbergia from other commonly traded and lookalike wood species. The information rich raw data from GC or LC analyses offered a corroborative, legally defensible, and widely available confirmatory tool in the identification of timber species.

[1]  M. Simmonds,et al.  Dalnigrin, a neoflavonoid marker for the identification of Brazilian rosewood (Dalbergia nigra) in CITES enforcement. , 2010, Phytochemistry.

[2]  N. Isabel,et al.  Chemotyping and identification of protected Dalbergiatimber using gas chromatography quadrupole time of flight mass spectrometry. , 2019, Journal of chromatography. A.

[3]  A. Ramos,et al.  Phylogeography of the endangered rosewood Dalbergia nigra (Fabaceae): insights into the evolutionary history and conservation of the Brazilian Atlantic Forest , 2011, Heredity.

[4]  Prabu Ravindran,et al.  The XyloTron: Flexible, Open-Source, Image-Based Macroscopic Field Identification of Wood Products , 2020, Frontiers in Plant Science.

[5]  Honghai Liu,et al.  Effects of Different Drying Treatments on Preservation of Organic Compounds in Dalbergia bariensis Wood , 2015 .

[6]  A. Lowe,et al.  The Applicat ion of DNA methods to Timber Tracking and Origin Verificat ion , 2011 .

[7]  E. Espinoza,et al.  FORENSIC ANALYSIS OF CITES-PROTECTED DALBERGIA TIMBER FROM THE AMERICAS , 2015 .

[8]  Tereza Cristina Monteiro Pastore,et al.  Near infrared spectroscopy (NIRS) as a potential tool for monitoring trade of similar woods: Discrimination of true mahogany, cedar, andiroba, and curupixá , 2010 .

[9]  R. Cody,et al.  Distinguishing wild from cultivated agarwood (Aquilaria spp.) using direct analysis in real time and time of-flight mass spectrometry. , 2014, Rapid communications in mass spectrometry : RCM.

[10]  Tuo He,et al.  DNA barcoding authentication for the wood of eight endangered Dalbergia timber species using machine learning approaches , 2018, Holzforschung.

[11]  Peter Gasson,et al.  How precise can wood identification be? Wood anatomy’s role in support of the legal timber trade, especially cites , 2011 .

[12]  Yafang Yin,et al.  Extraction and amplification of DNA from aged and archaeological Populus euphratica wood for species identification , 2015 .

[13]  Yafang Yin,et al.  Forensic timber identification: It's time to integrate disciplines to combat illegal logging , 2015 .

[14]  I. Mundo,et al.  Identification of selected CITES-protected Araucariaceae using DART TOFMS , 2017 .

[15]  Hans Beeckman,et al.  Overview of current practices in data analysis for wood identification: a guide for the different timber tracking methods. , 2020 .

[16]  Charles C. Liu,et al.  Timber species identification from chemical fingerprints using direct analysis in real time (DART) coupled to Fourier transform ion cyclotron resonance mass spectrometry (FTICR-MS): comparison of wood samples subjected to different treatments , 2019, Holzforschung.

[17]  C. Meyer,et al.  DNA Barcoding: Error Rates Based on Comprehensive Sampling , 2005, PLoS biology.

[18]  H. Beeckman,et al.  Comparison of species classification models of mass spectrometry data: Kernel Discriminant Analysis vs Random Forest; A case study of Afrormosia (Pericopsis elata (Harms) Meeuwen). , 2017, Rapid communications in mass spectrometry : RCM.

[19]  E. Espinoza,et al.  Evaluating agarwood products for 2-(2-phenylethyl)chromones using direct analysis in real time time-of-flight mass spectrometry. , 2012, Rapid communications in mass spectrometry : RCM.

[20]  Edgard O. Espinoza,et al.  A High Throughput Ambient Mass Spectrometric Approach to Species Identification and Classification from Chemical Fingerprint Signatures , 2015, Scientific Reports.

[21]  J. Tollefson Why deforestation and extinctions make pandemics more likely , 2020, Nature.

[22]  Anmin Huang,et al.  Identification of Three Dalbergia Species Based on Differences in Extractive Components , 2018, Molecules.

[23]  W. Ollis,et al.  Pterocarpans from Dalbergia spruceana , 1978 .

[24]  F. Bongers,et al.  Chemical differentiation of Bolivian Cedrela species as a tool to trace illegal timber trade , 2018, Forestry: An International Journal of Forest Research.

[25]  E. Espinoza,et al.  Analysis of select Dalbergia and trade timber using direct analysis in real time and time-of-flight mass spectrometry for CITES enforcement. , 2012, Rapid Communications in Mass Spectrometry.

[26]  John C. Hermanson,et al.  A brief review of machine vision in the context of automated wood identification systems , 2011 .