Automated Classification of Osteosarcoma and Benign Tumors using RNA-seq and Plain X-ray

Osteosarcoma is a prominent bone cancer that typically affects adolescents or people in late adulthood. Early recognition of this disease relies on imaging technologies such as x-ray radiography to detect tumor size and location. This paper aims to differentiate osteosarcoma from benign tumors by analyzing both imaging and RNA-seq data through a combination of image processing and machine learning. In experimental results, the proposed method achieved an Area Under the Receiver Operator Characteristic Curve (AUC) of 0.7272 in three-fold cross-validation, and an AUC of 0.9015 using leave-one-out cross-validation.

[1]  Jae Won Lee,et al.  An extensive comparison of recent classification tools applied to microarray data , 2004, Comput. Stat. Data Anal..

[2]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[3]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[4]  S. Altekruse,et al.  Declining childhood and adolescent cancer mortality , 2014, Cancer.

[5]  Gokmen Zararsiz,et al.  MLSeq: Machine learning interface for RNA-sequencing data , 2019, Comput. Methods Programs Biomed..

[6]  G. Ottaviani,et al.  The epidemiology of osteosarcoma. , 2009, Cancer treatment and research.

[7]  Stephen Hartley,et al.  QoRTs: a comprehensive toolset for quality control and data processing of RNA-Seq experiments , 2015, BMC Bioinformatics.

[8]  Mark Gerstein,et al.  GENCODE reference annotation for the human and mouse genomes , 2018, Nucleic Acids Res..

[9]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[10]  Zile Singh Kundu,et al.  Classification, imaging, biopsy and staging of osteosarcoma , 2014, Indian journal of orthopaedics.

[11]  David Ward,et al.  Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data , 2003, Bioinform..

[12]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[13]  Kayvan Najarian,et al.  Osteosarcoma Patients Classification Using Plain X-Rays and Metabolomic Data , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[14]  Harry Shum,et al.  Lazy snapping , 2004, ACM Trans. Graph..

[15]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.