Identification of trans-splicing sites in Leishmania major using probabilistic methods

Leishmania major, a member of the Kinetoplastida family, is a primitive protozoan that causes a human disease, called leishmaniases, affecting numerous people worldwide. The identification of new drug targets to combat leishmaniases necessitates a thorough understanding of how genomic instructions are transformed into functional proteins. It requires not only the prediction and categorization of all the genes, but also a profound understanding of their regulation. Much of gene regulation may occur through a process known as frarcs-splicing. Trans-splicing, which is mechanistically similar to cissplicing, is the process of cleaving a large polycistronic transcript into smaller monocistronic components. The goal of this project was to establish a model to accurately predict sites where /rans-splicing occurs. After carefully analyzing the data set, a second-order log odds ratio model was created. This method achieved an overall accuracy of 89% in predicting trans-