The off-target effect occurring in the CRISPR-Cas9 system has been a challenging problem for the practical application of this gene editing technology. In recent years, various prediction models have been… Click to show full abstract
The off-target effect occurring in the CRISPR-Cas9 system has been a challenging problem for the practical application of this gene editing technology. In recent years, various prediction models have been proposed to predict potential off-target activities. However, most of the existing prediction methods do not fully exploit guide RNA (gRNA) and DNA sequence pair information effectively. In addition, available prediction methods usually ignore the noise effect in original off-target datasets. To address these issues, we design a novel coding scheme, which considers the key features of mismatch type, mismatch location and the gRNA-DNA sequence pair information. Furthermore, a transformer-based anti-noise model called CrisprDNT is developed to solve the noise problem that exists in the off-target data. Experimental results of eight existing datasets demonstrate that the method with the inclusion of the anti-noise loss functions is superior to available state-of-the-art prediction methods. CrisprDNT is available at https://github.com/gzrgzx/CrisprDNT.
               
Click one of the above tabs to view related content.