Motivation: Capturing long‐range interactions between structural but not sequence neighbors of proteins is a long‐standing challenging problem in bioinformatics. Recently, long short‐term memory (LSTM) networks have significantly improved the accuracy… Click to show full abstract
Motivation: Capturing long‐range interactions between structural but not sequence neighbors of proteins is a long‐standing challenging problem in bioinformatics. Recently, long short‐term memory (LSTM) networks have significantly improved the accuracy of speech and image classification problems by remembering useful past information in long sequential events. Here, we have implemented deep bidirectional LSTM recurrent neural networks in the problem of protein intrinsic disorder prediction. Results: The new method, named SPOT‐Disorder, has steadily improved over a similar method using a traditional, window‐based neural network (SPINE‐D) in all datasets tested without separate training on short and long disordered regions. Independent tests on four other datasets including the datasets from critical assessment of structure prediction (CASP) techniques and >10 000 annotated proteins from MobiDB, confirmed SPOT‐Disorder as one of the best methods in disorder prediction. Moreover, initial studies indicate that the method is more accurate in predicting functional sites in disordered regions. These results highlight the usefulness combining LSTM with deep bidirectional recurrent neural networks in capturing non‐local, long‐range interactions for bioinformatics applications. Availability and Implementation: SPOT‐disorder is available as a web server and as a standalone program at: http://sparks‐lab.org/server/SPOT‐disorder/index.php. Contact: [email protected] or [email protected] or [email protected] Supplementary information: Supplementary data is available at Bioinformatics online.
               
Click one of the above tabs to view related content.