TY - JOUR
T1 - Direct Inference of Base-Pairing Probabilities with Neural Networks Improves Prediction of RNA Secondary Structures with Pseudoknots
AU - Akiyama, Manato
AU - Sakakibara, Yasubumi
AU - Sato, Kengo
PY - 2022/11/18
Y1 - 2022/11/18
N2 - Existing approaches to predicting RNA secondary structures depend on how the secondary structure is decomposed into substructures, that is, the architecture, to define their parameter space. However, architecture dependency has not been sufficiently investigated, especially for pseudoknotted secondary structures. In this study, we propose a novel algorithm for directly inferring base-pairing probabilities with neural networks that do not depend on the architecture of RNA secondary structures, and then implement this approach using two maximum expected accuracy (MEA)-based decoding algorithms: Nussinov-style decoding for pseudoknot-free structures and IPknot-style decoding for pseudoknotted structures. To train the neural networks connected to each base pair, we adopt a max-margin framework, called structured support vector machines (SSVM), as the output layer. Our benchmarks for predicting RNA secondary structures with and without pseudoknots show that our algorithm outperforms existing methods in prediction accuracy.
AB - Existing approaches to predicting RNA secondary structures depend on how the secondary structure is decomposed into substructures, that is, the architecture, to define their parameter space. However, architecture dependency has not been sufficiently investigated, especially for pseudoknotted secondary structures. In this study, we propose a novel algorithm for directly inferring base-pairing probabilities with neural networks that do not depend on the architecture of RNA secondary structures, and then implement this approach using two maximum expected accuracy (MEA)-based decoding algorithms: Nussinov-style decoding for pseudoknot-free structures and IPknot-style decoding for pseudoknotted structures. To train the neural networks connected to each base pair, we adopt a max-margin framework, called structured support vector machines (SSVM), as the output layer. Our benchmarks for predicting RNA secondary structures with and without pseudoknots show that our algorithm outperforms existing methods in prediction accuracy.
KW - RNA secondary structure
KW - deep learning
KW - pseudoknots
UR - http://www.scopus.com/inward/record.url?scp=85142658756&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85142658756&partnerID=8YFLogxK
U2 - 10.3390/genes13112155
DO - 10.3390/genes13112155
M3 - Article
C2 - 36421829
AN - SCOPUS:85142658756
SN - 2073-4425
VL - 13
JO - Genes
JF - Genes
IS - 11
M1 - 2155
ER -