TY - JOUR
T1 - Inferring rules of Escherichia coli translational efficiency using an artificial neural network
AU - Mori, Koya
AU - Saito, Rintaro
AU - Kikuchi, Shinichi
AU - Tomita, Masaru
N1 - Funding Information:
We thank Yuko Osada and Dr. Yoshiaki Ohashi of the Institute for Advanced Biosciences, Keio University, for their helpful discussions. This work was supported by a grant from the Ministry of Education, Culture, Sports, Science and Technology; a Grant-in-Aid from the 21st Century Center of Excellence (COE) program entitled “Understanding and control of life's function via systems biology” (Keio University); a grant from the New Energy and Industrial Technology Development and Organization (NEDO) of the Ministry of Economy, Trade and Industry of Japan (Development of a Technological Infrastructure for Industrial Bioprocess Project); a grant from the Japan Science and Technology Agency (JST).
PY - 2007/9
Y1 - 2007/9
N2 - Although the machinery for translation initiation in Escherichia coli is very complicated, the translational efficiency has been reported to be predictable from upstream oligonucleotide sequences. Conventional models have difficulties in their generalization ability and prediction nonlinearity and in their ability to deal with a variety of input attributions. To address these issues, we employed structural learning by artificial neural networks to infer general rules for translational efficiency. The correlation between translational activities measured by biological experiments and those predicted by our method in the test data was significant (r = 0.78), and our method uncovered underlying rules of translational activities and sequence patterns from the obtained skeleton structure. The significant rules for predicting translational efficiency were (1) G- and A-rich oligonucleotide sequences, resembling the Shine-Dalgarno sequence, at positions -10 to -7; (2) first base A in the initiation codon; (3) transport/binding or amino acid metabolism gene function; (4) high binding energy between mRNA and 16S rRNA at positions -15 to -5. An additional inferred novel rule was that C at position -1 increases translational efficiency. When our model was applied to the entire genomic sequence of E. coli, translational activities of genes for metabolism and translational were significantly high.
AB - Although the machinery for translation initiation in Escherichia coli is very complicated, the translational efficiency has been reported to be predictable from upstream oligonucleotide sequences. Conventional models have difficulties in their generalization ability and prediction nonlinearity and in their ability to deal with a variety of input attributions. To address these issues, we employed structural learning by artificial neural networks to infer general rules for translational efficiency. The correlation between translational activities measured by biological experiments and those predicted by our method in the test data was significant (r = 0.78), and our method uncovered underlying rules of translational activities and sequence patterns from the obtained skeleton structure. The significant rules for predicting translational efficiency were (1) G- and A-rich oligonucleotide sequences, resembling the Shine-Dalgarno sequence, at positions -10 to -7; (2) first base A in the initiation codon; (3) transport/binding or amino acid metabolism gene function; (4) high binding energy between mRNA and 16S rRNA at positions -15 to -5. An additional inferred novel rule was that C at position -1 increases translational efficiency. When our model was applied to the entire genomic sequence of E. coli, translational activities of genes for metabolism and translational were significantly high.
KW - Neural network
KW - Prediction
KW - Shine-Dalgarno sequence
KW - Structural learning method
KW - Translation efficiency
UR - http://www.scopus.com/inward/record.url?scp=34548437657&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34548437657&partnerID=8YFLogxK
U2 - 10.1016/j.biosystems.2006.10.005
DO - 10.1016/j.biosystems.2006.10.005
M3 - Article
C2 - 17150301
AN - SCOPUS:34548437657
SN - 0303-2647
VL - 90
SP - 414
EP - 420
JO - Currents in modern biology
JF - Currents in modern biology
IS - 2
ER -