TY - JOUR
T1 - High solubility of random-sequence proteins consisting of five kinds of primitive amino acids
AU - Doi, Nobuhide
AU - Kakukawa, Koichi
AU - Oishi, Yuko
AU - Yanagawa, Hiroshi
N1 - Funding Information:
The authors thank Dr Toru Tsuji for helpful discussions. This research was supported in part by the Industrial Technology Research Grant Program in ’04 from the NEDO of Japan and by a Grant-in-Aid for Scientific Research and a Special Coordination Fund grant from the MEXT of Japan.
PY - 2005/6
Y1 - 2005/6
N2 - Searching for functional proteins among random-sequence libraries is a major challenge of protein engineering; the difficulties include the poor solubility of many random-sequence proteins. A library in which most of the polypeptides are soluble and stable would therefore be of great benefit. Although modern proteins consist of 20 amino acids, it has been suggested that early proteins evolved from a reduced alphabet. Here, we have constructed a library of random-sequence proteins consisting of only five amino acids, Ala, Gly, Val, Asp and Glu, which are believed to have been the most abundant in the prebiotic environment. Expression and characterization of arbitrarily chosen proteins in the library indicated that five-alphabet random-sequence proteins have higher solubility than do 20-alphabet random-sequence proteins with a similar level of hydrophobicity. The results support the reduced-alphabet hypothesis of the primordial genetic code and should also be helpful in constructing optimized protein libraries for evolutionary protein engineering.
AB - Searching for functional proteins among random-sequence libraries is a major challenge of protein engineering; the difficulties include the poor solubility of many random-sequence proteins. A library in which most of the polypeptides are soluble and stable would therefore be of great benefit. Although modern proteins consist of 20 amino acids, it has been suggested that early proteins evolved from a reduced alphabet. Here, we have constructed a library of random-sequence proteins consisting of only five amino acids, Ala, Gly, Val, Asp and Glu, which are believed to have been the most abundant in the prebiotic environment. Expression and characterization of arbitrarily chosen proteins in the library indicated that five-alphabet random-sequence proteins have higher solubility than do 20-alphabet random-sequence proteins with a similar level of hydrophobicity. The results support the reduced-alphabet hypothesis of the primordial genetic code and should also be helpful in constructing optimized protein libraries for evolutionary protein engineering.
KW - Genetic code
KW - Protein design
KW - Protein evolution
KW - Reduced amino acid alphabet
KW - Synthetic library
UR - http://www.scopus.com/inward/record.url?scp=21744445913&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=21744445913&partnerID=8YFLogxK
U2 - 10.1093/protein/gzi034
DO - 10.1093/protein/gzi034
M3 - Article
C2 - 15928003
AN - SCOPUS:21744445913
SN - 1741-0126
VL - 18
SP - 279
EP - 284
JO - Protein Engineering, Design and Selection
JF - Protein Engineering, Design and Selection
IS - 6
ER -