TY - JOUR
T1 - COPICAT
T2 - A software system for predicting interactions between proteins and chemical compounds
AU - Sakakibara, Yasubumi
AU - Hachiya, Tsuyoshi
AU - Uchida, Miho
AU - Nagamine, Nobuyoshi
AU - Sugawara, Yohei
AU - Yokota, Masahiro
AU - Nakamura, Masaomi
AU - Popendorf, Kris
AU - Komori, Takashi
AU - Sato, Kengo
N1 - Funding Information:
Funding: Bioinformatics research and development from the Japan Science and Technology Agency, in part.
PY - 2012/3
Y1 - 2012/3
N2 - Summary: Since tens of millions of chemical compounds have been accumulated in public chemical databases, fast comprehensive computational methods to predict interactions between chemical compounds and proteins are needed for virtual screening of lead compounds. Previously, we proposed a novel method for predicting protein-chemical interactions using two-layer Support Vector Machine classifiers that require only readily available biochemical data, i.e. amino acid sequences of proteins and structure formulas of chemical compounds. In this article, the method has been implemented as the COPICAT web service, with an easy-to-use front-end interface. Users can simply submit a protein-chemical interaction prediction job using a pre-trained classifier, or can even train their own classification model by uploading training data. COPICAT's fast and accurate computational prediction has enhanced lead compound discovery against a database of tens of millions of chemical compounds, implying that the search space for drug discovery is extended by >1000 times compared with currently well-used high-throughput screening methodologies.
AB - Summary: Since tens of millions of chemical compounds have been accumulated in public chemical databases, fast comprehensive computational methods to predict interactions between chemical compounds and proteins are needed for virtual screening of lead compounds. Previously, we proposed a novel method for predicting protein-chemical interactions using two-layer Support Vector Machine classifiers that require only readily available biochemical data, i.e. amino acid sequences of proteins and structure formulas of chemical compounds. In this article, the method has been implemented as the COPICAT web service, with an easy-to-use front-end interface. Users can simply submit a protein-chemical interaction prediction job using a pre-trained classifier, or can even train their own classification model by uploading training data. COPICAT's fast and accurate computational prediction has enhanced lead compound discovery against a database of tens of millions of chemical compounds, implying that the search space for drug discovery is extended by >1000 times compared with currently well-used high-throughput screening methodologies.
UR - http://www.scopus.com/inward/record.url?scp=84863277948&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863277948&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/bts031
DO - 10.1093/bioinformatics/bts031
M3 - Article
C2 - 22257668
AN - SCOPUS:84863277948
SN - 1367-4803
VL - 28
SP - 745
EP - 746
JO - Bioinformatics
JF - Bioinformatics
IS - 5
M1 - bts031
ER -