Prediction of evolutionarily conserved interologs in Mus musculus

Sailu Yellaboina, Dawood B. Dudekula, Minoru Ko

Research output: Contribution to journalArticle

18 Citations (Scopus)

Abstract

Background: Identification of protein-protein interactions is an important first step to understand living systems. High-throughput experimental approaches have accumulated large amount of information on protein-protein interactions in human and other model organisms. Such interaction information has been successfully transferred to other species, in which the experimental data are limited. However, the annotation transfer method could yield false positive interologs due to the lack of conservation of interactions when applied to phylogenetically distant organisms. Results: To address this issue, we used phylogenetic profile method to filter false positives in interologs based on the notion that evolutionary conserved interactions show similar patterns of occurrence along the genomes. The approach was applied to Mus musculus, in which the experimentally identified interactions are limited. We first inferred the protein-protein interactions in Mus musculus by using two approaches: i) identifying mouse orthologs of interacting proteins (interologs) based on the experimental protein-protein interaction data from other organisms; and ii) analyzing frequency of mouse ortholog co-occurrence in predicted operons of bacteria. We then filtered possible false-positives in the predicted interactions using the phylogenetic profiles. We found that this filtering method significantly increased the frequency of interacting protein-pairs coexpressed in the same cells/tissues in gene expression omnibus (GEO) database as well as the frequency of interacting protein-pairs shared the similar Gene Ontology (GO) terms for biological processes and cellular localizations. The data supports the notion that phylogenetic profile helps to reduce the number of false positives in interologs. Conclusion: We have developed protein-protein interaction database in mouse, which contains 41109 interologs. We have also developed a web interface to facilitate the use of database http://lgsun.grc.nia.nih.gov/ mppi/.

Original languageEnglish
Article number465
JournalBMC Genomics
Volume9
DOIs
Publication statusPublished - 2008 Oct 8
Externally publishedYes

Fingerprint

Proteins
Databases
Biological Phenomena
Protein Databases
Gene Ontology
Operon
Genome
Bacteria
Gene Expression

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Cite this

Prediction of evolutionarily conserved interologs in Mus musculus. / Yellaboina, Sailu; Dudekula, Dawood B.; Ko, Minoru.

In: BMC Genomics, Vol. 9, 465, 08.10.2008.

Research output: Contribution to journalArticle

Yellaboina, Sailu ; Dudekula, Dawood B. ; Ko, Minoru. / Prediction of evolutionarily conserved interologs in Mus musculus. In: BMC Genomics. 2008 ; Vol. 9.
@article{3af3c6e1e1c24f8bbb68ead24c0c106a,
title = "Prediction of evolutionarily conserved interologs in Mus musculus",
abstract = "Background: Identification of protein-protein interactions is an important first step to understand living systems. High-throughput experimental approaches have accumulated large amount of information on protein-protein interactions in human and other model organisms. Such interaction information has been successfully transferred to other species, in which the experimental data are limited. However, the annotation transfer method could yield false positive interologs due to the lack of conservation of interactions when applied to phylogenetically distant organisms. Results: To address this issue, we used phylogenetic profile method to filter false positives in interologs based on the notion that evolutionary conserved interactions show similar patterns of occurrence along the genomes. The approach was applied to Mus musculus, in which the experimentally identified interactions are limited. We first inferred the protein-protein interactions in Mus musculus by using two approaches: i) identifying mouse orthologs of interacting proteins (interologs) based on the experimental protein-protein interaction data from other organisms; and ii) analyzing frequency of mouse ortholog co-occurrence in predicted operons of bacteria. We then filtered possible false-positives in the predicted interactions using the phylogenetic profiles. We found that this filtering method significantly increased the frequency of interacting protein-pairs coexpressed in the same cells/tissues in gene expression omnibus (GEO) database as well as the frequency of interacting protein-pairs shared the similar Gene Ontology (GO) terms for biological processes and cellular localizations. The data supports the notion that phylogenetic profile helps to reduce the number of false positives in interologs. Conclusion: We have developed protein-protein interaction database in mouse, which contains 41109 interologs. We have also developed a web interface to facilitate the use of database http://lgsun.grc.nia.nih.gov/ mppi/.",
author = "Sailu Yellaboina and Dudekula, {Dawood B.} and Minoru Ko",
year = "2008",
month = "10",
day = "8",
doi = "10.1186/1471-2164-9-465",
language = "English",
volume = "9",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central",

}

TY - JOUR

T1 - Prediction of evolutionarily conserved interologs in Mus musculus

AU - Yellaboina, Sailu

AU - Dudekula, Dawood B.

AU - Ko, Minoru

PY - 2008/10/8

Y1 - 2008/10/8

N2 - Background: Identification of protein-protein interactions is an important first step to understand living systems. High-throughput experimental approaches have accumulated large amount of information on protein-protein interactions in human and other model organisms. Such interaction information has been successfully transferred to other species, in which the experimental data are limited. However, the annotation transfer method could yield false positive interologs due to the lack of conservation of interactions when applied to phylogenetically distant organisms. Results: To address this issue, we used phylogenetic profile method to filter false positives in interologs based on the notion that evolutionary conserved interactions show similar patterns of occurrence along the genomes. The approach was applied to Mus musculus, in which the experimentally identified interactions are limited. We first inferred the protein-protein interactions in Mus musculus by using two approaches: i) identifying mouse orthologs of interacting proteins (interologs) based on the experimental protein-protein interaction data from other organisms; and ii) analyzing frequency of mouse ortholog co-occurrence in predicted operons of bacteria. We then filtered possible false-positives in the predicted interactions using the phylogenetic profiles. We found that this filtering method significantly increased the frequency of interacting protein-pairs coexpressed in the same cells/tissues in gene expression omnibus (GEO) database as well as the frequency of interacting protein-pairs shared the similar Gene Ontology (GO) terms for biological processes and cellular localizations. The data supports the notion that phylogenetic profile helps to reduce the number of false positives in interologs. Conclusion: We have developed protein-protein interaction database in mouse, which contains 41109 interologs. We have also developed a web interface to facilitate the use of database http://lgsun.grc.nia.nih.gov/ mppi/.

AB - Background: Identification of protein-protein interactions is an important first step to understand living systems. High-throughput experimental approaches have accumulated large amount of information on protein-protein interactions in human and other model organisms. Such interaction information has been successfully transferred to other species, in which the experimental data are limited. However, the annotation transfer method could yield false positive interologs due to the lack of conservation of interactions when applied to phylogenetically distant organisms. Results: To address this issue, we used phylogenetic profile method to filter false positives in interologs based on the notion that evolutionary conserved interactions show similar patterns of occurrence along the genomes. The approach was applied to Mus musculus, in which the experimentally identified interactions are limited. We first inferred the protein-protein interactions in Mus musculus by using two approaches: i) identifying mouse orthologs of interacting proteins (interologs) based on the experimental protein-protein interaction data from other organisms; and ii) analyzing frequency of mouse ortholog co-occurrence in predicted operons of bacteria. We then filtered possible false-positives in the predicted interactions using the phylogenetic profiles. We found that this filtering method significantly increased the frequency of interacting protein-pairs coexpressed in the same cells/tissues in gene expression omnibus (GEO) database as well as the frequency of interacting protein-pairs shared the similar Gene Ontology (GO) terms for biological processes and cellular localizations. The data supports the notion that phylogenetic profile helps to reduce the number of false positives in interologs. Conclusion: We have developed protein-protein interaction database in mouse, which contains 41109 interologs. We have also developed a web interface to facilitate the use of database http://lgsun.grc.nia.nih.gov/ mppi/.

UR - http://www.scopus.com/inward/record.url?scp=54549088958&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=54549088958&partnerID=8YFLogxK

U2 - 10.1186/1471-2164-9-465

DO - 10.1186/1471-2164-9-465

M3 - Article

C2 - 18842131

AN - SCOPUS:54549088958

VL - 9

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

M1 - 465

ER -