Discriminative discovery of transcription factor binding sites from location data

Yuji Kawada, Yasubumi Sakakibara

研究成果: Conference contribution

1 被引用数 (Scopus)

抄録

Motivation: The availability of genome-wide location analyses based on chromatin immunoprecipitation (ChIP) data gives a new insight for in silico analysis of transcriptional regulations. Results: We propose a novel discriminative discovery framework for precisely identifying transcriptional regulatory motifs from both positive and negative samples (sets of upstream sequences of both bound and unbound genes by a transcription factor (TF)) based on the genome-wide location data. In this framework, our goal is to find such discriminative motifs that best explain the location data in the sense that the motifs precisely discriminate the positive samples from the negative ones. First, in order to discover an initial set of discriminative substrings between positive and negative samples, we apply a decision tree learning method which produces a text-classification tree. We extract several clusters consisting of similar substrings from the internal nodes of the learned tree. Second, we start with initial profile-HMMs constructed from each cluster for representing putative motifs and iteratively refine the profile-HMMs to improve the discrimination accuracies. Our genome-wide experimental results on yeast show that our method successfully identifies the consensus sequences for known TFs in the literature and further presents significant performances for discriminating between positive and negative samples in all the TFs, while most other motif detecting methods show very poor performances on the problem of discriminations. Our learned profile-HMMs also improve false negative predictions of ChIP data.

本文言語English
ホスト出版物のタイトルProceedings - 2005 IEEE Computational SystemsBioinformatics Conference, CSB 2005
ページ86-92
ページ数7
DOI
出版ステータスPublished - 2005 12 1
イベント2005 IEEE Computational Systems Bioinformatics Conference, CSB 2005 - Stanford, CA, United States
継続期間: 2005 8 82005 8 11

出版物シリーズ

名前Proceedings - 2005 IEEE Computational Systems Bioinformatics Conference, CSB 2005
2005

Other

Other2005 IEEE Computational Systems Bioinformatics Conference, CSB 2005
国/地域United States
CityStanford, CA
Period05/8/805/8/11

ASJC Scopus subject areas

  • 工学(全般)
  • 医学(全般)

フィンガープリント

「Discriminative discovery of transcription factor binding sites from location data」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル