Validating the significance of genomic properties of Chi sites from the distribution of all octamers in Escherichia coli

Kazuharu Arakawa, Reina Uno, Yoichi Nakayama, Masaru Tomita

Research output: Contribution to journal (Article)

14 Citations (Scopus)

Abstract

Chi sites (5′-GCTGGTGG-3′) are octamer sequences that act as hotspots of homologous recombination and attenuate the exonuclease activity of RecBCD in Escherichia coli. They are overrepresented in the genome (1008 occurrences), preferentially located within coding regions (98%), oriented in the direction of replication (75%), and occur most commonly on the mRNA-synonymous sense strand of the double helix (79%). Previous statistical studies of the genome sequence suggested that these genomic properties of Chi sites are related to their role in recombinational repair and therefore to replication and transcription. In this study, we employ three mathematical models to predict the properties of Chi sites from single-nucleotide and multi-nucleotide compositions, and validate them statistically using the distribution of all octamer sequences in the entire genome, or exclusively within ORFs. The model based on the overall distribution of all octamers provided better predictions than the single-nucleotide composition model, and the ORF and sense-strand preferences of Chi sites were shown to be within the standard deviation of all octamers. In contrast, the orientation bias of the Chi sites in the direction of replication was significant, although the bias was not as pronounced as under the single-nucleotide composition model, suggesting a selective pressure related to the role of RecBCD in replication.
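The kind of comparison the abstract describes can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the independence assumption in the single-nucleotide expectation, and the choice to compute the mean and standard deviation only over octamers that actually occur in the sequence are all assumptions made here for illustration.

```python
from collections import Counter
from statistics import mean, pstdev

CHI = "GCTGGTGG"  # Chi octamer, written 5' to 3'
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_kmers(seq: str, k: int = 8) -> Counter:
    """Count every overlapping k-mer on the given strand."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def strand_counts(kmer: str, seq: str) -> tuple[int, int]:
    """Occurrences of kmer on the given strand and on its complement.

    A hit on the complementary strand appears on the given strand
    as the reverse complement of kmer.
    """
    counts = count_kmers(seq, len(kmer))
    return counts[kmer], counts[reverse_complement(kmer)]


def expected_count(kmer: str, seq: str) -> float:
    """Expected single-strand occurrences under a single-nucleotide model.

    Assumption (hypothetical simplification): bases are independent,
    so the k-mer probability is the product of its base frequencies.
    """
    n = len(seq)
    freqs = Counter(seq)
    p = 1.0
    for base in kmer:
        p *= freqs[base] / n
    return p * (n - len(kmer) + 1)


def z_score(kmer: str, counts: Counter) -> float:
    """Position of one k-mer's count within the distribution of all counts.

    Assumption: only k-mers observed at least once contribute to the
    mean and standard deviation.
    """
    values = list(counts.values())
    sd = pstdev(values)
    if sd == 0:
        return 0.0
    return (counts[kmer] - mean(values)) / sd
```

Given a genome string `genome` (hypothetical variable), `strand_counts(CHI, genome)` gives the raw strand split, `expected_count(CHI, genome)` gives the single-nucleotide baseline, and `z_score(CHI, count_kmers(genome))` places Chi within the distribution of all observed octamers. Relating the strand split to the direction of replication would additionally require the positions of the replication origin and terminus, which this sketch does not model.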

Original language: English
Pages (from-to): 239-246
Number of pages: 8
Journal: Gene
Volume: 392
Issue number: 1-2
Publication status: Published - 2007 May 1

Keywords

  • Bioinformatics
  • Homologous recombination
  • Orientation bias
  • RecBCD
  • Strand bias

ASJC Scopus subject areas

  • Genetics
