Over-representation of Chi sequences caused by di-codon increase in Escherichia coli K-12

Reina Uno, Yoichi Nakayama, Masaru Tomita

Research output: Contribution to journal › Article

8 Citations (Scopus)

Abstract

Chi sequences (5′-GCTGGTGG-3′) are cis-acting 8 bp sequence elements that enhance homologous recombination promoted by the RecBCD pathway in Escherichia coli. The genome of E. coli K-12 MG1655 contains 1009 Chi sequences, a frequency that far exceeds the expected value for an 8 bp sequence in a genome of this size. It is generally thought that this over-representation indicates that Chi sequences have been selected for during evolution because of their function in recombination. The genes from three E. coli strains (K-12, O157 and CFT) were classified into three categories (island, match to other E. coli, and backbone). Island genes differ from backbone genes in base composition and codon usage, and were therefore considered relatively new and not yet adapted to the base composition patterns and codon usage typical of the recipient genome. The over-representation of Chi sequences was examined by comparing Chi frequencies and codon frequencies between island and backbone genes. The difference in CTGGTG di-codon frequency between backbone and island genes was correlated with the frequency of Chi sequences translated in the Leu-Val (-G|CTG|GTG|G-) reading frame in the K-12 strain. These results suggest that Chi sequences in the main reading frame increased as a result of the di-codon CTG-GTG increasing under a genome-wide pressure to adapt to the codon usage and base composition of the E. coli K-12 strain, and that the RecBCD recombinase might have adjusted its recognition sequence to a frequently occurring oligomer such as G-CTG-GTG-G.
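The two quantities the abstract relies on, the expected count of an 8 bp sequence and the reading frame of each Chi occurrence, can be illustrated with a minimal Python sketch. This is not the authors' method. The function names, the uniform independent-base model, the genome length of about 4.64 Mb, and the toy coding sequence are all assumptions made for illustration.

```python
CHI = "GCTGGTGG"  # Chi sequence as given in the abstract (5'->3')


def expected_kmer_count(genome_length, k, both_strands=True):
    """Expected occurrences of one specific k-mer.

    Assumes each base is independent and equally likely (probability 1/4),
    which ignores the real base composition of the genome.
    """
    positions = genome_length - k + 1
    per_strand = positions / 4 ** k
    return 2 * per_strand if both_strands else per_strand


def chi_frames_in_cds(cds, motif=CHI):
    """Return the codon offset (0, 1 or 2) of every motif start in a CDS.

    The CDS is assumed to start at codon position 0. An offset of 2 means
    the first G is the last base of a codon, so CTG and GTG are read as
    whole codons: the Leu-Val frame (-G|CTG|GTG|G-).
    """
    frames = []
    start = cds.find(motif)
    while start != -1:
        frames.append(start % 3)
        start = cds.find(motif, start + 1)  # allow overlapping hits
    return frames


# Assumed genome length of roughly 4.64 Mb for E. coli K-12 MG1655.
expected = expected_kmer_count(4_640_000, len(CHI))

# Hypothetical toy CDS: ATG AAG CTG GTG GAA TAA
toy_cds = "ATGAAGCTGGTGGAATAA"
frames = chi_frames_in_cds(toy_cds)
```

Under these assumptions the expected count over both strands is on the order of 140, well below the 1009 Chi sequences reported for K-12 MG1655. In the toy sequence the single Chi occurrence has offset 2, so it lies in the Leu-Val frame.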

Original language: English
Pages (from-to): 30-37
Number of pages: 8
Journal: Gene
Volume: 380
Issue number: 1
DOIs: https://doi.org/10.1016/j.gene.2006.05.013
Publication status: Published - 2006 Sep 15

Fingerprint

Codon
Escherichia coli
Islands
Base Composition
Reading Frames
Genes
Genome
Genome Size
Recombinases
Homologous Recombination
Genetic Recombination
Pressure

Keywords

  • Chi sequence
  • Codon usage
  • Comparative genomics
  • Homologous recombination
  • RecBCD

ASJC Scopus subject areas

  • Genetics

Cite this

Over-representation of Chi sequences caused by di-codon increase in Escherichia coli K-12. / Uno, Reina; Nakayama, Yoichi; Tomita, Masaru.

In: Gene, Vol. 380, No. 1, 15.09.2006, p. 30-37.

Uno, Reina ; Nakayama, Yoichi ; Tomita, Masaru. / Over-representation of Chi sequences caused by di-codon increase in Escherichia coli K-12. In: Gene. 2006 ; Vol. 380, No. 1. pp. 30-37.
@article{bc41c531c5424564bb8565805384f99e,
title = "Over-representation of Chi sequences caused by di-codon increase in Escherichia coli K-12",
keywords = "Chi sequence, Codon usage, Comparative genomics, Homologous recombination, RecBCD",
author = "Reina Uno and Yoichi Nakayama and Masaru Tomita",
year = "2006",
month = "9",
day = "15",
doi = "10.1016/j.gene.2006.05.013",
language = "English",
volume = "380",
pages = "30--37",
journal = "Gene",
issn = "0378-1119",
publisher = "Elsevier",
number = "1",

}

TY - JOUR

T1 - Over-representation of Chi sequences caused by di-codon increase in Escherichia coli K-12

AU - Uno, Reina

AU - Nakayama, Yoichi

AU - Tomita, Masaru

PY - 2006/9/15

Y1 - 2006/9/15

KW - Chi sequence

KW - Codon usage

KW - Comparative genomics

KW - Homologous recombination

KW - RecBCD

UR - http://www.scopus.com/inward/record.url?scp=33747884625&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33747884625&partnerID=8YFLogxK

U2 - 10.1016/j.gene.2006.05.013

DO - 10.1016/j.gene.2006.05.013

M3 - Article

C2 - 16854534

AN - SCOPUS:33747884625

VL - 380

SP - 30

EP - 37

JO - Gene

JF - Gene

SN - 0378-1119

IS - 1

ER -