Restauro-G

A Rapid Genome Re-Annotation System for Comparative Genomics

Satoshi Tamaki, Kazuharu Arakawa, Nobuaki Kono, Masaru Tomita

Research output: Contribution to journal › Article

3 Citations (Scopus)

Abstract

Annotations of complete genome sequences submitted directly from sequencing projects are diverse in terms of annotation strategies and update frequencies. These inconsistencies make comparative studies difficult. To allow rapid data preparation of a large number of complete genomes, automation and speed are important for genome re-annotation. Here we introduce an open-source rapid genome re-annotation software system, Restauro-G, specialized for bacterial genomes. Restauro-G re-annotates a genome by similarity searches utilizing the BLAST-Like Alignment Tool, referring to protein databases such as UniProt KB, NCBI nr, NCBI COGs, Pfam, and PSORTb. Re-annotation by Restauro-G achieved over 98% accuracy for most bacterial chromosomes in comparison with the original manually curated annotation of EMBL releases. Restauro-G was developed in the generic bioinformatics workbench G-language Genome Analysis Environment and is distributed at http://restauro-g.iab.keio.ac.jp/ under the GNU General Public License.
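The similarity-search step described in the abstract can be illustrated with a minimal Python sketch. It is not Restauro-G's actual code, which was developed in the G-language Genome Analysis Environment. It assumes that search hits are available as 12-column tab-separated text (the layout BLAT writes with its `-out=blast8` option), and it keeps the highest-scoring hit per query gene as the source for a transferred annotation. The names `Hit`, `parse_blast8`, and `best_hits`, the E-value cutoff of 1e-5, and the sample rows are all hypothetical and chosen only for illustration.

```python
from typing import Dict, Iterable, Iterator, NamedTuple


class Hit(NamedTuple):
    query: str       # gene or protein being re-annotated
    subject: str     # database entry it matched
    identity: float  # percent identity
    evalue: float
    bitscore: float


def parse_blast8(lines: Iterable[str]) -> Iterator[Hit]:
    """Yield hits from 12-column tab-separated (blast8-style) lines."""
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        cols = line.split("\t")
        if len(cols) < 12:
            continue  # skip malformed rows
        yield Hit(cols[0], cols[1], float(cols[2]),
                  float(cols[10]), float(cols[11]))


def best_hits(hits: Iterable[Hit], max_evalue: float = 1e-5) -> Dict[str, Hit]:
    """Keep the highest bit-score hit per query below the E-value cutoff.

    The cutoff is an assumed value, not one taken from the paper.
    """
    best: Dict[str, Hit] = {}
    for hit in hits:
        if hit.evalue > max_evalue:
            continue
        current = best.get(hit.query)
        if current is None or hit.bitscore > current.bitscore:
            best[hit.query] = hit
    return best


# Made-up rows for illustration only; they are not real search results.
EXAMPLE = "\n".join([
    "gene1\tP12345\t98.5\t200\t3\t0\t1\t200\t1\t200\t1e-100\t400.0",
    "gene1\tQ99999\t60.0\t180\t72\t2\t5\t184\t3\t182\t1e-30\t150.0",
    "gene2\tQ11111\t75.0\t120\t30\t0\t1\t120\t1\t120\t1e-20\t90.0",
    "gene3\tQ22222\t30.0\t40\t28\t1\t1\t40\t1\t40\t0.5\t20.0",
])

if __name__ == "__main__":
    for query, hit in sorted(best_hits(parse_blast8(EXAMPLE.splitlines())).items()):
        print(query, hit.subject, hit.identity, hit.bitscore)
```

Selecting one best hit per gene is the simplest possible transfer rule. A real pipeline that consults several databases would also need a policy for combining or prioritizing their results, which this sketch does not attempt.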

Original language: English
Pages (from-to): 53-58
Number of pages: 6
Journal: Genomics, Proteomics and Bioinformatics
Volume: 5
Issue number: 1
DOIs: 10.1016/S1672-0229(07)60014-X
Publication status: Published - 2007

Fingerprint

Comparative Genomics
Genomics
Annotation
Genome
Genes
Bacterial Chromosomes
Bacterial Genomes
Protein Databases
Automation
Licensure
Similarity Search
Computational Biology
Bioinformatics
Chromosomes
Inconsistency
Open Source
Software System
Language
Software
Sequencing

Keywords

  • annotation
  • bioinformatics
  • complete genomes
  • G-language Genome Analysis Environment
  • software

ASJC Scopus subject areas

  • Genetics
  • Biochemistry
  • Molecular Biology

Cite this

Restauro-G: A Rapid Genome Re-Annotation System for Comparative Genomics. / Tamaki, Satoshi; Arakawa, Kazuharu; Kono, Nobuaki; Tomita, Masaru.

In: Genomics, Proteomics and Bioinformatics, Vol. 5, No. 1, 2007, p. 53-58.

@article{8633b5ade5334949af853181db25eaaa,
title = "Restauro-G: A Rapid Genome Re-Annotation System for Comparative Genomics",
abstract = "Annotations of complete genome sequences submitted directly from sequencing projects are diverse in terms of annotation strategies and update frequencies. These inconsistencies make comparative studies difficult. To allow rapid data preparation of a large number of complete genomes, automation and speed are important for genome re-annotation. Here we introduce an open-source rapid genome re-annotation software system, Restauro-G, specialized for bacterial genomes. Restauro-G re-annotates a genome by similarity searches utilizing the BLAST-Like Alignment Tool, referring to protein databases such as UniProt KB, NCBI nr, NCBI COGs, Pfam, and PSORTb. Re-annotation by Restauro-G achieved over 98{\%} accuracy for most bacterial chromosomes in comparison with the original manually curated annotation of EMBL releases. Restauro-G was developed in the generic bioinformatics workbench G-language Genome Analysis Environment and is distributed at http://restauro-g.iab.keio.ac.jp/ under the GNU General Public License.",
keywords = "annotation, bioinformatics, complete genomes, G-language Genome Analysis Environment, software",
author = "Satoshi Tamaki and Kazuharu Arakawa and Nobuaki Kono and Masaru Tomita",
year = "2007",
doi = "10.1016/S1672-0229(07)60014-X",
language = "English",
volume = "5",
pages = "53--58",
journal = "Genomics Proteomics Bioinformatics",
issn = "1672-0229",
publisher = "Beijing Genomics Institute",
number = "1"
}