MassBank: A public repository for sharing mass spectral data for life sciences

Hisayuki Horai, Masanori Arita, Shigehiko Kanaya, Yoshito Nihei, Tasuku Ikeda, Kazuhiro Suwa, Yuya Ojima, Kenichi Tanaka, Satoshi Tanaka, Ken Aoshima, Yoshiya Oda, Yuji Kakazu, Miyako Kusano, Takayuki Tohge, Fumio Matsuda, Yuji Sawada, Masami Yokota Hirai, Hiroki Nakanishi, Kazutaka Ikeda, Naoshige Akimoto, Takashi Maoka, Hiroki Takahashi, Takeshi Ara, Nozomu Sakurai, Hideyuki Suzuki, Daisuke Shibata, Steffen Neumann, Takashi Iida, Ken Tanaka, Kimito Funatsu, Fumito Matsuura, Tomoyoshi Soga, Ryo Taguchi, Kazuki Saito, Takaaki Nishioka

Research output: Contribution to journal (Article)

  • 705 Citations

Abstract

MassBank is the first public repository of mass spectra of small chemical compounds for life sciences (<3000 Da). The database contains 605 electron-ionization mass spectrometry (EI-MS), 137 fast atom bombardment MS and 9276 electrospray ionization (ESI)-MSn data of 2337 authentic compounds of metabolites, 11 545 EI-MS and 834 other-MS data of 10 286 volatile natural and synthetic compounds, and 3045 ESI-MS2 data of 679 synthetic drugs contributed by 16 research groups (January 2010). ESI-MS2 data were analyzed under nonstandardized, independent experimental conditions. MassBank is a distributed database. Each research group provides data from its own MassBank data servers distributed on the Internet. MassBank users can access either all of the MassBank data or a subset of the data by specifying one or more experimental conditions. In a spectral search to retrieve mass spectra similar to a query mass spectrum, the similarity score is calculated by a weighted cosine correlation in which weighting exponents on peak intensity and the mass-to-charge ratio are optimized to the ESI-MS2 data. MassBank also provides a merged spectrum for each compound prepared by merging the analyzed ESI-MS2 data on an identical compound under different collision-induced dissociation conditions. Data merging has significantly improved the precision of the identification of a chemical compound by 21-23% at a similarity score of 0.6. Thus, MassBank is useful for the identification of chemical compounds and the publication of experimental data.
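
The weighted cosine correlation and the spectrum merging described in the abstract can be illustrated with a minimal Python sketch. This is not MassBank's implementation. The function names, the exponent defaults (`intensity_exp=0.5`, `mz_exp=2.0`), the m/z tolerance, and the "keep the maximum intensity per m/z bin" merging rule are all assumptions chosen for illustration; the abstract states only that the exponents were optimized to the ESI-MS2 data and that spectra acquired under different collision-induced dissociation conditions were merged.

```python
import math


def weighted_cosine(query, reference, intensity_exp=0.5, mz_exp=2.0, tol=0.3):
    """Weighted cosine similarity between two spectra.

    Each spectrum is a list of (mz, intensity) pairs. Every peak gets the
    weight intensity**intensity_exp * mz**mz_exp. The exponent defaults and
    the tolerance are illustrative placeholders, not values from the abstract.
    """
    def weighted(spectrum):
        return [(mz, (inten ** intensity_exp) * (mz ** mz_exp))
                for mz, inten in spectrum]

    q = weighted(query)
    r = weighted(reference)

    # Greedy one-to-one matching: each query peak takes the nearest
    # unused reference peak within the m/z tolerance.
    used = set()
    dot = 0.0
    for mz_q, w_q in q:
        best = None
        for j, (mz_r, w_r) in enumerate(r):
            if j in used:
                continue
            diff = abs(mz_q - mz_r)
            if diff <= tol and (best is None or diff < best[0]):
                best = (diff, j, w_r)
        if best is not None:
            used.add(best[1])
            dot += w_q * best[2]

    norm_q = math.sqrt(sum(w * w for _, w in q))
    norm_r = math.sqrt(sum(w * w for _, w in r))
    if norm_q == 0.0 or norm_r == 0.0:
        return 0.0
    return dot / (norm_q * norm_r)


def merge_spectra(spectra, decimals=1):
    """Merge several spectra of one compound into a single spectrum.

    Hypothetical rule: bin peaks by rounded m/z and keep the highest
    intensity seen in each bin.
    """
    merged = {}
    for spectrum in spectra:
        for mz, inten in spectrum:
            key = round(mz, decimals)
            merged[key] = max(merged.get(key, 0.0), inten)
    return sorted(merged.items())


# Two hypothetical spectra of the same compound at different collision energies.
low_energy = [(100.0, 10.0), (250.0, 90.0)]
high_energy = [(100.0, 50.0), (150.0, 5.0)]
merged = merge_spectra([low_energy, high_energy])
score = weighted_cosine(low_energy, merged)
```

In this sketch a query spectrum is scored against the merged spectrum rather than against each collision-energy spectrum separately, which mirrors the use of merged spectra that the abstract reports as improving identification precision.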

Language: English
Pages: 703-714
Number of pages: 12
Journal: Journal of Mass Spectrometry
Volume: 45
Issue number: 7
DOI: https://doi.org/10.1002/jms.1777
Publication status: Published - 2010


Keywords

  • Distributed database
  • MassBank
  • Metabolite
  • Public database
  • Spectral similarity

ASJC Scopus subject areas

  • Spectroscopy
  • Medicine (all)

Cite this

Horai, H., Arita, M., Kanaya, S., Nihei, Y., Ikeda, T., Suwa, K., ... Nishioka, T. (2010). MassBank: A public repository for sharing mass spectral data for life sciences. Journal of Mass Spectrometry, 45(7), 703-714. https://doi.org/10.1002/jms.1777

@article{41cd269a59b34510b52d8cd9906d6a4a,
title = "MassBank: A public repository for sharing mass spectral data for life sciences",
abstract = "MassBank is the first public repository of mass spectra of small chemical compounds for life sciences (<3000 Da). The database contains 605 electron-ionization mass spectrometry (EI-MS), 137 fast atom bombardment MS and 9276 electrospray ionization (ESI)-MSn data of 2337 authentic compounds of metabolites, 11 545 EI-MS and 834 other-MS data of 10 286 volatile natural and synthetic compounds, and 3045 ESI-MS2 data of 679 synthetic drugs contributed by 16 research groups (January 2010). ESI-MS2 data were analyzed under nonstandardized, independent experimental conditions. MassBank is a distributed database. Each research group provides data from its own MassBank data servers distributed on the Internet. MassBank users can access either all of the MassBank data or a subset of the data by specifying one or more experimental conditions. In a spectral search to retrieve mass spectra similar to a query mass spectrum, the similarity score is calculated by a weighted cosine correlation in which weighting exponents on peak intensity and the mass-to-charge ratio are optimized to the ESI-MS2 data. MassBank also provides a merged spectrum for each compound prepared by merging the analyzed ESI-MS2 data on an identical compound under different collision-induced dissociation conditions. Data merging has significantly improved the precision of the identification of a chemical compound by 21-23{\%} at a similarity score of 0.6. Thus, MassBank is useful for the identification of chemical compounds and the publication of experimental data.",
keywords = "Distributed database, MassBank, Metabolite, Public database, Spectral similarity",
author = "Hisayuki Horai and Masanori Arita and Shigehiko Kanaya and Yoshito Nihei and Tasuku Ikeda and Kazuhiro Suwa and Yuya Ojima and Kenichi Tanaka and Satoshi Tanaka and Ken Aoshima and Yoshiya Oda and Yuji Kakazu and Miyako Kusano and Takayuki Tohge and Fumio Matsuda and Yuji Sawada and Hirai, {Masami Yokota} and Hiroki Nakanishi and Kazutaka Ikeda and Naoshige Akimoto and Takashi Maoka and Hiroki Takahashi and Takeshi Ara and Nozomu Sakurai and Hideyuki Suzuki and Daisuke Shibata and Steffen Neumann and Takashi Iida and Ken Tanaka and Kimito Funatsu and Fumito Matsuura and Tomoyoshi Soga and Ryo Taguchi and Kazuki Saito and Takaaki Nishioka",
year = "2010",
doi = "10.1002/jms.1777",
language = "English",
volume = "45",
pages = "703--714",
journal = "Journal of Mass Spectrometry",
issn = "1076-5174",
publisher = "John Wiley and Sons Ltd",
number = "7",

}

TY - JOUR

T1 - MassBank: A public repository for sharing mass spectral data for life sciences

T2 - Journal of Mass Spectrometry

AU - Horai, Hisayuki

AU - Arita, Masanori

AU - Kanaya, Shigehiko

AU - Nihei, Yoshito

AU - Ikeda, Tasuku

AU - Suwa, Kazuhiro

AU - Ojima, Yuya

AU - Tanaka, Kenichi

AU - Tanaka, Satoshi

AU - Aoshima, Ken

AU - Oda, Yoshiya

AU - Kakazu, Yuji

AU - Kusano, Miyako

AU - Tohge, Takayuki

AU - Matsuda, Fumio

AU - Sawada, Yuji

AU - Hirai, Masami Yokota

AU - Nakanishi, Hiroki

AU - Ikeda, Kazutaka

AU - Akimoto, Naoshige

AU - Maoka, Takashi

AU - Takahashi, Hiroki

AU - Ara, Takeshi

AU - Sakurai, Nozomu

AU - Suzuki, Hideyuki

AU - Shibata, Daisuke

AU - Neumann, Steffen

AU - Iida, Takashi

AU - Tanaka, Ken

AU - Funatsu, Kimito

AU - Matsuura, Fumito

AU - Soga, Tomoyoshi

AU - Taguchi, Ryo

AU - Saito, Kazuki

AU - Nishioka, Takaaki

PY - 2010

Y1 - 2010

N2 - MassBank is the first public repository of mass spectra of small chemical compounds for life sciences (<3000 Da). The database contains 605 electron-ionization mass spectrometry (EI-MS), 137 fast atom bombardment MS and 9276 electrospray ionization (ESI)-MSn data of 2337 authentic compounds of metabolites, 11 545 EI-MS and 834 other-MS data of 10 286 volatile natural and synthetic compounds, and 3045 ESI-MS2 data of 679 synthetic drugs contributed by 16 research groups (January 2010). ESI-MS2 data were analyzed under nonstandardized, independent experimental conditions. MassBank is a distributed database. Each research group provides data from its own MassBank data servers distributed on the Internet. MassBank users can access either all of the MassBank data or a subset of the data by specifying one or more experimental conditions. In a spectral search to retrieve mass spectra similar to a query mass spectrum, the similarity score is calculated by a weighted cosine correlation in which weighting exponents on peak intensity and the mass-to-charge ratio are optimized to the ESI-MS2 data. MassBank also provides a merged spectrum for each compound prepared by merging the analyzed ESI-MS2 data on an identical compound under different collision-induced dissociation conditions. Data merging has significantly improved the precision of the identification of a chemical compound by 21-23% at a similarity score of 0.6. Thus, MassBank is useful for the identification of chemical compounds and the publication of experimental data.

KW - Distributed database

KW - MassBank

KW - Metabolite

KW - Public database

KW - Spectral similarity

UR - http://www.scopus.com/inward/record.url?scp=77954439868&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77954439868&partnerID=8YFLogxK

U2 - 10.1002/jms.1777

DO - 10.1002/jms.1777

M3 - Article

VL - 45

SP - 703

EP - 714

JO - Journal of Mass Spectrometry

JF - Journal of Mass Spectrometry

SN - 1076-5174

IS - 7

ER -