MICOP: Maximal information coefficient-based oscillation prediction to detect biological rhythms in proteomics data

Hitoshi Iuchi, Masahiro Sugimoto, Masaru Tomita

Research output: Contribution to journalArticle

Abstract

Background: Circadian rhythms comprise oscillating molecular interactions, the disruption of the homeostasis of which would cause various disorders. To understand this phenomenon systematically, an accurate technique to identify oscillating molecules among omics datasets must be developed; however, this is still impeded by many difficulties, such as experimental noise and attenuated amplitude. Results: To address these issues, we developed a new algorithm named Maximal Information Coefficient-based Oscillation Prediction (MICOP), a sine curve-matching method. The performance of MICOP in labeling oscillation or non-oscillation was compared with four reported methods using Mathews correlation coefficient (MCC) values. The numerical experiments were performed with time-series data with (1) mimicking of molecular oscillation decay, (2) high noise and low sampling frequency and (3) one-cycle data. The first experiment revealed that MICOP could accurately identify the rhythmicity of decaying molecular oscillation (MCC > 0.7). The second experiment revealed that MICOP was robust against high-level noise (MCC > 0.8) even upon the use of low-sampling-frequency data. The third experiment revealed that MICOP could accurately identify the rhythmicity of noisy one-cycle data (MCC > 0.8). As an application, we utilized MICOP to analyze time-series proteome data of mouse liver. MICOP identified that novel oscillating candidates numbered 14 and 30 for C57BL/6 and C57BL/6 J, respectively. Conclusions: In this paper, we presented MICOP, which is an MIC-based algorithm, for predicting periodic patterns in large-scale time-resolved protein expression profiles. The performance test using artificially generated simulation data revealed that the performance of MICOP for decaying data was superior to that of the existing widely used methods. It can reveal novel findings from time-series data and may contribute to biologically significant results. This study suggests that MICOP is an ideal approach for detecting and characterizing oscillations in time-resolved omics data sets.

Original languageEnglish
Article number249
JournalBMC Bioinformatics
Volume19
Issue number1
DOIs
Publication statusPublished - 2018 Jun 28

Fingerprint

Proteomics
Periodicity
Oscillation
Prediction
Coefficient
Noise
Correlation coefficient
Time series
Time Series Data
Proteome
Circadian Rhythm
Homeostasis
Experiments
Sampling
Proteins
Molecular interactions
Liver
Experiment
Cycle
Labeling

Keywords

  • Circadian rhythm
  • Mutual information
  • Proteomics

ASJC Scopus subject areas

  • Structural Biology
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Applied Mathematics

Cite this

MICOP : Maximal information coefficient-based oscillation prediction to detect biological rhythms in proteomics data. / Iuchi, Hitoshi; Sugimoto, Masahiro; Tomita, Masaru.

In: BMC Bioinformatics, Vol. 19, No. 1, 249, 28.06.2018.

Research output: Contribution to journalArticle

@article{b2d04c4174124195aec7b1560ee3d681,
title = "MICOP: Maximal information coefficient-based oscillation prediction to detect biological rhythms in proteomics data",
abstract = "Background: Circadian rhythms comprise oscillating molecular interactions, the disruption of the homeostasis of which would cause various disorders. To understand this phenomenon systematically, an accurate technique to identify oscillating molecules among omics datasets must be developed; however, this is still impeded by many difficulties, such as experimental noise and attenuated amplitude. Results: To address these issues, we developed a new algorithm named Maximal Information Coefficient-based Oscillation Prediction (MICOP), a sine curve-matching method. The performance of MICOP in labeling oscillation or non-oscillation was compared with four reported methods using Mathews correlation coefficient (MCC) values. The numerical experiments were performed with time-series data with (1) mimicking of molecular oscillation decay, (2) high noise and low sampling frequency and (3) one-cycle data. The first experiment revealed that MICOP could accurately identify the rhythmicity of decaying molecular oscillation (MCC > 0.7). The second experiment revealed that MICOP was robust against high-level noise (MCC > 0.8) even upon the use of low-sampling-frequency data. The third experiment revealed that MICOP could accurately identify the rhythmicity of noisy one-cycle data (MCC > 0.8). As an application, we utilized MICOP to analyze time-series proteome data of mouse liver. MICOP identified that novel oscillating candidates numbered 14 and 30 for C57BL/6 and C57BL/6 J, respectively. Conclusions: In this paper, we presented MICOP, which is an MIC-based algorithm, for predicting periodic patterns in large-scale time-resolved protein expression profiles. The performance test using artificially generated simulation data revealed that the performance of MICOP for decaying data was superior to that of the existing widely used methods. It can reveal novel findings from time-series data and may contribute to biologically significant results. This study suggests that MICOP is an ideal approach for detecting and characterizing oscillations in time-resolved omics data sets.",
keywords = "Circadian rhythm, Mutual information, Proteomics",
author = "Hitoshi Iuchi and Masahiro Sugimoto and Masaru Tomita",
year = "2018",
month = "6",
day = "28",
doi = "10.1186/s12859-018-2257-4",
language = "English",
volume = "19",
journal = "BMC Bioinformatics",
issn = "1471-2105",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - MICOP

T2 - Maximal information coefficient-based oscillation prediction to detect biological rhythms in proteomics data

AU - Iuchi, Hitoshi

AU - Sugimoto, Masahiro

AU - Tomita, Masaru

PY - 2018/6/28

Y1 - 2018/6/28

N2 - Background: Circadian rhythms comprise oscillating molecular interactions, the disruption of the homeostasis of which would cause various disorders. To understand this phenomenon systematically, an accurate technique to identify oscillating molecules among omics datasets must be developed; however, this is still impeded by many difficulties, such as experimental noise and attenuated amplitude. Results: To address these issues, we developed a new algorithm named Maximal Information Coefficient-based Oscillation Prediction (MICOP), a sine curve-matching method. The performance of MICOP in labeling oscillation or non-oscillation was compared with four reported methods using Mathews correlation coefficient (MCC) values. The numerical experiments were performed with time-series data with (1) mimicking of molecular oscillation decay, (2) high noise and low sampling frequency and (3) one-cycle data. The first experiment revealed that MICOP could accurately identify the rhythmicity of decaying molecular oscillation (MCC > 0.7). The second experiment revealed that MICOP was robust against high-level noise (MCC > 0.8) even upon the use of low-sampling-frequency data. The third experiment revealed that MICOP could accurately identify the rhythmicity of noisy one-cycle data (MCC > 0.8). As an application, we utilized MICOP to analyze time-series proteome data of mouse liver. MICOP identified that novel oscillating candidates numbered 14 and 30 for C57BL/6 and C57BL/6 J, respectively. Conclusions: In this paper, we presented MICOP, which is an MIC-based algorithm, for predicting periodic patterns in large-scale time-resolved protein expression profiles. The performance test using artificially generated simulation data revealed that the performance of MICOP for decaying data was superior to that of the existing widely used methods. It can reveal novel findings from time-series data and may contribute to biologically significant results. This study suggests that MICOP is an ideal approach for detecting and characterizing oscillations in time-resolved omics data sets.

AB - Background: Circadian rhythms comprise oscillating molecular interactions, the disruption of the homeostasis of which would cause various disorders. To understand this phenomenon systematically, an accurate technique to identify oscillating molecules among omics datasets must be developed; however, this is still impeded by many difficulties, such as experimental noise and attenuated amplitude. Results: To address these issues, we developed a new algorithm named Maximal Information Coefficient-based Oscillation Prediction (MICOP), a sine curve-matching method. The performance of MICOP in labeling oscillation or non-oscillation was compared with four reported methods using Mathews correlation coefficient (MCC) values. The numerical experiments were performed with time-series data with (1) mimicking of molecular oscillation decay, (2) high noise and low sampling frequency and (3) one-cycle data. The first experiment revealed that MICOP could accurately identify the rhythmicity of decaying molecular oscillation (MCC > 0.7). The second experiment revealed that MICOP was robust against high-level noise (MCC > 0.8) even upon the use of low-sampling-frequency data. The third experiment revealed that MICOP could accurately identify the rhythmicity of noisy one-cycle data (MCC > 0.8). As an application, we utilized MICOP to analyze time-series proteome data of mouse liver. MICOP identified that novel oscillating candidates numbered 14 and 30 for C57BL/6 and C57BL/6 J, respectively. Conclusions: In this paper, we presented MICOP, which is an MIC-based algorithm, for predicting periodic patterns in large-scale time-resolved protein expression profiles. The performance test using artificially generated simulation data revealed that the performance of MICOP for decaying data was superior to that of the existing widely used methods. It can reveal novel findings from time-series data and may contribute to biologically significant results. This study suggests that MICOP is an ideal approach for detecting and characterizing oscillations in time-resolved omics data sets.

KW - Circadian rhythm

KW - Mutual information

KW - Proteomics

UR - http://www.scopus.com/inward/record.url?scp=85049171590&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85049171590&partnerID=8YFLogxK

U2 - 10.1186/s12859-018-2257-4

DO - 10.1186/s12859-018-2257-4

M3 - Article

C2 - 29954316

AN - SCOPUS:85049171590

VL - 19

JO - BMC Bioinformatics

JF - BMC Bioinformatics

SN - 1471-2105

IS - 1

M1 - 249

ER -