A majorization-minimization algorithm with projected gradient updates for time-domain spectrogram factorization

Hideaki Kagami, Hirokazu Kameoka, Masahiro Yukawa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

We previously introduced a framework called time-domain spectrogram factorization (TSF), which realizes nonnegative matrix factorization (NMF)-like source separation in the time domain. This framework is particularly noteworthy in that, while maintaining the ability of NMF to obtain a parts-based representation of magnitude spectra, it allows us to (i) circumvent the commonly made assumption with the NMF approach that the magnitude spectra of source components are additive and (ii) take account of the interdependence of the phase/amplitude components at different time-frequency points. In particular, the second factor has been overlooked despite its potential importance. Our previous study revealed that the conventional TSF algorithm was relatively slow due to large matrix inversions, and the early stopping of the algorithm often resulted in poor separation accuracy. To overcome this problem, this paper presents an iterative TSF solver using projected gradient updates. Simulation results show that the proposed TSF approach yields higher source separation performance than NMF and the other variants including the original TSF.

Original languageEnglish
Title of host publication2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages561-565
Number of pages5
ISBN (Electronic)9781509041176
DOIs
Publication statusPublished - 2017 Jun 16
Event2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - New Orleans, United States
Duration: 2017 Mar 52017 Mar 9

Other

Other2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017
CountryUnited States
CityNew Orleans
Period17/3/517/3/9

Fingerprint

Factorization
Source separation

Keywords

  • Audio source separation
  • non-negative matrix factorization(NMF)
  • projected gradient method

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

Kagami, H., Kameoka, H., & Yukawa, M. (2017). A majorization-minimization algorithm with projected gradient updates for time-domain spectrogram factorization. In 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings (pp. 561-565). [7952218] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP.2017.7952218

A majorization-minimization algorithm with projected gradient updates for time-domain spectrogram factorization. / Kagami, Hideaki; Kameoka, Hirokazu; Yukawa, Masahiro.

2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2017. p. 561-565 7952218.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kagami, H, Kameoka, H & Yukawa, M 2017, A majorization-minimization algorithm with projected gradient updates for time-domain spectrogram factorization. in 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings., 7952218, Institute of Electrical and Electronics Engineers Inc., pp. 561-565, 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017, New Orleans, United States, 17/3/5. https://doi.org/10.1109/ICASSP.2017.7952218
Kagami H, Kameoka H, Yukawa M. A majorization-minimization algorithm with projected gradient updates for time-domain spectrogram factorization. In 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2017. p. 561-565. 7952218 https://doi.org/10.1109/ICASSP.2017.7952218
Kagami, Hideaki ; Kameoka, Hirokazu ; Yukawa, Masahiro. / A majorization-minimization algorithm with projected gradient updates for time-domain spectrogram factorization. 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 561-565
@inproceedings{d9805e3e6fa24e06ad00d2c4a18490a4,
title = "A majorization-minimization algorithm with projected gradient updates for time-domain spectrogram factorization",
abstract = "We previously introduced a framework called time-domain spectrogram factorization (TSF), which realizes nonnegative matrix factorization (NMF)-like source separation in the time domain. This framework is particularly noteworthy in that, while maintaining the ability of NMF to obtain a parts-based representation of magnitude spectra, it allows us to (i) circumvent the commonly made assumption with the NMF approach that the magnitude spectra of source components are additive and (ii) take account of the interdependence of the phase/amplitude components at different time-frequency points. In particular, the second factor has been overlooked despite its potential importance. Our previous study revealed that the conventional TSF algorithm was relatively slow due to large matrix inversions, and the early stopping of the algorithm often resulted in poor separation accuracy. To overcome this problem, this paper presents an iterative TSF solver using projected gradient updates. Simulation results show that the proposed TSF approach yields higher source separation performance than NMF and the other variants including the original TSF.",
keywords = "Audio source separation, non-negative matrix factorization(NMF), projected gradient method",
author = "Hideaki Kagami and Hirokazu Kameoka and Masahiro Yukawa",
year = "2017",
month = "6",
day = "16",
doi = "10.1109/ICASSP.2017.7952218",
language = "English",
pages = "561--565",
booktitle = "2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - A majorization-minimization algorithm with projected gradient updates for time-domain spectrogram factorization

AU - Kagami, Hideaki

AU - Kameoka, Hirokazu

AU - Yukawa, Masahiro

PY - 2017/6/16

Y1 - 2017/6/16

N2 - We previously introduced a framework called time-domain spectrogram factorization (TSF), which realizes nonnegative matrix factorization (NMF)-like source separation in the time domain. This framework is particularly noteworthy in that, while maintaining the ability of NMF to obtain a parts-based representation of magnitude spectra, it allows us to (i) circumvent the commonly made assumption with the NMF approach that the magnitude spectra of source components are additive and (ii) take account of the interdependence of the phase/amplitude components at different time-frequency points. In particular, the second factor has been overlooked despite its potential importance. Our previous study revealed that the conventional TSF algorithm was relatively slow due to large matrix inversions, and the early stopping of the algorithm often resulted in poor separation accuracy. To overcome this problem, this paper presents an iterative TSF solver using projected gradient updates. Simulation results show that the proposed TSF approach yields higher source separation performance than NMF and the other variants including the original TSF.

AB - We previously introduced a framework called time-domain spectrogram factorization (TSF), which realizes nonnegative matrix factorization (NMF)-like source separation in the time domain. This framework is particularly noteworthy in that, while maintaining the ability of NMF to obtain a parts-based representation of magnitude spectra, it allows us to (i) circumvent the commonly made assumption with the NMF approach that the magnitude spectra of source components are additive and (ii) take account of the interdependence of the phase/amplitude components at different time-frequency points. In particular, the second factor has been overlooked despite its potential importance. Our previous study revealed that the conventional TSF algorithm was relatively slow due to large matrix inversions, and the early stopping of the algorithm often resulted in poor separation accuracy. To overcome this problem, this paper presents an iterative TSF solver using projected gradient updates. Simulation results show that the proposed TSF approach yields higher source separation performance than NMF and the other variants including the original TSF.

KW - Audio source separation

KW - non-negative matrix factorization(NMF)

KW - projected gradient method

UR - http://www.scopus.com/inward/record.url?scp=85023758677&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85023758677&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2017.7952218

DO - 10.1109/ICASSP.2017.7952218

M3 - Conference contribution

AN - SCOPUS:85023758677

SP - 561

EP - 565

BT - 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

ER -