Skip to main content

Consistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency

  • Conference paper
Latent Variable Analysis and Signal Separation (LVA/ICA 2010)

Abstract

Wiener filtering is one of the most widely used methods in audio source separation. It is often applied on time-frequency representations of signals, such as the short-time Fourier transform (STFT), to exploit their short-term stationarity, but so far the design of the Wiener time-frequency mask did not take into account the necessity for the output spectrograms to be consistent, i.e., to correspond to the STFT of a time-domain signal. In this paper, we generalize the concept of Wiener filtering to time-frequency masks which can involve manipulation of the phase as well by formulating the problem as a consistency-constrained Maximum-Likelihood one. We present two methods to solve the problem, one looking for the optimal time-domain signal, the other promoting consistency through a penalty function directly in the time-frequency domain. We show through experimental evaluation that, both in oracle conditions and combined with spectral subtraction, our method outperforms classical Wiener filtering.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Diethorn, E.J.: Subband noise reduction methods for speech enhancement. In: Huang, Y., Benesty, J. (eds.) Audio Signal Processing for Next-Generation Multimedia Communication Systems, pp. 91–115. Kluwer, Dordrecht (2004)

    Chapter  Google Scholar 

  2. Vincent, E., Jafari, M.G., Abdallah, S.A., Plumbley, M.D., Davies, M.E.: Probabilistic modeling paradigms for audio source separation. In: Machine Audition: Principles, Algorithms and Systems. IGI Global (to appear)

    Google Scholar 

  3. Le Roux, J., Ono, N., Sagayama, S.: Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction. In: Proc. SAPA, pp. 23–28 (September 2008)

    Google Scholar 

  4. Le Roux, J., Kameoka, H., Vincent, E., Ono, N., Kashino, K., Sagayama, S.: Complex NMF under spectrogram consistency constraints. In: Proc. ASJ Autumn Meeting, (2-4-5) (September 2009)

    Google Scholar 

  5. Boll, S.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. ASSP 27, 113–120 (1979)

    Article  Google Scholar 

  6. Griffin, D.W., Lim, J.S.: Signal estimation from modified short-time Fourier transform. IEEE Trans. ASSP 32(2), 236–243 (1984)

    Article  Google Scholar 

  7. Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Proc. NIPS*2000, pp. 556–562. The MIT Press, Cambridge (2001)

    Google Scholar 

  8. Vincent, E., Gribonval, R., Plumbley, M.D.: BSS Oracle Toolbox Version 2.1, http://bass-db.gforge.inria.fr/bssoracle/

  9. Vincent, E., Sawada, H., Bofill, P., Makino, S., Rosca, J.P.: First stereo audio source separation evaluation campaign: Data, algorithms and results. In: Proc. ICA, pp. 552–559 (September 2007)

    Google Scholar 

  10. Araki, S., Ozerov, A., Gowreesunker, V., Sawada, H., Theis, F., Nolte, G., Lutter, D., Duong, N.Q.: The 2010 signal separation evaluation campaign (SiSEC 2010) –Part II–: Audio source separation challenges. In: Proc. LVA/ICA (2010)

    Google Scholar 

  11. Yu, G., Mallat, S., Bacry, E.: Audio denoising by time-frequency block thresholding. IEEE Trans. Signal Process. 56(5), 1830–1839 (2008)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Le Roux, J., Vincent, E., Mizuno, Y., Kameoka, H., Ono, N., Sagayama, S. (2010). Consistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency. In: Vigneron, V., Zarzoso, V., Moreau, E., Gribonval, R., Vincent, E. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2010. Lecture Notes in Computer Science, vol 6365. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15995-4_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15995-4_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15994-7

  • Online ISBN: 978-3-642-15995-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics