Skip to main content

Crystal-MUSIC: Accurate Localization of Multiple Sources in Diffuse Noise Environments Using Crystal-Shaped Microphone Arrays

  • Conference paper
Latent Variable Analysis and Signal Separation (LVA/ICA 2010)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6365))

Abstract

This paper presents crystal-MUSIC, a method for DOA estimation of multiple sources in the presence of diffuse noise. MUSIC is well known as a method for the estimation of the DOAs of multiple sources but is not very robust to diffuse noise from many directions, because the covariance structure of such noise is not spherical. Our method makes it possible for MUSIC to accurately estimate the DOAs by removing the contribution of diffuse noise from the spatial covariance matrix. This denoising is performed in two steps: 1) denoising of the off-diagonal entries via a blind noise decorrelation using crystal-shaped arrays, and 2) denoising of the diagonal entries through a low-rank matrix completion technique. The denoising process does not require the spatial covariance matrix of diffuse noise to be known, but relies only on an isotropy feature of diffuse noise. Experimental results with real-world noise show that the DOA estimation accuracy is substantially improved compared to the conventional MUSIC.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Knapp, C.H., Carter, G.C.: The generalized correlation method for estimation of time delay. IEEE Trans. Acoust., Speech, Signal Process. (4), 320–327 (August 1976)

    Google Scholar 

  2. Schmidt, R.O.: Multiple emitter location and signal parameter estimation. IEEE Trans. Antennas Propag., 276–280 (March 1986)

    Google Scholar 

  3. Wax, M., Shan, T.-J., Kailath, T.: Spatio-temporal spectral analysis by eignestructure methods. IEEE Trans. Acoust., Speech, Signal Process., 817–827 (August 1984)

    Google Scholar 

  4. Pham, T., Sadler, B.M.: Adaptive wideband aeroacoustic array processing. In: Proceedings of the 8th IEEE Signal Processing Workshop on Stastical Signal and Array Processing, Corfu, Greece, pp. 295–298 (June 1996)

    Google Scholar 

  5. Shimizu, H., Ono, N., Matsumoto, K., Sagayama, S.: Isotropic noise suppression in the power spectrum domain by symmetric microphone arrays. In: Proc. WASPAA, New Paltz, NY, pp. 54–57 (October 2007)

    Google Scholar 

  6. Ono, N., Ito, N., Sagayama, S.: Five classes of crystal arrays for blind decorrelation of diffuse noise. In: Proc. SAM, Darmstadt, Germany, pp. 151–154 (July 2008)

    Google Scholar 

  7. Bitzer, J., Simmer, K.U.: Superdirective microphone arrays. In: Brandstein, M., Ward, D. (eds.) Microphone Arrays: Signal Processing Techniques and Applications, ch. 2, pp. 19–38. Springer, Berlin (2001)

    Google Scholar 

  8. Srebro, N., Jaakkola, T.: Weighted low-rank approximations. In: 20th International Conference on Machine Learning, pp. 720–727. AAAI Press, Menlo Park (2003)

    Google Scholar 

  9. Candès, E.J., Recht, B.: Exact matrix completion via convex optimization. The Journal of the Society for the Foundations of Computational Mathematics (9), 717–772 (April 2009)

    Google Scholar 

  10. Ji, S., Ye, J.: An accelerated gradient method for trace norm minimization. In: Proceedings of the 26th International Conference on Machine Learning, Montreal, Canada, pp. 457–464 (2009)

    Google Scholar 

  11. Ito, N., Ono, N., Vincent, E., Sagayama, S.: Designing the Wiener post-filter for diffuse noise suppression using imaginary parts of inter-channel cross-spectra. In: Proc. ICASSP 2010, Dallas, USA (March 2010)

    Google Scholar 

  12. Kurematsu, A., Takeda, K., Sagisaka, Y., Katagiri, S., Kuwabara, H., Shikano, K.: ATR Japanese speech database as a tool of speech recognition and synthesis. Speech Communication 9(4), 357–363 (1990)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ito, N., Vincent, E., Ono, N., Gribonval, R., Sagayama, S. (2010). Crystal-MUSIC: Accurate Localization of Multiple Sources in Diffuse Noise Environments Using Crystal-Shaped Microphone Arrays. In: Vigneron, V., Zarzoso, V., Moreau, E., Gribonval, R., Vincent, E. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2010. Lecture Notes in Computer Science, vol 6365. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15995-4_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15995-4_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15994-7

  • Online ISBN: 978-3-642-15995-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics