Conclusions
This report highlighted some of the intricate relationships between the statistics of natural sounds and auditory processing. We have argued that many of the common steps that we often take to perform computational audition can be seen as processes driven by the nature of sound, and not so as steps inspired by human physiology or engineering. We have shown how different aspects of hearing can be explained using a simple and common rule exploiting the statistical structure of sound. Although the methods we employed are very simple, the results are just as promising as using any other more complex approach. We hope that the simplicity and the elegance of this approach will inspire further work along these lines, and give rise to more investigations in the field of computationally evolving audition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Amari, S-I., Cichocki, A., and Yang, H.H., 1996, A new learning algorithm for blind signal separation, Neural Information Processing Systems, 1996.
Ahmed, N., Natarajan, T., and Rao, K.R., 1974, Discrete Cosine Transform, IEEE Transactions on Computers.
Barlow, H.B., 1961, Possible principles underlying the transformation of sensory messages. In Sensory Communication, MIT Press.
Barlow, H.B., 1989, Unsupervised learning, in Neural Computation 1, MIT Press, Cambridge MA.
Cardoso, J.-F., 1990, Eigen-structure of the fourth-order cumulant tensor with application to the blind source separation problem. Proc. ICASSP’90, pages 2655–2658
Cichocki, A. and Georgiev, P., 2003, Blind source separation algorithms with matrix constraints, IEICE Trans. Fundamentals, Vol. E86-A
Deco, G. and Obradovic, D., 1996, An information theoretic approach to neural computing. Springer-Verlag, New York, New York.
Haykin, S.S., 1994, Neural Networks: A Comprehensive Foundation, Macmillan.
Hyvärinen, A., 1999, Survey on Independent Component Analysis, Neural Computer Surveys, Vol. 2, pp. 94–128.
Jolliffe, I.T., 1986, Principal Component Analysis. Springer-Verlag, New York, New York.
Olshausen, B.A., and Field, D.J., 1996, Emergence of simple-cell receptive field properties by learning a sparse code for natural images, Nature, 381.
Plumbley, M.D., 2002, Conditions for non-negative independent component analysis, IEEE Signal Processing Letters, 9(6), pp177–180.
Rao, K. and Yip, P., 1990, Discrete Cosine Transform, Algorithms, Advantages, Applications. Academic Press.
Roweis, S., 1998, EM Algorithms for PCA and SPCA, Neural Information Processing Systems 10.
Smaragdis, P., 2001, Redundancy Reduction for Computational Audition: A Unifying Approach. MIT Doctoral dissertation.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science + Business Media, Inc.
About this chapter
Cite this chapter
Smaragdis, P. (2005). Exploiting Redundancy to Construct Listening Systems. In: Divenyi, P. (eds) Speech Separation by Humans and Machines. Springer, Boston, MA. https://doi.org/10.1007/0-387-22794-6_7
Download citation
DOI: https://doi.org/10.1007/0-387-22794-6_7
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-8001-2
Online ISBN: 978-0-387-22794-8
eBook Packages: EngineeringEngineering (R0)