|
|
||||||||
Review |
haykin{at}mcmaster.ca, Adaptive Systems Lab, McMaster University, Hamilton, Ontario, Canada L8S 4K1
zhechen{at}soma.crl.mcmaster.ca, Adaptive Systems Lab, McMaster University, Hamilton, Ontario, Canada L8S 4K1
This review presents an overview of a challenging problem in auditory perception, the cocktail party phenomenon, the delineation of which goes back to a classic paper by Cherry in 1953. In this review, we address the following issues: (1) human auditory scene analysis, which is a general process carried out by the auditory system of a human listener; (2) insight into auditory perception, which is derived from Marr's vision theory; (3) computational auditory scene analysis, which focuses on specific approaches aimed at solving the machine cocktail party problem; (4) active audition, the proposal for which is motivated by analogy with active vision, and (5) discussion of brain theory and independent component analysis, on the one hand, and correlative neural firing, on the other.
One of our most important faculties is our ability to listen to, and follow, one speaker in the presence of others. This is such a common experience that we may take it for granted; we may call it "the cocktail party problem." No machine has been constructed to do just this, to filter out one conversation from a number jumbled together.Colin Cherry, 1957.
This article has been cited by other articles:
![]() |
W. H. Mehaffey, F. R. Fernandez, L. Maler, and R. W. Turner Regulation of Burst Dynamics Improves Differential Encoding of Stimulus Frequency by Spike Train Segregation J Neurophysiol, August 1, 2007; 98(2): 939 - 951. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| J COGNITIVE NEUROSCIENCE | NEURAL COMPUTATION | MIT PRESS JOURNALS |