One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis

Edgar Hemery 1, Jean-Julien Aucouturier 2, *
* Corresponding author
2 Equipe Perception et design sonores
STMS - Sciences et Technologies de la Musique et du Son
Abstract: The mammalian auditory system extracts features from the acoustic environment based on the responses of spatially distributed sets of neurons in the subcortical and cortical auditory structures. The characteristic responses of these neurons (linearly approximated by their spectro-temporal receptive fields, or STRFs) suggest that auditory representations are formed, as early as in the inferior colliculi, on the basis of a time, frequency, rate (temporal modulations) and scale (spectral modulations) analysis of sound. However, how these four dimensions are integrated and processed in subsequent neural networks remains unclear. In this work, we present a new methodology to generate computational insights into the functional organization of such processes. We first propose a systematic framework to explore more than a hundred different computational strategies proposed in the literature to process the output of a generic STRF model. We then evaluate these strategies on their ability to compute perceptual distances between pairs of environmental sounds. Finally, we conduct a meta-analysis of the dataset of all these algorithms' accuracies to examine whether certain combinations of dimensions and certain ways to treat such dimensions are, on the whole, more computationally effective than others. We present an application of this methodology to a dataset of ten environmental sound categories, in which the analysis reveals that (1) models are most effective when they organize STRF data into frequency groupings, which is consistent with the known tonotopic organization of receptive fields in auditory structures, and that (2) models that treat STRF data as time series are no more effective than models that rely only on summary statistics along time, which corroborates recent experimental evidence on texture discrimination by summary statistics.
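To make the kind of strategy described in the abstract concrete, here is a minimal sketch of one possible reading of it, not the authors' implementation. It assumes that the STRF model output for each sound is an array of shape (time, frequency, rate, scale), summarizes it by averaging along time, groups the result by frequency channel, and scores the resulting distance matrix with a nearest-neighbour category match. The function names, the Euclidean distance, the scoring rule and the synthetic data are all assumptions made for illustration.

```python
import numpy as np


def summarize_over_time(strf):
    """Collapse the time axis of a (time, freq, rate, scale) array by averaging."""
    return strf.mean(axis=0)  # shape: (freq, rate, scale)


def frequency_grouped_distance(strf_a, strf_b):
    """Hypothetical strategy: time-average each sound, compare the rate-scale
    pattern of each frequency channel separately, then average over channels."""
    a = summarize_over_time(strf_a)
    b = summarize_over_time(strf_b)
    n_freq = a.shape[0]
    a = a.reshape(n_freq, -1)  # one rate-scale vector per frequency channel
    b = b.reshape(n_freq, -1)
    per_channel = np.linalg.norm(a - b, axis=1)  # Euclidean distance per channel
    return float(per_channel.mean())


def pairwise_distances(sounds, distance):
    """Symmetric matrix of distances between every pair of sounds."""
    n = len(sounds)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            dist[i, j] = dist[j, i] = distance(sounds[i], sounds[j])
    return dist


def nearest_neighbor_accuracy(dist, labels):
    """Fraction of sounds whose closest other sound shares their category.
    This scoring rule is an assumption, used here as a simple stand-in."""
    d = dist.copy()
    np.fill_diagonal(d, np.inf)  # a sound may not match itself
    nearest = d.argmin(axis=1)
    labels = np.asarray(labels)
    return float((labels[nearest] == labels).mean())


# Synthetic stand-in data: two categories that differ by a constant offset.
# Durations vary, which time-averaging handles without alignment.
rng = np.random.default_rng(0)
n_freq, n_rate, n_scale = 8, 4, 3
labels = [0] * 5 + [1] * 5
sounds = [
    rng.normal(loc=2.0 * label, scale=1.0,
               size=(int(rng.integers(20, 40)), n_freq, n_rate, n_scale))
    for label in labels
]

dist = pairwise_distances(sounds, frequency_grouped_distance)
accuracy = nearest_neighbor_accuracy(dist, labels)
print(f"nearest-neighbour accuracy: {accuracy:.2f}")
```

A time-series variant would keep the time axis and compare sequences of frames instead of their average; swapping the `distance` argument of `pairwise_distances` is enough to score such an alternative with the same code.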
Document type:
Journal article
Frontiers in Computational Neuroscience, Frontiers, 2015, 9 (Article 80), 18 p. 〈10.3389/fncom.2015.00080〉

Cited literature [42 references]

https://hal-mines-paristech.archives-ouvertes.fr/hal-01186328
Contributor: Bibliothèque Mines Paristech
Submitted on: Monday, 24 August 2015 - 16:04:14
Last modified on: Friday, 30 November 2018 - 01:27:37
Document(s) archived on: Wednesday, 25 November 2015 - 18:26:34

File

fncom-09-00080.pdf
Publisher files allowed on an open archive

License

Distributed under a Creative Commons Attribution 4.0 International License

Citation

Edgar Hemery, Jean-Julien Aucouturier. One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis. Frontiers in Computational Neuroscience, Frontiers, 2015, 9 (Article 80), 18 p. 〈10.3389/fncom.2015.00080〉. 〈hal-01186328〉
