
Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks

Rajalingham, Rishi ; Issa, Elias B ; Bashivan, Pouya ; Kar, Kohitij ; Schmidt, Kailyn ; DiCarlo, James J

The Journal of Neuroscience, 2018-08, Vol. 38 (33), p. 7255-7269 [Peer-reviewed journal]

United States: Society for Neuroscience


  • Title:
    Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks
  • Author: Rajalingham, Rishi ; Issa, Elias B ; Bashivan, Pouya ; Kar, Kohitij ; Schmidt, Kailyn ; DiCarlo, James J
  • Subjects: Animals ; Artificial neural networks ; Behavior ; Benchmarks ; Data collection ; Discrimination, Psychological - physiology ; High resolution ; Human behavior ; Humans ; Macaca mulatta - physiology ; Male ; Mathematical models ; Models, Neurological ; Monkeys ; Monkeys & apes ; Nervous system ; Neural networks ; Neural Networks, Computer ; Object recognition ; Pattern recognition ; Pattern Recognition, Visual - physiology ; Performance prediction ; Primates ; Psychophysics ; Recognition, Psychology - physiology ; Species Specificity ; State of the art ; Vision ; Visual discrimination ; Visual observation
  • Is part of: The Journal of Neuroscience, 2018-08, Vol. 38 (33), p. 7255-7269
  • Notes:
    Author contributions: R.R., E.B.I., and J.J.D. designed research; R.R., E.B.I., K.K., and K.S. performed research; R.R. and P.B. analyzed data; R.R., E.B.I., and J.J.D. wrote the paper.
    E. Issa's present address: Department of Neuroscience, Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027.
    R.R. and E.B.I. contributed equally to this work.
  • Description: Primates, including humans, can typically recognize objects in visual images at a glance despite naturally occurring identity-preserving image transformations (e.g., changes in viewpoint). A primary neuroscience goal is to uncover neuron-level mechanistic models that quantitatively explain this behavior by predicting primate performance for each and every image. Here, we applied this stringent behavioral prediction test to the leading mechanistic models of primate vision (specifically, deep, convolutional, artificial neural networks; ANNs) by directly comparing their behavioral signatures against those of humans and rhesus macaque monkeys. Using high-throughput data collection systems for human and monkey psychophysics, we collected more than one million behavioral trials from 1472 anonymous humans and five male macaque monkeys for 2400 images over 276 binary object discrimination tasks. Consistent with previous work, we observed that state-of-the-art deep, feedforward convolutional ANNs trained for visual categorization (termed DCNN models) accurately predicted primate patterns of object-level confusion. However, when we examined behavioral performance for individual images within each object discrimination task, we found that all tested DCNN models were significantly nonpredictive of primate performance and that this prediction failure was neither accounted for by simple image attributes nor rescued by simple model modifications. These results show that current DCNN models cannot account for the image-level behavioral patterns of primates and that new ANN models are needed to more precisely capture the neural mechanisms underlying primate object vision. To this end, large-scale, high-resolution primate behavioral benchmarks such as those obtained here could serve as direct guides for discovering such models.
    Recently, specific feedforward deep convolutional artificial neural network (ANN) models have dramatically advanced our quantitative understanding of the neural mechanisms underlying primate core object recognition. In this work, we tested the limits of those ANNs by systematically comparing the behavioral responses of these models with the behavioral responses of humans and monkeys at the resolution of individual images. Using these high-resolution metrics, we found that all tested ANN models significantly diverged from primate behavior. Going forward, these high-resolution, large-scale primate behavioral benchmarks could serve as direct guides for discovering better ANN models of the primate visual system. (An illustrative sketch of this image-level comparison appears after this record.)
  • Publisher: United States: Society for Neuroscience
  • Language: English
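
As a rough illustration of the image-level comparison the description refers to, the sketch below correlates a model's per-image scores with primate per-image accuracy and corrects for the reliability of the primate data by split-half resampling. It is a minimal sketch under assumed inputs (a 0/1 trial matrix per image and a per-image model score); the function name image_level_consistency, the array layout, and the one-sided reliability correction are illustrative assumptions, not the paper's published metrics, which are built on image-level behavioral signatures with noise correction applied to both the primate and the model patterns.

import numpy as np
from scipy.stats import spearmanr

def image_level_consistency(primate_trials, model_scores, n_splits=100, seed=0):
    """Illustrative only: correlate a model's per-image scores with primate
    per-image accuracy, corrected for the reliability of the primate data.

    primate_trials : (n_images, n_trials) array of 0/1 correct responses,
                     one row per test image (hypothetical layout).
    model_scores   : (n_images,) per-image performance estimate for the model.
    """
    rng = np.random.default_rng(seed)
    n_images, n_trials = primate_trials.shape
    raw, reliability = [], []
    for _ in range(n_splits):
        # Split primate trials into two random halves to estimate how
        # reliable the per-image performance pattern itself is.
        perm = rng.permutation(n_trials)
        half_a = primate_trials[:, perm[: n_trials // 2]].mean(axis=1)
        half_b = primate_trials[:, perm[n_trials // 2:]].mean(axis=1)
        reliability.append(spearmanr(half_a, half_b).correlation)
        # Correlate the model's image-level pattern against one half.
        raw.append(spearmanr(model_scores, half_a).correlation)
    # Divide by the square root of the split-half reliability so the model is
    # not penalized for measurement noise in the primate data (one-sided
    # correction; the paper also accounts for model-side reliability).
    return float(np.mean(raw) / np.sqrt(np.mean(reliability)))

# Example on synthetic data: 2400 images, 100 simulated trials each.
if __name__ == "__main__":
    rng = np.random.default_rng(1)
    true_difficulty = rng.uniform(0.5, 1.0, size=2400)       # per-image accuracy
    trials = (rng.random((2400, 100)) < true_difficulty[:, None]).astype(float)
    model = true_difficulty + rng.normal(0, 0.1, size=2400)  # noisy model scores
    print(image_level_consistency(trials, model))

The example at the bottom runs on synthetic data only; with real data, primate_trials would hold pooled human or monkey responses per test image and model_scores the corresponding DCNN performance estimates.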
