Cracking the neural code for word recognition in convolutional neural networks.

Abstract

Learning to read places a strong challenge on the visual system. Years of expertise lead to a remarkable capacity to separate similar letters and encode their relative positions, thus distinguishing words such as FORM and FROM, invariantly over a large range of positions, sizes and fonts. How neural circuits achieve invariant word recognition remains unknown. Here, we address this issue by recycling deep neural network models initially trained for image recognition. We retrain them to recognize written words and then analyze how reading-specialized units emerge and operate across the successive layers. With literacy, a small subset of units becomes specialized for word recognition in the learned script, similar to the visual word form area (VWFA) in the human brain. We show that these units are sensitive to specific letter identities and their ordinal position from the left or the right of a word. The transition from retinotopic to ordinal position coding is achieved by a hierarchy of "space bigram" units that detect the position of a letter relative to a blank space and that pool across low- and high-frequency-sensitive units from early layers of the network. The proposed scheme provides a plausible neural code for written words in the VWFA, and leads to predictions for reading behavior, error patterns, and the neurophysiology of reading.

Copyright: © 2024 Agrawal, Dehaene. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
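As a toy illustration of the ordinal coding scheme the abstract describes (this is a hypothetical sketch, not the authors' model): if each letter is tagged with its ordinal position counted from the left and from the right edge of the word, anagrams such as FORM and FROM, which share the same letters, nonetheless receive distinct codes.

```python
def ordinal_code(word):
    """Hypothetical encoding: each letter paired with its ordinal
    position from the left edge and from the right edge (0-indexed)."""
    n = len(word)
    return {(ch, i, n - 1 - i) for i, ch in enumerate(word)}

# FORM and FROM contain identical letters, but the ordinal
# positions of O and R differ, so their codes do not match:
print(ordinal_code("FORM") == ordinal_code("FROM"))  # False
print(("F", 0, 3) in ordinal_code("FORM"))           # True: F is leftmost
```

A purely unordered "bag of letters" code would confuse these two words; adding edge-anchored ordinal positions, as the paper's "space bigram" units are proposed to do, resolves the ambiguity.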
