A computational architecture incorporating shallow brain networks: integrating parallel cortical and subcortical processing
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Artificial neural networks commonly have deep hierarchical structures that were originally inspired by the neuroanatomical evidence of cortico-cortical connectivity pattern found in the mammalian brain. Largely neglected in those models are non-hierarchical aspects of brain architecture, namely the subcortical pathways and the interactions between cortical and subcortical areas regardless of their hierarchical locations. Inspired by this principle, we present a computational model combining cortical hierarchical processing with subcortical pathways based on neuroanatomical evidence. We show the versatility of our model by implementing the cortical hierarchy in two alternative ways—a convolutional feedforward network and a predictive coding network. Both model variants can replicate behavioral observations in humans and monkeys on a perceptual context-dependent decision-making task. The model also reveals that subcortical structures lead decisions for easy trials while the more complex hierarchical network is necessary for the harder trials. Our results suggest that the parallel cortico-subcortical processing explored in the model represents a fundamental property that cannot be neglected in understanding the computational principles used by the brain.
Significance
Artificial intelligence and computational neuroscience models, particularly deep learning and predictive coding architectures, have been traditionally dominated by cortico-centric hierarchical frameworks. However, extensive neurobiological evidence suggests that cortical areas, regardless of their hierarchical classification, are deeply interconnected with subcortical structures in real brains. We propose here a computational framework demonstrating that parallel cortical-subcortical architectures can yield complex and more flexible computational capabilities, aligning with the observed behavior in the mammalian brain. Our model addresses key limitations in existing deep learning and predictive coding networks, offering a more biologically plausible and functionally advantageous alternative.