paraafPeter A. van der Helm Demo link

About Teaching Research Publications Presentations



Cognitive architecture
or
the neural signature of transparallel processing



The term cognitive architecture refers to computational models of not only resulting behavior but also structural properties of intelligent systems. These structural properties can be physical properties as well as more abstract properties implemented in physical systems such as computers and brains. There is no consensus about what these structural properties should be, and indeed, many different cognitive architectures have been proposed. These models differ, for instance, in whether they involve fixed or flexible architectures, in what forms of processing they allow, and in the extent to which they are based on a set of symbolic information-processing rules applied by one central processor or rely on emergent properties of many interacting processing units. Most models agree, however, that a cognitive architecture is a parameter-free blueprint for a system that acts like the human cognitive system as a whole.

Cognitive architectures differ from cognitive models and expert systems which usually (a) focus on particular competences such as language, concept learning, or problem solving, and (b) are judged solely by their resulting behavior. As indicated above, cognitive architectures are judged by their structural properties as well. Below, to address the question of what the structural properties of the human cognitive architecture in the brain might be, the focus is first on the visual hierarchy in the brain, and then on the intriguing phenomenon of neuronal synchronization which, in the visual hierarchy, seems to underlie perceptual organization.


The visual hierarchy

The top end of the visual hierarchy seems to involve a smooth transition into higher cognitive structures, while the bottom end can be said to be in the primary visual area V1 in the occipital lobe, which receives its main input from the lateral geniculate nucleus (LGN) (see next left-hand figure). In the LGN, a distinction can be made between retinal input entering the parvocellular pathway and retinal input entering the magnocellular pathway. Via V1 and higher visual areas, these pathways bifurcate into a ventral and a dorsal stream which seem to be dedicated to object perception and spatial perception, respectively (see next right-hand figure).


Visual pathways   What-where paths
     
Retinal signals go, via the optic chiasm (OC) and the lateral geniculate nucleus (LGN), to the visual cortex; the OC arranges that the left-hand visual fields of both eyes are projected onto the right-hand cortex, and vice versa; in the LGN, retinal signals enter parvocellular and magnocellular paths, which perform something like a spatial frequency analysis.   In the visual cortex, the signals bifurcate into ventral and dorsal streams which are dedicated to object perception and spatial perception, respectively.


The neural network in the visual hierarchy is organized with 10-14 distinguishable hierarchical levels (with multiple distinguishable areas within each level), contains many short-range and long-range connections (both within and between levels), and it can be said to perform distributed hierarchical processing. Furthermore, in the visual hierarchy, the intertwined but functionally distinguishable subprocesses of feature encoding, feature binding, and feature selection seem to be mediated by feedforward (or ascending), horizontal (or lateral), and recurrent (or feedback, or reentrant, or descending) connections, respectively (see next left-hand figure). As indicated in next right-hand figure, these three subprocesses have also been implemented in SIT's transparallel processing model of perceptual organization (see also Smart processing and Hyperstrings).


Subprocesses in the visual hierarchy
 
(a) The three intertwined but functionally distinguishable subprocesses which, in neuroscience, are believed to take place in the visual hierarchy in the brain. (b) The three corresponding and also intertwined methods implemented in SIT's transparallel processing model of perceptual organization.


The three subprocesses are physically intertwined but, functionally, they can be characterized separately as follows:
There is controversy about the question of whether the selection of different features and their integration into complete percepts is controlled by endogenous, attention-driven, recurrent processing starting from beyond the visual hierarchy or by exogenous, stimulus-driven, recurrent processing within the visual hierarchy. This controversy seems moot, however. After all, the combination of feedforward and recurrent processing in the visual hierarchy might be analogous to the cascade formed by a fountain under increasing water pressure. That is, as the feedforward sweep progresses along ascending connections, each passed level in the visual hierarchy forms the starting point of integrative recurrent processing along descending connections. This yields a gradual buildup from partial percepts at lower levels in the hierarchy to complete percepts near its top end. This implies, on the one hand, that top-down attentional processes may intrude before a percept has completed, but on the other hand, that the perceptual organization process has already done much of its integrative work by then. To paraphrase Neisser (1967), before you can pick an apple from a tree, you first have to perceptually organize the scene to at least some degree.

Furthermore, in between feedforward and recurrent processing, there is the horizontal binding of similar features. This subprocess is a relatively underexposed topic in neuroscience, but it may well be the neuronal counterpart of the regularity extraction operations which, in representational approaches like SIT, are proposed to lead to structured mental representations. This subprocess seems to involve transient (i.e., input-dependent) neural assemblies which also have been implicated in the phenomenon of neuronal synchronization.


Neuronal synchronization

Neuronal synchronization is the phenomenon that neurons, in transient assemblies, temporarily synchronize their activity. Not to be confused with neuroplasticity which involves changes in connectivity, such assemblies are thought to arise when neurons shift their allegiance to different groups by altering connection strengths, which may also imply a shift in the specificity and function of neurons. Both theoretically and empirically, neuronal synchronization has been associated with a broad range of cognitive processes. Gamma-band synchronization (30-70 Hz), in particular, has been associated with visual processes such as those dealing with change detection, interocular rivalry, feature binding, Gestalt formation, and form discrimination.

The dynamics of neuronal synchronization (mostly in visual processing) are being studied by way of methods from dynamic systems theory. Furthermore, proposed ideas about the meaning of neuronal synchronization (also mostly in visual processing) are, for instance:
Indeed, neuronal synchronization may reflect a flexible and efficient mechanism subserving the representation of information, the regulation of the flow of information, and the storage and retrieval of information. All these ideas, however, are about cognitive factors associated with synchronization rather than about the nature of the underlying cognitive process itself. Therefore, instead of saying that synchronization mediates cognitive processes, it seems better to say that it is a manifestation of cognitive processing -- just as the bubbles in boiling water are a manifestation of the boiling process.

The foregoing does not make neuronal synchronization less interesting -- on the contrary, it raises the question of what form of processing it might be a manifestation. That is, it does not seem to reflect a simple form of parallel (distributed) processing. After all, basically, parallel processing is performed by different agents who simultaneously do different things. When these agents simultaneously do the same thing, however, they seem to enter another processing mode -- think of flash mobs or groups of singers going from cacophony to harmony. Indeed, assuming that neuronal synchronization underlies perceptual organization, which is characterized by a high combinatorial capacity and high speed, it must be a special form of processing that manifests itself by neuronal synchronization.

Notice in this respect that:
These two things together suggest that gamma-band synchronization in the visual hierarchy in the brain might be a manifestation of transparallel processing of similar features gathered in hyperstring-like transient neural assemblies. As indicated next, this gives rise to a picture of flexible cognitive architecture implemented in the relatively rigid neural architecture of the brain.


From neurons to gnosons: a pluralist approach

The foregoing indicates that SIT's model of perceptual organization is neurally plausible in that it implements those three intertwined but functionally distinguishable subprocesses in the visual hierarchy. Furthermore, it suggests that the mechanism of transparallel processing by hyperstrings provides a computational explanation of synchronization in transient neural assemblies.

In SIT's model, the combination of feedforward feature encoding and horizontal feature binding yields an input-dependent tree of hyperstrings, that is, a hierarchical distributed representation which represents the output space for only the input at hand. This contrasts with standard connectionist modeling, which starts from a pre-fixed network that represents the output space for all possible inputs. The intertwined subprocess of feature selection and integration in SIT's model, however, is comparable to the method of selection by activation-spreading in standard connectionist modeling (see Inanimate PDP in Slimy, Hilly, and Pixy).

Hence, SIT's transparallel processing model of perceptual organization transcends traditional definitions of representational and connectionist approaches, in that it puts the representational idea that cognition relies on regularity extraction to get structured representations in a more dynamic perspective together with a more flexible version of the connectionist idea that cognition relies on activation spreading through a network. Its transparallel mechanism also relates plausibly to neuronal synchronization, so that it also honors the idea in dynamic-systems theory that cognition relies on dynamic changes in the brain's neural state. In this sense, SIT's model reflects a truly pluralist account of visual processing (see also Marr's levels and Metaphors of cognition).

Furthermore, the idea that cognition is a dynamic process of self-organization is not new, and the idea that those temporarily synchronized assemblies are the building blocks of cognition is not new either. What SIT's model adds, however, is the idea that the temporarily synchronized neural assemblies in the visual hierarchy can be seen as hyperstring-like cognitive information processors which, in a transparallel fashion, process similar features in the input at hand. Therefore, these assemblies of neurons may be called "gnosons", that is, fundamental particles of cognition, and they can be seen as the constituents of flexible self-organizing cognitive architecture in between the relatively rigid level of neurons and the still elusive level of consciousness.


For a demo on transparallel processing as such, see Pencil selection
For a formal account of transparallel processing by hyperstrings, see Proceedings of the National Academy of Sciences USA 2004
For extensive discussions and references on cognitive architecture, see Cognitive Processing 2012