Elsevier

NeuroImage

Volume 96, 1 August 2014, Pages 300-308
NeuroImage

Fractal image perception provides novel insights into hierarchical cognition

https://doi.org/10.1016/j.neuroimage.2014.03.064Get rights and content

Highlights

  • In this study we investigated the neural processing of fractals.

  • Visual hierarchies are efficiently represented using self-embedding rules.

  • Rule-based hierarchical processing recruits the visual dorsal stream.

  • Self-embedding rules additionally activate the parieto-medial temporal pathway.

  • This pathway integrates objects in contexts, and integrates “space” and “meaning”.

Abstract

Hierarchical structures play a central role in many aspects of human cognition, prominently including both language and music. In this study we addressed hierarchy in the visual domain, using a novel paradigm based on fractal images. Fractals are self-similar patterns generated by repeating the same simple rule at multiple hierarchical levels. Our hypothesis was that the brain uses different resources for processing hierarchies depending on whether it applies a “fractal” or a “non-fractal” cognitive strategy. We analyzed the neural circuits activated by these complex hierarchical patterns in an event-related fMRI study of 40 healthy subjects.

Brain activation was compared across three different tasks: a similarity task, and two hierarchical tasks in which subjects were asked to recognize the repetition of a rule operating transformations either within an existing hierarchical level, or generating new hierarchical levels. Similar hierarchical images were generated by both rules and target images were identical.

We found that when processing visual hierarchies, engagement in both hierarchical tasks activated the visual dorsal stream (occipito-parietal cortex, intraparietal sulcus and dorsolateral prefrontal cortex). In addition, the level-generating task specifically activated circuits related to the integration of spatial and categorical information, and with the integration of items in contexts (posterior cingulate cortex, retrosplenial cortex, and medial, ventral and anterior regions of temporal cortex). These findings provide interesting new clues about the cognitive mechanisms involved in the generation of new hierarchical levels as required for fractals.

Introduction

The ability to represent and generate complex hierarchical structures is one of the hallmarks of human cognition. In many domains, including language, music, problem solving, action-sequencing and spatial navigation, humans organize basic elements into higher-order groupings and structures (Badre, 2008, Chomsky, 1957, Hauser et al., 2002, Nardini et al., 2008, Unterrainer and Owen, 2006, Wohlschlager et al., 2003). This ability to encode the relationship between items (words, people, etc.) and the broader structures in which these items are embedded (sentences, corporations, etc) affords flexibility to human behavior. For example, in action sequencing, humans are able to change, add or adapt certain basic movements to particular contexts, while keeping the overall structure (and goals) of canonical motor procedures intact (Wohlschlager et al., 2003). Typical examples of these actions-in-context are ‘grinding the beans’ or ‘re-filling the water container’ in the process of making coffee (Jackendoff, 2002). Individuals can evaluate the need for these actions and omit them if they are unnecessary without impairing the overall procedure of making coffee (Badre and D'Esposito, 2009). This ability is different from simple action sequencing, and seems very limited in non-human animals (Conway and Christiansen, 2001).

A promising method to represent complex hierarchical structures – realized in nature and attractive for experimental research – is the use of recursive embedding processes (Fitch, 2010, Martins, 2012). Recursive embedding refers to the incorporation of a structure inside another structure of the same sort, and it allows the generation of hierarchies with infinite depth using very simple rules. We can add several new elements to a certain hierarchical level using within-level transformation rules (Fig. 1A), but it is only possible to generate multiple hierarchical levels with a single rule if this rule involves recursive embedding (Fig. 1B). When used in association with other rule-based processes, recursive embedding allows the generation of hierarchies that are deep, structurally rich and perceived as attractive. Some examples are the fractal Mandelbrot images or fractal structures in nature such as tree branches, algae, the flower of the Brassica oleracea, snail shells and coastlines. These structures can be extended or sub-divided indefinitely while visual and structural similarity is retained at all scales. These kinds of structures contrast with others with simpler modes of organization such as grass or crop fields, which like bead necklaces, are formed by adding several items to a group at fixed hierarchical levels.

Here we investigate the ability to recognize well-formed visuo-spatial hierarchical structures, based on the application of rules that either operate transformations within a hierarchical level, or rules which generate new self-similar hierarchical levels (Fig. 1). For simplicity, we simply use the expression ‘recursive’ or ‘recursion’ to refer to ‘recursive embedding’.

The processing of visuo-spatial stimuli is often described as occurring in parallel in two different systems – the ventral stream and the dorsal stream (de Haan and Cowey, 2011, Kravitz et al., 2011). The ventral stream, an occipito-temporal network, seems to process object quality or semantic information, with more abstract categories represented in more anterior portions of the temporal lobe (Kravitz et al., 2013). The dorsal stream, an occipito-parietal network, has classically been described as processing spatial information only. Recently, however, this classical view of the dorsal stream has been updated (Kravitz et al., 2011). While projections from the parietal cortex to the prefrontal cortex seem to be important for spatial working memory and visually-guided action, a third system, called the parietal-medial temporal pathway (PMT) appears to be necessary to integrate spatial and semantic information (Kravitz et al., 2011). The PMT pathway connects the dorsal stream with the medial temporal cortex (hippocampus and parahippocampus), through the posterior cingulate (PCC) and retrosplenial cortices (RSC) (Kravitz et al., 2011, Margulies et al., 2009). This pathway appears to be crucial for the retrieval of landmark information during spatial navigation and for the integration of objects in contextual frames (e.g. a mug in a date in a coffee shop) (Aguirre and D'Esposito, 1999, Buzsáki and Moser, 2013, Ino et al., 2007, Ranganath and Ritchey, 2012, Sato et al., 2010) (Fig. 2). We therefore hypothesize that the PMT may play a specific role in the representation of principles that allow the recognition and generation of well-formed hierarchical embeddings in the visuo-spatial domain.

Based on the principles depicted in Fig. 1, we developed two tasks to investigate the cognitive processes involved in the representation of visuo-spatial hierarchies: The Visual Recursion Task (VRT) and the Embedded Iteration Task (EIT). In both tasks participants are exposed to generative processes for a certain number of iterative steps and then asked to make inferences about further iterations. This means that in both tasks participants are asked to extract simple rules from the first iterations which can then be applied to predict further transformations. In VRT, each iterative step generates a new hierarchical level according to one particular spatial rule isomorphic to the rule displayed in previous levels of the hierarchy. The brain requires only one simple rule to be able to generate large self-similar structures (fractals) with an unlimited number of levels. In EIT, new elements are embedded iteratively within a fixed hierarchical level, according to a spatial rule but without generating new levels. It is important to clarify that both tasks are iterative (i.e. a certain rule is applied a given number of times) and both may generate hierarchies of similar complexity (see Fig. 1, Fig. 3).

Our previous research with these tasks (Martins, 2012, Martins and Fitch, 2012) suggests that, in comparison with EIT, performance in VRT is more strongly associated with abstract reasoning and less correlated with specific visuo-spatial cognitive abilities. In the current study, we investigated the neural bases involved in the representation of visuo-spatial hierarchies by comparing the brain circuits active during VRT and EIT. As a control task we introduced a ‘similarity task’ (Positional Similarity Visual Task — PSVT), in which participants were asked to match a target visuo-spatial hierarchy with a set of alternatives. The setup and images displayed were closely matched for all three tasks. As indicated above, our primary hypothesis was that the brain uses different resources for processing identical hierarchical structures depending on whether it applies a “fractal” or a “non-fractal” cognitive strategy.

Section snippets

Participants

40 healthy participants (19 males and 21 females, age range 20–32) took part in the study. All had normal or corrected-to-normal vision, no history of neurological or psychiatric disease, and no current use of psychoactive medications. All completed a short questionnaire screening for previous clinical history and a battery of cognitive tests. Participants, who were all right-handed native German speakers and mostly university students, were recruited online, and gave informed written consent

Behavioral results

All 40 participants performed well within the scanner, and reported no problems in solving the tasks. Behavioral data collected during the fMRI runs showed a high rate of correct responses in VRT (M = 96%, SD = 8%), EIT (M = 91%, SD = 5%) and PSVT (M = 95%, SD = 8%). The percentage of correct answers differed between tasks (repeated-measures ANOVA: F 1,39 = 7.1, p = 0.011): participants scored lower in EIT than in VRT (p < 0.01) and PSVT (p = 0.03). Mean response time was 2.34 s in VRT, 2.56 s in EIT, and 2.59 s in

Discussion

In this study we contrasted the brain networks active during the representation of processes allowing the generation of new hierarchical levels (as required for generating fractals) with the representation of processes that may generate structures of equal complexity but do this without creating new levels. The rationale was that many attractive structures in nature are fractals and, based on our previous research, we hypothesized that these are processed in a specific and very efficient way

Acknowledgments

This work was supported by a research cluster grant "Shared Neural Resources for Music and Language" to WTF and RB (University of Vienna - internal project number FG761002 - and Medical University of Vienna - internal project number FA103FC003), an FCT grant SFRH/BD/64206/2009 to MM, and by ERC Advanced Grant SOMACCA, project number 230604 to WTF. The authors declare no competing financial interests.

References (57)

  • E. Koechlin et al.

    Broca's area and the hierarchical organization of human behavior

    Neuron

    (2006)
  • D.J. Kravitz et al.

    The ventral visual pathway: an expanded neural framework for the processing of object quality

    Trends Cogn. Sci.

    (2013)
  • D. Kumaran et al.

    The emergence and representation of knowledge about social and nonsocial hierarchies

    Neuron

    (2012)
  • D. Lehmann et al.

    Core networks for visual-concrete and abstract thought content: a brain electric microstate analysis

    NeuroImage

    (2010)
  • T.E. Lund et al.

    Motion or activity: their role in intra- and inter-subject variation in fMRI

    NeuroImage

    (2005)
  • M. Nardini et al.

    Development of cue integration in human navigation

    Curr. Biol.

    (2008)
  • T. Nichols et al.

    Valid conjunction inference with the minimum statistic

    NeuroImage

    (2005)
  • K.M. Petersson

    Artificial syntactic violations activate Broca's region

    Cogn. Sci.

    (2004)
  • K.M. Petersson et al.

    What artificial grammar learning reveals about the neurobiology of syntax

    Brain Lang.

    (2012)
  • M.N. Rajah et al.

    Prefrontal contributions to domain-general executive control processes during temporal context retrieval

    Neuropsychologia

    (2008)
  • J. Uddén et al.

    The inferior frontal cortex in artificial syntax processing: an rTMS study

    Brain Res.

    (2008)
  • J.M. Unterrainer et al.

    Planning and problem solving: from neuropsychology to functional neuroimaging

    J. Physiol. Paris

    (2006)
  • G. Aguirre et al.

    Topographical disorientation: a synthesis and taxonomy

    Brain

    (1999)
  • E. Aminoff et al.

    The parahippocampal cortex mediates spatial and nonspatial associations

    Cereb. Cortex

    (2006)
  • A. Baddeley

    Working memory: looking back and looking forward

    Nat. Rev. Neurosci.

    (2003)
  • D. Badre et al.

    Is the rostro-caudal axis of the frontal lobe hierarchical?

    Nat. Rev. Neurosci.

    (2009)
  • S.L. Bengtsson et al.

    The representation of abstract task rules in the human prefrontal cortex

    Cereb. Cortex

    (2008)
  • R.C. Berwick et al.

    Evolution, brain, and the nature of language

    Trends Cogn. Sci

    (2013)
  • Cited by (32)

    • Self-similarity and recursion as default modes in human cognition

      2017, Cortex
      Citation Excerpt :

      Participants were paid 30 Euros for their participation. Note that these participants constitute a subgroup of those used in a recent distinct analysis (Martins, Fischmeister, et al., 2014; Martins, Laaha, et al., 2014); five participants from the original study were excluded since for various reasons no resting-state task data were collected. All subjects gave informed written consent prior to participation in the study, which was approved by the appropriate ethics committee of the Medical University of Vienna.

    • Cognitive representation of “musical fractals”: Processing hierarchy and recursion in the auditory domain

      2017, Cognition
      Citation Excerpt :

      To that effect, we will investigate the relationship between accuracy in ART and in other non-auditory recursive tasks, namely the Visual Recursion Task (VRT) (Martins et al., 2015) and the Tower of Hanoi task (ToH; which requires recursive action sequencing) (Goel & Grafman, 1995a; Halford, Wilson, & Phillips, 1998; Halford, Wilson, & Phillips, 2010). Performance in VRT and ToH was previously shown to be correlated (Martins, Fischmeister, et al., 2014) and they thus offer appropriate tools to tap into the ability to establish cognitive representations of recursion. A close correlation between ART, VRT and ToH would corroborate that ART taps into something specific to recursion.

    • Representing visual recursion does not require verbal or motor resources

      2015, Cognitive Psychology
      Citation Excerpt :

      Although Hypothesis 2 could be criticized for being non-modular (Fodor, 1983; Hornstein, & Pietroski, 2009; Roeper, 2011), there are a number of other cognitive processes, for example those involved in central executive processing, which are implemented by specific neural systems, and yet are available for all domains of cognition (Baddeley, 1998; Fodor, 1983). While recursion has been studied mostly within the linguistic domain, recent research has shown that humans are also able to acquire and apply recursive rules governing the generation of visuo–spatial hierarchies (Martins, 2012; Martins, Fischmeister, Puig-Waldmueller, Oh, Geissler, Robinson, & Beisteiner, 2014; Martins, & Fitch, 2012). This research suggested that the ability to acquire recursive rules in the visuo–spatial domain might crucially depend on the engagement of analytical and effortful cognitive strategies.

    View all citing articles on Scopus
    1

    These authors contributed equally to this work.

    View full text