Elsevier

Cognition

Volume 86, Issue 1, November 2002, Pages 57-70
Cognition

Visual mental images can be ambiguous: insights from individual differences in spatial transformation abilities

https://doi.org/10.1016/S0010-0277(02)00137-3Get rights and content

Abstract

The debate about whether objects in mental images can be ambiguous has produced ambiguous results. In some studies, participants could not reinterpret objects in images, but even in the studies where participants could reinterpret visualized patterns, the results are not conclusive. The present study used a novel task to investigate the reinterpretation of ambiguous figures in imagery, which required the participants to mentally rotate a figure 180 degrees before attempting to “see” an alternate interpretation. In addition, the participants did not know the purpose of the study in advance, nor did they see alternate interpretations of the stimuli; moreover, we explicitly measured individual differences in key mental imagery abilities. Eight of the 44 participants discovered the alternate version while they were memorizing the figure; 16 reported it after mentally rotating an image; and 20 were not able to “see” the alternate version. The ability to rotate images, assessed with an independent task, was highly associated with reports of image reversals, whereas measures of other imagery abilities were not.

Introduction

Visual perception is driven by the nature of the external world, and percepts typically are only subtly affected by the viewer's intentions and beliefs. In contrast, mental images are created by the imaginer, and thus may not be independent of their creator's intentions and beliefs. Indeed, some researchers have claimed that objects in images are inextricably tied to the way a shape was interpreted when it was encoded into memory, and cannot subsequently be reinterpreted during imagery (e.g. Chambers & Reisberg, 1985). According to this view, images are more like descriptions than they are like pictures that can be readily reinterpreted during perception.

The question of whether images are descriptions or depictions is of interest in part because it bears on the “imagery debate” (e.g. Kosslyn, 1980, Kosslyn et al., 2001, Pylyshyn, 1973, Pylyshyn, 1981, Pylyshyn, in press). This debate revolves around the question of whether the depictive aspects of imagery that are evident to introspection reflect functional aspects of the underlying representation, or whether such pictorial characteristics are purely epiphenomenal (like the heat from a light bulb when one is reading, which plays no functional role in the reading process). If instead of describing information, images depict information in a picture-like way, they would retain some of the “raw material” of the perception – and hence could later be interpreted.

Chambers and Reisberg, 1985, Chambers and Reisberg, 1992 concluded from their experiments that people cannot reinterpret patterns in mental images, and hence that mental images are more like descriptions than pictures. However, other researchers subsequently showed that under some circumstances imagined patterns can be reinterpreted (Anderson and Helstrup, 1993, Finke et al., 1989, Peterson et al., 1992, Rouw et al., 1997). At this writing, it is fair to say that the available findings about whether imaged objects can be ambiguous remain ambiguous. Two features of the studies seem particularly pertinent. First, in many of the previous studies, the participants were told in advance, as part of the instructions, that the stimuli were reversible figures. On the one hand, such instructions could help the participants develop appropriate reversal strategies. Indeed, Rock and Mitchener (1992) found that the nature of the instructions has a clear influence on the number of reversals of ambiguous figures in perception. Only one-third of their participants ever reversed when they were not told about the possibility of reversals. However, when told that the figures were reversible, all participants experienced reversals. On the other hand, it is also possible that such instructions could instill inappropriate reversal strategies when participants are acquainted with different ambiguous figures prior to the experiment. For example, Chambers and Reisberg (1985) report that none of their 55 participants (across three experiments) ever reinterpreted an imagined duck/rabbit ambiguous figure. In sharp contrast, Peterson et al. (1992) reported that such reversals commonly occur in mental imagery. In Experiment 1 of their study, 40% of the participants reported image reversals. In fact, Peterson et al. (1992) used the same duck/rabbit ambiguous figure used by Chambers and Reisberg (1985), only changing the demonstration figure they used in the instructions prior to the task (they used the goose/hawk ambiguous figure instead of the ambiguous Mach book, see Fig. 1). Peterson et al. (1992) conclude that using a demonstration figure that was more appropriate for the duck/rabbit ambiguous figure helped the participants discover the alternate version.

However, it is possible that participants who reinterpreted objects in images might not have actually operated on the image, but instead they might simply have recalled a different interpretation that they stored when they initially memorized the stimuli. In order to rule out such an alternative account, in the present study we examine the ability to reinterpret ambiguous figures in mental images but do not alert the participants in advance that they will see ambiguous figures or try to reverse them. We use a drawing that changes its apparent identity only when it is inverted. The face appears as a young woman, and, when turned upside-down, as an old woman. Unlike other ambiguous figures (e.g. the duck/rabbit figure, or chef/dog figure), the inverted alternate version is hardly noticeable without prior experience. This type of reversal requires a new interpretation of the image components (e.g. the nose in one orientation becomes part of the hair upon inversion), and a new reference frame (as discussed by Peterson et al., 1992). In particular, the top and bottom of the figure must be reassigned to discover the alternate version. Thus, this ambiguous figure requires a rotation of the reference frame by 180 degrees (from right-side-up to upside-down), which makes an incidental discovery (by merely looking at the figure) much less likely than with other figures.

The second key aspect of previous studies is that even when some participants could interpret the imaged pattern in novel ways, not all participants could do so. People clearly differ in their mental imagery abilities (e.g. Kosslyn et al., 1984, Richardson, 1994), and people who have difficulty with specific types of imagery processing may have difficulty in challenging imagery tasks that draw on such processing. Thus, in the present study we assess four key individual imagery abilities, specifically the ability to generate (i.e. form on the basis of stored information) high-resolution images, the ability to compose images from separate parts, the ability to inspect imaged patterns, and the ability to rotate objects in images. We seek to discover whether individual differences in specific aspects of mental imagery predict performance when people visualize ambiguous figures.

Section snippets

Participants

Forty-four people (23 female, 21 male, mean age 22 years, range 18–32 years) volunteered to take part in this study. The participants gave their informed consent and completed a health history questionnaire prior to taking part in this study. None of them reported any health problems and all participants had normal or corrected-to-normal vision. The participants were Harvard University students or professionals from the Boston area. All participants were naive regarding the purpose of this

Ambiguous figures

Eight of 44 participants discovered the alternate version during the learning phase, when the drawing of the ambiguous figure was actually visible. Seven of these participants saw the old woman orientation, and thus discovered the upside-down young woman interpretation, whereas only one participant who saw the young woman orientation then discovered the upside-down old woman interpretation. The remaining 36 participants confirmed during debriefing that they did not see the upside-down version

Discussion

The present study produced two important findings. First, these results demonstrate that alternate interpretations can in fact be discovered in mental images. Contrary to findings reported by Chambers and Reisberg (Chambers and Reisberg, 1985, Chambers and Reisberg, 1992, Reisberg and Chambers, 1991), 16 out of 36 participants were able to identify an unanticipated shape in their mental image when given partial stimulus cues. The participants were not led to expect a new interpretation in

Acknowledgements

This study was supported by a grant from the Swiss NSF awarded to the first author, and by NIH grant R01 MH60734-01 and by NIMA grant NMA201-01-C-0032 to the second author. We thank Jennifer Shephard for her role in creating the visual cognition test battery. We thank Judith Danovitch, Marie Burrage and Jason Davis for their help in collecting the data. Part of this study was presented at the Annual Meeting of the Cognitive Neuroscience Society in San Francisco, April 2000.

References (22)

  • D. Chambers et al.

    What an image depicts depends on what an image means

    Cognitive Psychology

    (1992)
  • S.M. Kosslyn et al.

    Individual differences in mental imagery ability: a computational analysis

    Cognition

    (1984)
  • R. Rouw et al.

    Detecting high-level and low-level properties in visual images and visual percepts

    Cognition

    (1997)
  • R.E. Anderson et al.

    Visual discovery in mind and on paper

    Memory and Cognition

    (1993)
  • D. Chambers et al.

    Can mental images be ambiguous?

    Journal of Experimental Psychology: Human Perception and Performance

    (1985)
  • E.L. Cochran et al.

    Task-specific strategies of mental “rotation” of facial representations

    Memory and Cognition

    (1983)
  • J.D. Cohen et al.

    PsyScope: a new graphic interactive environment for designing psychology experiments

    Behavioral Research Methods, Instruments, and Computers

    (1993)
  • R.A. Finke et al.

    Reinterpreting visual patterns in visual imagery

    Cognitive Science

    (1989)
  • I.S. Howard

    Human visual orientation

    (1982)
  • I.E. Hyman et al.

    Reconstruing mental images. Problems of method (Emory Cognition Project Tech. Rep. No. 19)

    (1991)
  • S.M. Kosslyn

    Image and mind

    (1980)
  • Cited by (50)

    • Eye movements during visual imagery and perception show spatial correspondence but have unique temporal signatures

      2021, Cognition
      Citation Excerpt :

      Consequently, recalling a visual scene would reenact the eye movement sequence of encoding, leading to spatially corresponding fixation locations (Laeng & Teodorescu, 2002; Spivey & Geng, 2001; but see Foulsham & Kingstone, 2012; Johansson et al., 2012). Furthermore, there is consensus that eye movements on a blank screen can originate from attention shifts associated with the generation, maintenance or inspection of mental images (Johansson et al., 2012; Kosslyn, Thompson, & Ganis, 2006; Mast & Kosslyn, 2002). Interestingly, most studies that investigated the role of eye fixations in visual imagery have focused on the spatial correspondence, for example by comparing the distribution of fixation locations across different areas of interest (AOIs) on the screen during encoding and visual imagery (or recall) (Johansson & Johansson, 2014; Laeng, Bloem, D'Ascenzo, & Tommasi, 2014; Laeng & Teodorescu, 2002; Martarelli & Mast, 2013; Richardson & Spivey, 2000; Spivey & Geng, 2001).

    • Revisiting mental rotation with stereoscopic disparity: A new spin for a classic paradigm

      2019, Brain and Cognition
      Citation Excerpt :

      In our experiment, if stereoscopic disparity enriches the observer’s percept, the enriched information would afford a larger variety of neural computations (e.g., by recruiting brain regions processing binocular disparity cells; Freeman, 1999; Goncalves & Welchman, 2017; Parker, 2007) or behavioral strategies (e.g., manual rotation simulation; Price & Lee, 2010). Alternatively, if stereoscopic information reduces perceptual ambiguity (Banks, Read, Allison, & Watt, 2012; Caziot & Backus, 2015), akin to the reduction of ambiguity illustrated by the disambiguation of the reversible cubes illusion (Fig. 1B, C, E), neural operations are simplified because fewer perceptual ‘options’ need to be entertained (Mast & Kosslyn, 2002; Peterson, Kihlstrom, Rose, & Glisky, 1992). To explore these two possibilities, we measured the complexity of the evoked brain activity using multiscale entropy.

    • The role of gesture as simulated action in reinterpretation of mental imagery

      2019, Acta Psychologica
      Citation Excerpt :

      In this way, we considered any interpretation that was offered by participants during perception as a valid interpretation of the stimuli. Similar to other studies using this paradigm (e.g., Chambers & Reisberg, 1985; Mast & Kosslyn, 2002; Peterson et al., 1992), a limited number of post-hoc experimenter decisions were made to count answers as correct that were not mentioned in the corresponding perception phase, but that shared the same objective features as the target figure (e.g., reindeer head or impala were both counted as correct answers for the deer figure). Frequency of specific (in)correct interpretations can be retrieved from the Open Science Framework https://osf.io/725te/.

    • Mental and perceptual feedback in the development of creative flow

      2016, Consciousness and Cognition
      Citation Excerpt :

      Others have questioned the theory that MI is insufficient for full, reinterpretive creative cognition and argued that MI is sufficiently like perceptual imagery to allow for the complexities of creative thought (Shepard, 1978). Studies have since provided evidence that there is spatial and temporal equivalence between mental and perceptual imagery (Finke, 1980), that restructuring and reconstrual does frequently occur, and that geometric data are preserved in MI to allow new categorical interpretations to be formed (Finke, Pinker, & Farah, 1989; Mast & Kosslyn, 2002; Peterson, Kihlstrom, Rose, & Glisky, 1992). Wiseman, Watt, Gilhooly, and Georgiou (2011) showed that creative ability was linked to the ability to reinterpret ambiguous mental images mentally, confirming that agility in internally manipulating imagery is a fundamental component of creativity.

    • Frontal theta activity is pronounced during illusory perception

      2014, International Journal of Psychophysiology
    View all citing articles on Scopus
    View full text