Metaphor plays an important role in the transmission of new schemata of thought during cognitive therapy (Muran & Digiuseppe, 1990). In the field of psycholinguistics, neural correlates of metaphor comprehension have been widely investigated over the past decade (Benedek et al., 2014; Bohrn, Altmann, & Jacobs, 2012; Rapp, Leube, Erb, Grodd, & Kircher, 2004; Rapp, Mutschler, & Erb, 2012; Yang, 2014). However, the underlying neural mechanisms of metaphor in the context of cognitive therapy remain unknown.

Therapeutic metaphor is believed to have potential in producing therapeutic insight. Conceptual metaphor theory (CMT) posits that conceptual metaphor is a cognitive tool that people use to represent an abstract target concept in terms of a superficially unrelated, typically more concrete source concept (Lakoff & Johnson, 1980). According to MacCormac (1985), metaphor is “the creative cognitive process of activating widely separated areas of long-term memory and of combining normally unassociated concepts.” First, the use of metaphor in cognitive therapy helps clients to gain insight and make changes in their lives (Kok, Lim, & Low, 2011; Pollio, Barlow, Fine, & Pollio, 1977). Therapists of various theoretical orientations agree that insight is a salient characteristic of a “good therapist” (Novotney, 2013) and insight is also termed cognitive restructuring in cognitive therapy (Grosse Holtforth et al., 2007). The use of metaphor in psychotherapy can introduce a different way of perceiving and organizing the world, making it easier to breach the epistemological gap between old and new knowledge (Muran & Digiuseppe, 1990). Because metaphor involves both imagistic and verbal processes, it can further facilitate the acquisition, retention, and recall of new paradigms (Muran & Digiuseppe, 1990; Paivio, 1986). Prior researches have indicated that novel metaphoric language constituted the contents of specific therapeutic insights (Barlow, Pollio, & Fine, 1977; Elliott et al., 1994), and clients rated therapy sessions in which they recalled their therapists’ intentional use of metaphor as more helpful than other sessions (Martin, Cummings, & Hallberg, 1992). Additionally, empirical evidence has demonstrated that metaphor is effective in ameliorating irrational beliefs (Komasi, Saeidi, Zakiei, Amiri, & Soltani, 2016), as well as in changing catastrophizing and pain biology knowledge (Gallagher, Mcauley, & Moseley, 2013).

Second, in the other domains, the use of metaphor could foster insight process and facilitate problem solving (Keefer & Landau, 2016; Slepian, Weisbuch, Rutchick, Newman, & Ambady, 2010). The investigators found that the use of prototypical metaphor in solving scientific innovation problems facilitates insight process (Yang et al., 2016). Archimedes cracking the mystery of the crown is a famous example; he suddenly realized that he could use the water displacement to work out the volume and density of the king’s crown and shouted “Eureka!” Insightful problem solving is one classical subfield of creativity (Abraham & Windmann, 2007), and metaphor is considered as a creative language expression due to its properties of novelty and imagery (Lakoff & Johnson, 1980). Metaphor also is used to investigate creativity. For example, Benedek et al. (2014) used metaphor to investigate the neural correlates of creativity production by generating novel metaphors or literal synonyms to complete sentences. In a word, therapeutic metaphor has distinct potentials in producing insight for mental distress problems due to its cognitive superiority in conceptual transference and creative characteristics.

However, the neural mechanisms of therapeutic metaphor and its correlation with insight experience have yet to be elucidated. The difficulty in collecting relevant data may explain the lack: a well-designed study would have to record brain activity in response to metaphor both in real time and in the context of psychotherapy. Moreover, the latter requirement would have to repeatedly present therapeutic metaphors for the valid isolation and measurement of related neural activity.

Micro-counseling scenarios have been used repeatedly to present therapeutic metaphor in prior research (Hu, Zhang, Zhang, Yu, & Zhang, 2018; Jiang, Yu, Zhang, & Zhang, 2016; Yu, Zhang, Zhang, Zhang, & Luo, 2016). In this context, “maladaptive” and “adaptive” thinking were reduced to “problems” and “solutions,” respectively. Participants of such studies read a description of a mental distress problem (e.g., “I feel extremely frustrated, because I am beginning a major that I dislike”) followed by one of three randomly matched solutions: metaphorical, literal, or problem-restatement. The first type reinterprets the functional aspects involved in a problem by applying a metaphor, including various figures of speech; for example, “Success in life is not holding good cards but playing bad cards well.” The literal solution reinterprets the functional aspects of an issue by providing a plain, soothing expression in literal language: “success mainly depends on effort, it is important to do a good job now.” The problem-restatement solution rephrases the original problem using literal wording: “your current major is not your favorite, you lack confidence in your future.” Researchers found that the metaphorical solutions yielded the greatest degree of insightfulness and novelty for both psychotherapists and students, followed respectively by literal and problem-restatement solutions (Yu et al., 2016). The problem-solution scenarios in the micro-counseling context referenced the riddle paradigm, which was used in a neuroimaging study on insight to observe brain activities of insight processing (Luo & Niki, 2003). The investigators asked participants to read a riddle and then provided them with the solution. The therapeutic metaphor in the micro-counseling scenario thus meets our criteria for a method to detect neural mechanisms of therapeutic metaphor: (a) capturing online multiple events of therapeutic metaphor in one functional magnetic resonance imaging (fMRI) scanning session, and (b) ecologically valid stimuli that resemble an actual psychotherapy scenario.

We further sought to identify the specific brain regions activated by therapeutic metaphor, because empirical evidence has demonstrated that therapeutic metaphor is more effective in alleviating mental distress (Gallagher et al., 2013; Komasi et al., 2016), and the insightfulness experiences produced by therapeutic metaphor in the micro-counseling scenarios could predict the reduction of negative affect after the intervention (Hu et al., 2018). The neuroimaging studies demonstrated that metaphor comprehension induced different activities than did literal sentences; metaphor significantly increased signal changes in the left inferior frontal gyrus (IFG; Brodmann area (BA) 45/47), as well as in the left inferior and middle temporal gyri (ITG and MTG, respectively) (Rapp et al., 2004; Bohrn et al., 2012; Rapp et al., 2012). Moreover, it was found that metaphor or simile utterances in a naturally evolving, continuous, and coherent story rather than isolated or single sentences resulted in significantly higher levels of activation in the left IFG, MTG, and superior temporal gyrus (STG) (Eviatar & Just, 2006; Nagels et al., 2013). Prior research also reported that metaphorical solutions to mental distress problems were associated with the experience of insightfulness (Yu et al., 2016). To the best of our knowledge, fMRI technique has not been used to investigate insight-related brain activation induced by therapeutic metaphor. Other domains of insight research, however, have benefitted from numerous neuroimaging studies. One important early neuroimaging study on insight found that receiving the answer to a riddle increased hippocampal activity (Luo & Niki, 2003). Other research found that the hippocampus and amygdala were both activated by induced or self-generated insight in a perceptual task, as well as in a compound remote-associates task (Huang, Fan, & Luo, 2015; Kizilirmak, Thuerich, Folta-Schoofs, Schott, & Richardson-Klavehn, 2016; Ludmer, Dudai, & Rubin, 2011; Tik et al., 2018; Zhao et al., 2013). The literature on metaphorical language comprehension has featured scant discussion of the hippocampus and amygdala.

The present study sought to identify neural correlates of therapeutic metaphor in micro-counseling scenarios. We hypothesized that: (a) brain regions associated with basic metaphorical language comprehension, such as the IFG, MTG, and STG, would be involved in the processing of therapeutic metaphor; and (b) that brain regions correlated with the experience of insight but not associated with basic metaphor understanding, such as the hippocampus and amygdala, also would be recruited in the processing of therapeutic metaphor.

Methods

Participants

The present study recruited 22 undergraduate and graduate volunteers (12 women, 10 men; age range, 19-27 years; mean (M) = 22.95; standard deviation (SD) = 2.04) from universities in Beijing, China. None of the participants had any medical, neurological, or psychiatric illness. The study was approved by the ethical committee of the Institute of Psychology, Chinese Academy of Sciences and was performed according to the Declaration of Helsinki. All participants gave written, informed consent and received compensation for their participation of the study. Due to excessive head movement during the fMRI scans (see fMRI data analysis below), two participants were removed from the final analyses.

The sample size was calculated with the G*Power 3.1. The effect size of f = 2.03 (this f value was obtained by a convertible calculation of \( {\eta}_p^2 \) = 0.805 for insightfulness rating) in a previous study using the same paradigm (Yu et al., 2016), and thus 4 observations (i.e. approximately 1.33 participants) were required to detect such effect size with a type I error of 5% and 80% power in the one-way repeated-measures ANOVA test. However, we could not determine whether this small sample size was suitable to detect brain activation or not. We assumed an effect as medium as f = 0.3 in the present study, 61 observations (i.e. approximately 20.33 participants) were required to detect such effect size with a type I error of 5% and 80% power in the one-way repeated-measures ANOVA test. It indicated that the sample size in the present study was appropriate.

Materials

The experiment consisted of 75 micro-counseling scenarios. Each of these featured of a description of mental distress (the problem) followed by a matched response (the solution). The material database included three types of the latter: a metaphorical solution (MET), which applied figures of speech to reinterpret the functional or emotional reactions present in a situation; a literal solution (LIT), which provided a plain, soothing reinterpretation of the functional or emotional reactions present in a situation in literal language; and a problem-restatement solution (PRO-R), which rephrased the original problem using literal wording. Only one of these three solutions was randomly matched to each problem in the formal experiment. For example, to the problem “I feel extremely frustrated, because I am beginning a major that I dislike,” we provided the following: a metaphorical solution, “Success in life is not holding good cards but playing bad cards well”; a literal solution, “success mainly depends on effort, it is important to do a good job now”; and a problem-restatement solution, “your current major is not your favorite, you lack confidence in your future.” Several examples were provided in the Table 1. The length of each problem and solution description was limited to 24–30 Chinese characters, including punctuation. There were thus 75 problems to which 25 solutions of each type were randomly assigned for each participant. These scenarios were previously assessed in the study of Yu et al. (2016) by experts in psychology and university students on a nine-point Likert scale. Their findings showed that different levels of insightfulness and novelty could be induced by the three types of solutions (Insightfulness: for the expert sample, MET, M = 6.44, SD = 1.27, LIT, M = 5.25, SD = 0.74, PRO-R, M = 3.53, SD = 1.43; for the student sample, MET, M = 6.89, SD = 0.91, LIT, M = 6.31, SD = 1.08, PRO-R, M = 3.74, SD = 1.33. Novelty: for the expert sample, MET, M = 6.41, SD = 1.10, LIT, M = 3.37, SD = 1.83, PRO-R, M = 3.14, SD = 1.18; for the student sample, MET, M = 6.67, SD = 1.04, LIT, M = 5.41, SD = 1.73, PRO-R, M = 3.38, SD = 1.34).

Table 1 Examples of the experimental materials

Design and procedure

The present study used fMRI to detect brain responses induced by metaphorical, literal, and problem-restatement solutions with a within-subjects design. Participants were informed that they would read micro-counseling scenarios, each of which consisted of a problem (i.e., the descriptions of ideas or thoughts the clients may have when encountering negative events or problematic settings) and a corresponding solution (i.e., the descriptions of ideas or thoughts the psychological therapists may provide). They were instructed to imagine vividly the pairings as if they were part of an interactive counseling scenario and to rate the problems and solutions from the viewpoint of the clients. Descriptions of the problems and solutions were presented to participants in the scanner using E-prime software (Version 2, Psychology Software Tools).

To first familiarize participants with the procedure and pace of the task, they completed a training session outside the scanner; this procedure used six scenarios that were not presented in the formal fMRI scan. In the formal scanning phase, 75 scenarios were presented randomly and allocated into 3 blocks. Thus, there were 25 trails in each block. Within each block, three different types of solutions were pseudo-randomly organized and solutions from the same category were presented no more than three times in a row to avoid habituation and order effects. Each problem and solution was presented as a complete sentence in a line of white Chinese characters against a black background; it was projected onto a screen visible to the participants through a mirror. As shown in Fig. 1, each trial began with a fixation cross presented in the center of the screen with a random jitter of 1–4 s. The problem was then displayed for 8 s followed by a second jitter of 1–4 s. The solution was then presented for 8 s, after which the participants were allowed 6 s to rate the emotional valence of the solution (i.e., to what extent the solution was emotionally positive) on a nine-point Likert scale; a score of 1 indicated “not at all,” whereas a score of 9 corresponded to “very much.” After the fMRI scanning (approximately 40 minute for tasks) was completed, in order to know more details about the participants' experiences while reading these micro-counseling scenarios, 2 minutes later they were requested to rate the materials again at another quiet laboratory (approximately 40 minutes for tasks). For this second evaluation, participants were instructed to assess the problems in terms of emotional valence and adaptability (i.e., to what extent the solution was functional or helpful in adapting to challenges) and to assess the solutions in terms of emotional valence, adaptability, novelty (i.e., to what extent the solution was distinctive and unexpected), and insightfulness (i.e., to what extent there was a cognitive “click” or new enlightenment that would improve understanding of a problem) on a nine-point Likert scale; 1 indicated “not at all” and 9 indicated “very much,” the same as Yu et al. (2016). They were asked to assess the problems and matched solutions based on their present experiences and not to recall the ratings assessed in the fMRI scanner deliberately. The sequence of materials presented was identical to that of the scanning phase.

Fig. 1
figure 1

One trial. The participants first read a problem and then a solution. The two were presented for 8 s, and then participants were allowed 6 s to rate to what extent the solution was emotionally positive on a nine-point Likert scale. There was a jittered fixation cross of 1-4 seconds before each problem and solution

fMRI data acquisition and analysis

All scanning was performed with a Siemens Trio 3.0 Tesla MR-scanner using a standard radio frequency head coil. The head was fixed with foam pads during scanning to minimize head movement. A T2*-weighted echo-planar imaging (EPI) sequence based on blood oxygenation level-dependent (BOLD) contrast was acquired with 33 interleaved axial slices to cover the whole brain. The imaging parameters were as follows: thickness = 4.0 mm, repetition time (TR) = 2 s, echo time (TE) = 30 ms, flip angle (FA) = 90 o, field of view (FOV) = 192 × 192 mm2, matrix size = 64 × 64 mesh, and voxel size = 3.0 × 3.0 × 4.0 mm3. For each participant, a high-resolution T1-weighted 3D magnetization prepared rapid gradient echo pulse (MPRAGE) sequence was acquired for the coregistration and standardization to a template brain (thickness = 1.00 mm, TR = 2600 ms, TE = 3.02 ms, FOV = 256 × 256 mm2, and voxel size = 1.0 × 1.0 × 1.0 mm3) .

Preprocessing and statistical analyses were performed using Statistical Parametric Mapping (SPM8, http://www.fil.ion.ucl.ac.uk/spm/). For preprocessing, the initial four time points were removed to allow for signal stabilization and participant adaptation. The images of each subject were then corrected for slice timing, realigned for head motion correction, co-registered to the high-resolution 3D-T1 anatomical images, spatially normalized using DARTEL and smoothed with an 8-mm, full-width, half-maximum (FWHM) Gaussian spatial kernel. The exclusion criteria for head motion were a translation of greater than 2.5 mm or a rotation of more than 2.5 o. Data from two participants were eliminated from the subsequent data analyses.

For the first-level analysis, data from the remaining 20 participants were statistically analyzed using general linear models with SPM8. To examine the neural activity associated with the processing of the three types of solutions, three separate regressors were defined (MET, metaphorical solutions; LIT, literal solutions; PRO-R, problem-restatement solutions). These were time-locked to the onset of the solution presentation of 8 s and then convoluted using the canonical hemodynamic response function (HRF). A high-pass filter with a cutoff frequency of 1/128 Hz was used to correct for low-frequency components, and serial correlations were corrected using an autoregressive AR (1) model. Motion realignment parameters also were modeled to account for differences related to head movements.

The resulting contrast images from the first-level analysis were then submitted to a full factorial group analysis of variance (ANOVA) using a random effects model. To determine the neural activity related to the metaphorical solutions, we conducted two analyses: metaphorical versus problem-restatement (MET > PRO-R) and metaphorical versus literal (MET > LIT); to determine neural activation associated with the literal solutions, we performed two additional analyses: literal versus problem-restatement (LIT > PRO-R) and literal versus metaphorical (LIT > MET). We then completed a conjunction analysis using the contrasts of metaphorical versus problem-restatement and metaphorical versus literal [(MET > PRO-R) ∩ (MET > LIT)] to further clarify the neural network uniquely associated with the metaphorical solutions; a similar conjunction analysis was performed to characterize the neural networks common to metaphorical and literal solutions using the contrasts of metaphorical versus problem-restatement and literal versus problem-restatement [(MET > PRO-R) ∩ ( LIT > PRO-R)]. Whole brain search results and small-volume correction (SVC) results were reported in the present study. In previous studies, hippocampus and amygdala often were found in the insight processing, so we were interested specifically in detecting the activations of the two regions. When whole-brain results could not detect them, we would use the method of SVC. SVC (Worsley et al., 1996) is a correction you can apply when your a priori hypothesis do not apply the entire brain but only to certain areas that can be included in a volume. In the present study, for specific regions of interest (ROIs) of bilateral hippocampus and amygdala, we applied a SVC on these ROIs with an anatomical mask according to the AAL template (Tzourio-Mazoyer et al., 2002) provide by WFU PickAtlas software (Maldjian, Laurienti, Kraft, & Burdette, 2003). For the whole-brain analysis, the threshold was set at p < 0.001 (voxel level, uncorrected), p < 0.05 (cluster level, uncorrected), and 50 continuous voxels. For the SVC analysis, the threshold was set at p < 0.001 (voxel level, uncorrected), p < 0.05 (cluster level, FWE corrected).

We conducted two analyses to clarify the relationship between insightfulness experiences and brain activities in amygdala and hippocampus. First, hierarchical linear modeling (HLM) was used to test whether the correlation was statistically significant between insightfulness experience and the activities in hippocampus and amygdala. HLM could simultaneously investigate relationships within and between hierarchical levels of grouped data, thereby making it more efficient at accounting for variance among variables at different levels (Woltman, Feldstain, MacKay, & Rocchi, 2012). Some researchers have demonstrated that the utility of HLM analysis was critical to fMRI studies, which separate within-participants variance from between-participants factors (Schacht et al., 2011). In the present study, the statistical software package HLM 7.0 (Raudenbush, Bryk, & Congdon, 2010) was used with different solution types nested within each study participant. The signal values extracted from each ROI were entered as the outcome variable in the two-level HLMs, with solution types (Level 1: MET, LIT, PRO-R) nested within participants (Level 2). The ROIs of bilateral amygdala and hippocampus were defined as spheres with a radius of 0.5 mm centered on the peak coordinates in the conjunction analysis for (MET > PRO-R) ∩ (MET > LIT). Level 1 parameters (MET, LIT, PRO-R) were dummy-coded. Then, we conducted another different general linear model with SPM8 to test the relationship between insightfulness experiences and the activities in hippocampus and amygdala. We defined high-insight metaphor events (HI-MET, above mean) and low-insight metaphor events (LI-MET, below mean) for 16 participants based on their own average score of insightfulness and for 4 participants based on their median (when average score failed in dividing metaphor events into roughly equal HI-MET and LI-MET events), and then compared them in the contrast of high-insight metaphor versus low-insight metaphor (HI-MEI > LI-MEI). For this analysis, the threshold was set at p < 0.005 (voxel level, uncorrected).

Results

Behavioral ratings in the scanning and rating phases

During the fMRI scanning phase, only the emotional valence of the solutions was assessed. A one-way repeated-measures ANOVA found a significant main effect, F (2, 38) = 519.84, p < 0.001, \( {\eta}_p^2 \) = 0.965, 90% CI = [0.942, 0.973] and the adjusted p < 0.001 (Bonferroni correction for multivariates). A Bonferroni simple effects analysis further showed that there was no significant difference in the emotional valence ratings between the metaphorical and literal solutions (MET: M = 7.27, SD = 0.55; LIT: M = 7.23, SD = 0.45, respectively) but that they were significantly higher than the problem-restatement solutions (PRO-R: M = 2.98, SD = 0.51; Fig. 2A).

Fig. 2
figure 2

Behavioral ratings in the scanning and rating phases. (A) Emotional valence ratings of the metaphorical, literal and problem-restatement solutions during the scanning phase. (B1) Insightfulness, (B2) novelty, (B3) emotional valence, and (B4) adaptability ratings of the metaphorical, literal and problem-restatement solutions in the rating phase. There were significant differences on insightfulness and novelty ratings in the pairwise comparisons among metaphorical, literal and problem-restatement solutions. **p < 0.01. ***p < 0.001. MET = metaphorical; LIT = literal; PRO-R = problem-restatement

In the rating phase, both the problems and solutions were assessed. For the former, emotional valence and adaptability were rated. One sample t-tests for the scores of emotional valence and adaptability indicated that the ratings for emotional valence (M = 2.64, SD = 0.65) and adaptability (M = 2.88, SD = 1.19) were both significantly lower than the neutral point of the scales (score of 5 on the 9-point scale), suggesting that the problems represented emotionally negative and maladaptive situations.

For the solutions, insightfulness, novelty, emotional valence, and adaptability were assessed. One-way repeated-measures ANOVAs were performed separately. We found significant main effects of solution type on insightfulness and novelty: for insightfulness, F (2, 38) = 66.43, p < 0.001, \( {\eta}_p^2 \) = 0.778, 90% CI = [0.649, 0.830] and the adjusted p < 0.001 (Bonferroni correction for multivariates); for novelty, F (2, 38) = 51.92, p < 0.001, \( {\eta}_p^2 \) = 0.732, 90% CI = [0.582, 0.796] and the adjusted p < 0.001 (Bonferroni correction for multivariates). Simple effects analyses for insightfulness and novelty revealed significant differences in the three pairwise comparisons among the metaphorical (insightfulness: M = 6.73, SD = 1.04; novelty, M = 6.59, SD = 1.15), literal (insightfulness: M = 6.08 , SD = 1.16; novelty, M = 5.15, SD = 1.78), and problem-restatement solutions (insightfulness: M = 3.29, SD = 1.42; novelty, M = 3.10, SD = 1.46), p < 0.01 for each (Bonferroni correction for multiple comparisons) (Fig. 2B1 and B2). We further found significant main effects of solution type on emotional valence and adaptability: for emotional valence, F (2, 38) = 311.63, p < 0.001, \( {\eta}_p^2 \) = 0.943, 90% CI = [0.906, 0.956] and the adjusted p < 0.001 (Bonferroni correction for multivariates); for adaptability, F (2, 38) = 118.61, p < 0.001, \( {\eta}_p^2 \) = 0.862, 90% CI = [0.778, 0.895] and the adjusted p < 0.001 (Bonferroni correction for multivariates). Simple effects analyses indicated that there was no difference in the emotional valence and adaptability between metaphorical (emotional valence: M = 7.12 , SD = 0.61; adaptability: M = 6.92, SD = 0.76) and literal solutions (emotional valence: M = 6.95, SD = 0.54; adaptability: M = 6.71, SD = 0.78); the scores in metaphorical and literal solutions were significantly higher than in problem-restatement solutions (emotional valence: M = 2.87, SD = 0.74; adaptability: M = 2.99, SD = 1.33), p < 0.001 for each (Bonferroni correction for multiple comparisons) (Fig. 2B3 and B4). It should be noted that we assessed insightfulness, novelty, adaptability, and emotional valence after the fMRI scanning rather than during the fMRI scanning and got similar response patterns to previous study (Yu et al., 2016) [e.g., Insightfulness: present study (MET, M = 6.73; LIT, M = 6.08; M = 3.29) vs. previous study for student sample (MET, M = 6.89; LIT, M = 6.31; PRO-R, M = 3.74)], indicating these ratings in the present study were reliable.

Imaging results

Neural activations related to the metaphorical and literal solutions

Concerning the neural activation related to metaphorical solutions, the contrast between MET > PRO-R indicated that many brain areas showed greater activation in response to metaphorical solutions than problem-restatement solutions. These areas included the bilateral inferior frontal gyrus, left superior temporal gyrus, bilateral middle temporal and occipital gyri, bilateral hippocampus, amygdala, fusiform gyrus, left insula, postcentral gyrus, right lingual gyrus, superior occipital gyrus, and the bilateral thalamus. The contrast between MET > LIT revealed a widespread network pattern of brain activity similar to that shown in the contrast between MET > PRO-R (Table 2).

Table 2 Brain regions activated in the contrasts of MET > PRO-R, MET > LIT, LIT > PRO-R, and LIT > MET

For the literal solutions, a contrast between LIT > PRO-R showed increased activation in the left postcentral and precentral gyri, paracentral lobule, superior parietal lobule, insula, cingulate gyrus, right cuneus, parahippocampal gyrus, and bilateral anterior cingulate cortex. The contrast between LIT > MET revealed an enhancement of activity in the left and right precuneus, left superior parietal lobule, medial frontal gyrus, and anterior cingulate cortex. The pattern of brain activity associated with the literal solutions was different from that which was elicited by metaphorical solutions (Table 2).

Common activation patterns with conjunction analysis

Conjunction analyses using MET > PRO-R and MET > LIT contrasts found that a widely distributed network of brain areas was significantly activated in response to metaphorical solutions. These areas included the left inferior frontal gyrus, superior and middle temporal gyri, bilateral hippocampus, amygdala, left fusiform and lingual gyri, and the bilateral middle and inferior occipital gyri (Table 3; Fig. 3).

Table 3 Conjunction analyses for (MET > PRO-R) ∩ (MET > LIT) and (MET > PRO-R) ∩ (LIT > PRO-R)
Fig. 3
figure 3

Conjunction analysis for (MET > PRO-R) ∩ (MET > LIT). Brain regions showed more activation specifically for the metaphorical solutions. MET = metaphorical; LIT = literal; PRO-R = problem-restatement

Conjunction analyses using MET > PRO-R and LIT > PRO-R contrasts found several significant activation clusters, including the left postcentral and precentral gyri, paracentral and superior parietal lobules, cingulate gyrus, insula, thalamus, right cuneus, and parahippocampal gyrus (Table 3; Fig. 4).

Fig. 4
figure 4

Conjunction analysis for (MET > PRO-R) ∩ (LIT > PRO-R). Brain regions common to metaphorical and literal solutions. MET = metaphorical; LIT = literal; PRO-R = problem-restatement

Correlations between neural activations and insightfulness

The results of HLM revealed that, across all the participants, the regression coefficients between insightfulness experiences and the activities of four ROIs were all positive and statistically significant, left amygdala, β = 0.276, p < 0.001; right amygdala, β = 0.183, p < 0.05; left hippocampus, β = 0.343, p < 0.001; right hippocampus, β = 0.248, p < 0.001 (Fig. 5A), respectively, indicating significant correlations between insightfulness and the activations of bilateral hippocampus and amygdala.

Fig. 5
figure 5

Correlation analyses between the right hippocampus and insightfulness experience. (A1) The peak coordinates (39, -12, -24) of the right hippocampus activated in the conjunction analyses for (MET > PRO-R) ∩ (MET > LIT). (A2) Correlation between insightfulness experience and the BOLD signal in right hippocampus, the line indicates the fitting regression equation. (B) Right hippocampus activation in the contrast of HI-MET > LI-MET. MET = metaphorical; LIT = literal; PRO-R = problem-restatement; HI-MET = high insightful metaphor; LI-MET = low insightful metaphor

The contrast between HI-MET > LI-MET (Fig. 5B) found significant activations in the bilateral hippocampus and bilateral amygdala (Hippocampus_L, MNI [7 -12 -36], Tuncorrected = 3.05; Hippocampus_R, MNI [24 39 -24], Tuncorrected = 3.93; Amygdala_L, MNI [-30 0 -18], Tuncorrected = 2.89; Amygdala_R, MNI [-36 0 -27], Tuncorrected = 2.89) with a loose threshold (p < 0.005, voxel level, uncorrected), indicating significant relationship between insightfulness and the activations of bilateral hippocampus and amygdala.

Discussion

To the best of our knowledge, the present study is the first to examine directly the neural correlates of metaphor in a psychotherapy scenario. Therapeutic metaphor activated two neural networks: one is associated with basic metaphorical language processing, including the IFG, MTG, and STG, and the other is a specific network correlated with insightful problem solving, which includes the hippocampus, amygdala, and fusiform gyrus. These data were compared with activity elicited by literal and problem-restatement solutions.

General neural network involved in basic metaphorical language processing

Metaphorical solutions activated a basic neural network involved in metaphorical language processing that includes the bilateral IFG, left STG, and MTG. Meta-analyses of neuroimaging studies on metaphorical language (Bohrn et al., 2012; Rapp et al., 2012) found that a predominantly left-lateralized network is important for processing metaphorical or figurative expressions relative to literal expressions within a sentence (Mashal, Faust, Hendler, & Jung-Beeman, 2009; Rapp et al., 2004) and within a naturally evolving coherent story (Eviatar & Just, 2006; Nagels et al., 2013). The identified areas included the bilateral IFG, left STG, and MTG, as well as medial prefrontal, superior frontal, parahippocampal regions. Our imaging findings overlapped extensively with these results.

The increase in bilateral IFG activity may reflect higher cognitive demands to integrate nonliteral meanings, as opposed to literal ones, into a sentence context or regulate the meaning selection and evaluation during metaphor comprehension, as it is necessary to decide whether the meaning of a word is intended to be literal or nonliteral (Rapp, Erb, Grodd, Bartels, & Markert, 2011; Rapp et al., 2012). However, IFG activation also is found in insightful problem solving, such as in chunk decomposition task (Huang et al., 2015) and compound remote-associates task (Kizilirmak et al., 2016). Because such trials require semantic selection to obtain an insightful solution, IFG stimulation may primarily reflect the breaking of mental set and set shift (Shen, Luo, Liu, & Yuan, 2013).

Concerning the left STG/MTG, these regions are situated in an anatomically key position within the language network; they have numerous connections to other cortical association areas, which were found by the meta-analyses to be activated by nonliteral and literal stimuli (Rapp et al., 2012). The left STG/MTG therefore composes a crucial part of the network for nonliteral meaning comprehension and is considered to be important for coherence analysis and processing of texts (Ferstl, Neumann, Bogler, & Von Cramon, 2008). Past studies also have observed the activation of the left STG/MTG in a reappraisal condition of emotion regulation (Buhle et al., 2014; Ochsner, Silvers, & Buhle, 2015). A mechanism of reappraisal may depend on retrieval processes, which search semantic memory for concepts that can be used to reinterpret some part of the current situation (Ochsner et al., 2015; Ohlsson, 1984). More importantly, some studies on insight found that insightful events were accompanied with increased activity in the left or right STG using the compound remote associates task (Jung-Beeman et al., 2004; Subramaniam, Kounios, Parrish, & Jung-Beeman, 2009; Tik et al., 2018). The present study was consistent with these findings. We speculated that the STG was involved in actively semantic retrieval processes, which were required for the insightful problem solving both in the compound remote associates task and in the metaphorical solution to mental distress problem.

Neural network involved in insight processing

Relative to literal and problem-restatement solutions, we observed greater activation in the bilateral hippocampus, amygdala, and fusiform gyrus. We also observed greater insightfulness and novelty experiences but indiscriminate emotional valence and adaptability experiences. This result was consistent with previous findings (Jiang et al., 2016; Yu et al., 2016). More importantly, we found significant correlations between the insightfulness experiences and the activities in hippocampus and amygdala.

Several investigations have observed the activation of the hippocampus, amygdala, and fusiform gyrus at the moment insight occurs. There is good consistency with previous studies on insight. Using a brain teaser riddle paradigm, Luo and Niki (2003) revealed that the hippocampus and fusiform gyrus were significantly activated by induced insight. The hippocampus and amygdala were found to be activated by induced insight in the interactive effect between novelty and appropriateness using the chunk decomposition paradigm (Huang et al., 2015), with other perceptual paradigms (Kizilirmak et al., 2016; Ludmer et al., 2011), and with Chinese “chengyu” riddles (Zhao et al., 2013). In addition, activation of the hippocampus was observed following self-generated insight via a compound remote associates task (Tik et al., 2018). Moreover, human subjects remember contexts in which they had an “aha” experience better than those in which they failed to feel the “aha” effect (Auble, Franks, & Soraci, 1979; Jung-Beeman et al., 2004; Wills, Soraci, Chechile, & Taylor, 2000). Amygdala activity could predict the consolidation of insightful solutions in long-term memory (Kizilirmak et al., 2016; Ludmer et al., 2011). Therapeutic metaphor solution in a psychotherapy scenario produced an obvious insight process similar to previous insight studies and may improve memory, relative to literal solutions.

One reason that therapeutic metaphors promote insight is the novelty of metaphorical expression. Specifically, metaphorical expression enables the person to access a conceptual structure common to an abstract concept of problem and a remote, imaginative concept of metaphorical solution (Landau, Arndt, & Cameron, 2018), but literal expression does not have this function. Similarly, hippocampus and amygdala activation also has been revealed in novelty detection. It was found that these areas feature higher activity in response to novel stimuli in comparison to common stimuli when using emotional images (Blackford, Buckholtz, Avery, & Zald, 2010; Weierich, Wright, Negreira, Dickerson, & Barrett, 2010). The hippocampus and amygdala also respond selectively to conditions containing semantic and perceptual novelty (Hashimoto et al., 2012; Henke, Weber, Kneifel, Wieser, & Buck, 1999; O'Kane, Insler, & Wagner, 2005; Poppenk et al., 2008; Stoppel et al., 2009; Strange & Dolan, 2001; Strange, Fletcher, Henson, Friston, & Dolan, 1999). In the present study, metaphorical solutions related a novel concept (“playing bad cards well”; see the example in Materials) to an old concept (major) by identifying similarities between them. In contrast, literal expressions did not introduce a new concept: its solution was limited to the old concept (major). Metaphorical solutions thus produced a greater change in cognitive representation than literal solutions; novelty largely accounts for this difference. Ohlsson (1984) suggested that whether a representational change results in a goal state (insight) mainly depends on the distance between the current state and the goal state in the new representation. In this regard, the novelty of metaphorical solutions contributed much to induce insight.

Another reason may be that problem context of mental distress could enhance the effect of metaphorical solution, because problem context increased the practicality of metaphorical solution. First, insightful problem solving is a classic type of creative thinking in which originality (novelty) and practicality (adaptability) are both key factors (Abraham & Windmann, 2007; Huang et al., 2015; Huang, Tang, Sun, & Luo, 2018). Huang et al. (2015) used an insight task of chunk decomposition that systematically varied in novelty and appropriateness (practicality) to elucidate their neural mechanisms. The study associated the activations of the hippocampus and amygdala with the interactive effect of novelty and appropriateness, indicating both high novelty and high appropriateness were important to produce insight. The present study replicated their findings in hippocampus and amygdala. Second, the activations of hippocampus and amygdala had been rarely found in the meta-analysis studies on metaphor (Bohrn et al., 2012; Rapp et al., 2012); one reason may be the absence of the context in the metaphorical expression. In contrast, this study first presented a context of mental distress problem and then a metaphorical solution, this created a sufficiently complete conception of the problem and allowed the problem solver to address the problem and then apply schematic knowledge of metaphorical solution to think through the target problem (Keefer & Landau, 2016). Third, the above discussions revealed a limitation of the present study: it did not manipulate the level of adaptability on insight processing. Metaphorical solutions thus featured high novelty and high adaptability; literal solutions, low novelty, and high adaptability; and problem-restatement solutions, low novelty, and low adaptability. Regardless, we still observed the activations of hippocampus and amygdala in response to metaphorical solutions, which was highly consistent with Huang et al. (2015). Metaphorical solutions therefore meet the requirements of creativity in that they are both novel and adaptive. However, future studies should adjust the adaptability of metaphorical and literal solutions to identify corresponding differences in the elicited neural activity.

Regarding the functions of the hippocampus and amygdala, they appeared to work synergistically, but their roles in insightful problem solving were not necessarily the same. Although the hippocampus often is regarded as solely associated with long-term declarative memory, it contributes to creative thinking. For example, Luo and Niki (2003) found that a riddle paradigm elicited activity in the hippocampus, suggesting that the hippocampus participates in insightful problem solving by forming new association and breaking mental fixation. Participants with hippocampal damage performed less well than healthy controls on subcomponents of the Torrance tests of creative thinking (Duff, Kurczek, Rubin, Cohen, & Tranel, 2013) and on the compound remote associates task (Warren, Kurczek, & Duff, 2016). Moreover, it was suggested that the relational binding and representational flexibility provided by the hippocampal declarative memory system positions the hippocampus as a key contributor to language use and processing (Duff & Sarah, 2012). These findings indicate that hippocampal-dependent declarative memory is necessary for creative problem solving. By comparison, the amygdala has been demonstrated to involve enhancing the processing of emotionally arousing stimuli and modulating enhanced memory consolidation with the hippocampus (Girardeau, Inema, & Buzsáki, 2017; Yang & Wang, 2017). Thus, metaphorical solutions containing creative and emotional information would activate both the hippocampus and amygdala, as well as promote long-term memory to yield an effective therapeutic impact.

In addition, we observed that the presentation of metaphorical solutions activated the right lingual and occipital gyri. However, this region has received scant attention in past research. As increased activity in the lingual gyrus has been observed using an insight task with heuristic prototypes, it has been suggested that it might be involved in forming novel associations using heuristic information (Luo et al., 2013).

Common neural network involving insightful and non-insightful processing

A shared neural network may be involved in the processing of metaphorical and literal solutions, as opposed to that which underlies problem-restatement solutions. The left cingulate cortex is primarily recruited in task-related goal directing and governing, the superior parietal lobule and cuneus may in reorientation or visual-spatial information processing, and the left insula in emotional valence (Shen et al., 2013). The left postcentral and precentral gyri may reflect semantic processing associated with sensory and motor components (Rapp et al., 2012). All of these regions may contribute to a general cognitive process that may or may not lead to insight.

Conclusions, implications, and limitations

The present study explored whether metaphorical solutions to mental distress problems would yield a salient insight process and activate a neural network similar to that which underlies insightful problem solving. To compile experimental stimuli, we designed text-based micro-counseling dialogue scenarios that reflected psychotherapy; they resembled real-life situations and therefore featured high ecological validity in eliciting the insight process. The results revealed that metaphorical solutions in a psychotherapy context activated a neural network that included the hippocampus and amygdala; this system overlaps extensively with the network of insightful problem solving. These findings have important implications. This study found that therapeutic metaphor in the problem-solution scenario could produce higher insightfulness, and there were significant correlations between insightfulness experience and activations of hippocampus and amygdala. This implied that therapeutic metaphor got a strong semantic encoding and then could have potential in maintaining a longer time and producing a better intervention effect. This effect may be a lie in that metaphorical solutions could trigger a creative cognitive process by activating widely separated areas of long-term memory and combining normally unassociated concepts. Therefore, the use of therapeutic metaphor should be emphasized in the psychotherapy.

There are some limitations in the presented study. First, to obtain both detailed information of subjective experiences and high quality of fMRI data, we let participants assess materials after the fMRI scanning rather than make real-time ratings during the fMRI scanning. The ratings obtained from the second reading may have some differences from the first reading due to the changes of familiarity, so it should be cautious to understand their correlative analysis with neural activities. Second, the same metaphor might have a different impact on the individuals from different cultural background due to the limitation of cultural appropriateness. For instance, in the example of college major, in America, it could be a good idea to offer help in dealing with either frustration or choosing a major that one likes. But in China, due to the specific enrollment system, some Chinese college students are not matriculated by the majors they declare and are transferred to other majors they do not like so much. The present study focused on offering help in realizing the future may not only depend on what you have but on what you do.