Humans possess the incredible ability to identify words, irrespective of font type, size, or relative letter position (for examples, see, e.g., Dehaene, Cohen, Sigman, & Vinckier, 2005; Velan & Frost, 2007). Most models of visual word recognition attempt to explain these (and other) phenomena by considering multiple levels of activation, including orthographic, lexical, phonological, and semantic (e.g., Allen, Smith, Lien, Kaut, & Canfield, 2009; Balota, Yap, & Cortese, 2006; Coltheart, Rastle, Perry, Langdon, & Ziegler, 2001; McClelland & Rumelhart 1981). These models typically assume a continuous flow of information from recognizing orthographic form to retrieving meaning, in either a cascaded or a parallel processing mode. Despite diverse assumptions proposed in variations of those models, they generally agree that visual word recognition is mediated by the perceptibility of words (see Lupker, 2005, for a review).

A study by Perea , Duñabeitia, and Carreiras (2008) demonstrated that visual word recognition is not disrupted by displaying words with characters that share similar configural features with the letters that constitute the words. Perea et al. used LEET stimuli, in which certain letters are replaced by letter-like digits (e.g., the digit “3” for the letter “E,” or R34D1NG instead of READING). They argued that if detecting the shapes of individual letters is sufficient to produce lexical activation, then letter-like digits or letter-like symbols should produce lexical activation just like regular words. To test this claim, Perea et al. used a masked-priming paradigm. They presented a forward mask (a row of #s) for 500 ms, followed by a 50-ms prime in the center location. Immediately after the offset of the prime, the target appeared in the same location and remained on screen until participants made a response. To reduce the spatial overlap between prime and target, Perea et al. presented the primes and targets in different font sizes (10-point Courier vs. 12-point Courier, respectively). Participants were to determine whether the target was a word (a lexical-decision task). In Experiment 1, the prime was either the same target word (e.g., MATERIAL for the target MATERIAL), the corresponding LEET for that target (e.g., M4T3R14L), or stimuli with letters replaced with corresponding symbols (letter-like symbols; e.g., M∆T€R!∆L) or control letters (replacing the original letters with other letters, randomly selected; e.g., MOTURUOL for the target MATERIAL).

Perea et al. (2008) found that response times (RTs) to the target word were significantly faster when the word was primed by LEETs and symbols than when it was primed by the control letters (i.e., a priming effect). Most importantly, both LEETs and symbols produced mean RTs similar to those for the target word prime. They further found, in Experiment 2, that both LEETs and symbols produced faster RTs than their corresponding control conditions, in which other non-letter-like numbers and symbols, respectively, were used (e.g., M6T2R76L for the target MATERIAL in the control LEET condition; MT%R?L in the control symbol condition). They concluded that “it is visual similarity rather than the status of the leet digits as numbers that seems to be responsible for the leet priming effect” (p. 239).

Perea et al.’s (2008) findings with the masked-priming paradigm seem to support the conclusion that, in the absence of a top-down context, letter-like digits (and symbols) embedded in a word are sufficient to trigger lexical activation. Nevertheless, some studies have suggested that priming effects result from prelexical processing (i.e., affecting stages prior to lexical activation; see Hutchison, Neely, Neill, & Walker, 2004; Masson & Isaak, 1999). In other words, the similarity of the visual shapes of LEET stimuli to actual letters may have affected only orthographic, but not lexical, processing of the target word. Evidence favoring the sublexical account mainly comes from the finding of similar repetition priming effects for words and nonwords in the lexical decision task, even though nonwords should not benefit from lexical processing of primes (e.g., Masson & Isaak, 1999). Thus, the advantage for LEET stimuli over the nonword control primes (e.g., M4T3R14L vs. MOTURUOL for the target word MATERIAL) in Perea et al. could arise merely from less visual impairment in the LEET condition than in the other stimulus conditions (e.g., Davis & Lupker, 2006; Horemans & Schiller, 2004; see Forster, 1998, for a review).

Because the priming effect with LEET stimuli observed in Perea et al. (2008) cannot be unambiguously attributed to lexical activation, this highlights a need for converging evidence. Because it is unclear whether priming results from lexical activation, we instead examined the semantic activation of LEET stimuli using a category judgment task—determining whether a single stimulus (either a word or a LEET stimulus) was related or unrelated to the category provided prior to that block. It has been suggested that semantic-relatedness judgments, regardless of whether they are related or unrelated, are a result of lexical activation (e.g., Besner, Smith, & MacLeod, 1990). Accordingly, if we were to find any evidence of semantic activation, we could infer the occurrence of lexical activation.

In addition to examining semantic activation using behavioral data (e.g., RTs), we used event-related potential (ERP) measures. ERPs provide online, continuous measures of meaning extraction and often reveal evidence of deeper processing than is apparent in behavioral data. For instance, Heil, Rolke, and Pecchinenda (2004) found a modulation of ERP amplitudes by the semantic relatedness of prime and probe words, even when there was no semantic priming effect in RTs (e.g., Rolke, Heil, Streb, & Hennighausen, 2001; see also Vogel, Luck, & Shapiro, 1998, for an example of ERPs elicited by semantic activation even when participants could not report the targets in the attentional blink task). By examining the ERP components associated with semantic activation, it is possible to determine whether LEET stimuli trigger semantic activation just like regular words do.

We used the N400 component, a negative-going brain potential that occurs around 400 ms after the onset of potentially meaningful stimuli (e.g., words). This component is also called the mismatch negativity, because it is known to be elicited when a person notices a stimulus that is incongruent with the current semantic context (see Kutas & Van Petten, 1988, for a review). For instance, after one sees the word SPORTS, the word APPLE (unrelated) would elicit a much larger negative ERP than would the word TENNIS (related) 400–600 ms after word onset. Therefore, the N400 effect can be quantified as the average difference in brain potentials between words that are related and unrelated to the current semantic context (N400 effect = unrelated word ERPs – related word ERPs).

The critical point is that the N400 effect can be used as an indicator that a person has extracted a word’s meaning. That is, the N400 effect occurs only when participants detect a semantic mismatch, indicating processing even deeper than lexical activation. It has been suggested, in fact, that the N400 effect is a more sensitive measure of semantic activation than are behavioral measures (e.g., Heil et al., 2004; Rolke et al., 2001; Vogel et al., 1998). As a concrete example, Lien, Ruthruff, Cornett, Goodin, and Allen (2008) used N400 effects to determine whether people can extract word meaning while central attention is devoted to another task. Participants performed a tone–pitch Task 1 and a semantic-relatedness Task 2 (i.e., whether the Task 2 word was related to a previously presented context word/category). The critical finding was that the N400 effect declined sharply under dual-task conditions. They concluded that semantic activation of visual words is impaired while central attention is allocated to another task.

The present study

In the present study, we used ERPs (i.e., the N400 effect) to assess whether semantic activation occurs for LEET stimuli just as strongly as it does for regular words. Thus, regular words were included to provide a baseline for semantic activation. We adopted Lien et al.’s (2008) category judgment task, in which participants memorized a category name prior to each block (e.g., fruit) and then determined whether a series of single stimuli (e.g., APPLE for the regular word or 4PPL3 for the LEET stimulus) were related or unrelated to that category. Stimulus type (word vs. LEET) varied within blocks.

Our main interest was the semantic activation of words and LEET stimuli, as indicated by the N400 effect (the difference in ERPs between unrelated and related words). The semantic relatedness effect (the difference between unrelated and related words) on RTs does not provide a clear picture regarding semantic activation for words and LEET stimuli, since related and unrelated responses are made with different response fingers. Thus, the effect on RTs could reflect a modulation of response decision processes (see the Discussion regarding the problems with interpreting semantic activation using behavioral data). Furthermore, since we compared semantic activation between words and their corresponding LEET stimuli, word frequency and word length should have little influence on the comparison between them.

We expected to observe a large N400 effect for regular words, as had previously been shown in single-task conditions (e.g., Lien et al., 2008). The main question for the present study was whether similar N400 effects would also be observed for LEET stimuli formed by letters and digits. If LEET stimuli are processed like real words (i.e., if they can access word meaning), then LEET stimuli should produce N400 effects similar to those from regular words. Such a result not only would imply semantic activation for the LEET stimuli, but also would provide converging evidence for Perea et al.’s (2008) claim of full lexical activation for LEET stimuli.

Method

Participants

A group of 24 undergraduates (native English speakers) at Oregon State University participated in this experiment. The data from two of these participants were excluded due to excessive eye movement artifacts in the electroencephalographic (EEG) data (see below). The remaining 22 participants (16 females, six males) had a mean age of 20 years (range: 18–27).

Apparatus, stimuli, and procedure

Stimuli were displayed on a 19-in. monitor and were viewed from a distance of about 55 cm. The category presented prior to each block was in lowercase, whereas the stimulus for each trial was a string of uppercase letters (words) or a mixture of letters and digits (LEET stimuli) printed in white, against a black background, in the center of the screen. LEET stimuli were formed by changing some letters in a word using digits that were similar in shape to their corresponding uppercase letters. That is, the digit 1 was used for the letter L, the digit 3 for the letter E, the digit 4 for the letter A, the digit 5 for the letter S, the digit 6 for the letter G, and the digit 0 for the letter O. The digit substitutions ranged from 30% to 64% (mean: 43%) across letter positions. Each letter and digit subtended a visual angle of approximately 1.15º in width × 1.25º in height.

Each trial started with a fixation cross in the screen center for 1,200 ms, which was then replaced by the stimulus until a response was made. Next, auditory feedback (a tone on error trials, silence on correct trials) was presented for 100 ms. The next trial then began with the fixation display. The participant’s task was to indicate whether the stimulus was related or unrelated to the category for that block by pressing the leftmost response-box button for related and the rightmost button for unrelated stimuli. They were also told that some words would be formed by digits and letters, which they should treat like regular words and determine the semantic relatedness to the current category. Speed and accuracy were emphasized equally.

A total of 20 categories were used for the experimental blocks, and two categories were used for the practice blocks, taken from Lien et al. (2008; see the Appendix for the complete list).Footnote 1 Each participant performed two sessions. The first session contained one practice block of 36 trials, randomly selected from the two practice-block categories, followed by 20 experimental blocks of 36 trials each (nine related words, nine related LEETs, nine unrelated words, and nine unrelated LEETs, randomly determined). The second session consisted of the same 20 categories of experimental blocks as the first session, except that the related/unrelated words and LEET stimuli were different. The order of categories for the experimental blocks was randomly determined for each participant. For each participant, each word and LEET stimulus appeared twice throughout the whole experiment—once for the related list and once for the unrelated list, in a random order. Participants completed these two sessions within a single visit to the lab and were given breaks between blocks and sessions.

EEG recording and analyses

Electroencephalographic (EEG) activity was recorded from electrodes F3, Fz, F4, C3, Cz, C4, P3, Pz, P4, O1, and O2. These sites and the right mastoid were recorded in relation to a reference electrode at the left mastoid. The EEGs were then re-referenced offline to the average of the left and right mastoids. A horizontal electrooculogram (HEOG) was recorded bipolarly from electrodes at the outer canthi of both eyes, and a vertical electrooculogram (VEOG) was recorded from electrodes above and below the midpoint of the left eye. The electrode impedance was kept below 5 kΩ. EEG, HEOG, and VEOG were amplified using Synamps2 (Neuroscan) with a gain of 2,000 and a bandpass of 0.1–70 Hz, and were digitized at 250 Hz.

Trials with possible ocular and movement artifacts were identified using a threshold of ±75 μV for a 1,400-ms epoch beginning 200 ms before stimulus onset to 1,200 ms after stimulus onset. Each of these artifact trials was then inspected manually. This procedure led to the rejection of 7% of the trials, with no more than 25% rejected for any individual.

Results

In addition to trials with ocular artifacts, trials were excluded from analyses of the behavioral data (RTs and proportions of errors) and the ERP data if the RT was less than 100 ms or greater than 2,000 ms (0.4% of trials exceeded these cutoff values). Incorrect-response trials were also excluded from the RT and ERP analyses. Analyses of variance (ANOVA) were used for all statistical analyses. Whenever appropriate, p values were adjusted using the Greenhouse–Geisser epsilon correction for nonsphericity.

Behavioral data analyses

The ANOVAs on RTs and proportions of errors were conducted as a function of stimulus type (word vs. LEET) and semantic relatedness (related vs. unrelated). Table 1 shows the mean RTs and proportions of errors for each condition. The analyses revealed that the overall RT was 41 ms longer for LEET than for word stimuli, F(1, 21) = 103.76, p < .0001, η p 2 = .83, and the mean RT was 44 ms longer for unrelated stimuli (677 ms) than for related stimuli (633 ms), F(1, 21) = 44.62, p < .0001, η p 2 = .68. The semantic-relatedness effect (unrelated – related) on RTs was larger for LEET stimuli than for words (58 vs. 29 ms, respectively), F(1, 21) = 22.91, p < .0001, η p 2 = .52. Further t test analyses revealed that the semantic-relatedness effect was significant for both LEET stimuli, t(21) = 7.31, p < .0001, and words, t(21) = 4.58, p < .001.

Table 1 Mean response times (in milliseconds) and proportions of errors as a function of semantic relatedness (related vs. unrelated) for word and LEET stimuli

For the proportion-of-error data, LEET stimuli produced higher error rates than did word stimuli (.085 vs. .072), F(1, 21) = 27.73, p < .0001, η p 2 = .57. The error rate was also higher for related (.100) than for unrelated (.056) stimuli, F(1, 21) = 28.83, p < .0001, η p 2 = .58. However, the semantic-relatedness effects on error rates were similar for LEET stimuli and for words (−.047 vs. –.041, respectively), F < 1.0. As in the RT data, further t test analyses revealed that the semantic-relatedness effects were significant for both LEET stimuli, t(21) = 5.08, p < .0001, and words, t(21) = 4.97, p < .0001.

ERP analyses

The averaged ERP waveforms were time-locked to stimulus onset. For each stimulus type (word vs. LEET), difference waves were constructed by subtracting the ERP waveforms elicited by stimuli related to the category from the ERP waveforms elicited by stimuli unrelated to the category (i.e., the N400 effect). We collapsed across the three frontal electrode sites (F3, Fz, and F4), the three central electrode sites (C3, Cz, and C4), and the three parietal electrode sites (P3, Pz, and P4). Following Lien et al. (2008; see also Vogel et al., 1998), the mean amplitude of the N400 effect was measured from 400 to 600 ms after stimulus onset, relative to the 200-ms baseline period before stimulus onset. This is the time window during which the N400 effect is typically maximal (see Fig. 1). An ANOVA on the N400 amplitudes (difference waveforms = unrelated ERPs – related ERPs) was conducted as a function of stimulus type (word vs. LEET) and electrode site (frontal [F3, Fz, F4] vs. central [C3, Cz, C4] vs. parietal [P3, Pz, P4]). Figure 2 shows the grand average waveforms for related and unrelated stimuli for these electrodes, and Fig. 3 shows the mean N400 amplitudes.

Fig. 1
figure 1

Scalp topographies of the N400 effect (difference = unrelated ERPs – related ERPs) for word and LEET stimuli during the time window 400–600 ms after stimulus onset. ERP: event-related potential

Fig. 2
figure 2

Grand average event-related brain potentials as a function of semantically related and unrelated word and LEET stimuli at frontal electrode sites (data collapsed across F3, Fz, and F4), central electrode sites (data collapsed across C3, Cz, and C4), and parietal electrode sites (data collapsed across P3, Pz, and P4). Negative is plotted upward, and time zero represents stimulus onset. The baseline period was 200 ms prior to stimulus onset.

Fig. 3
figure 3

Grand average differences in event-related brain potentials, formed by subtracting semantically related trials from semantically unrelated trials (i.e., the N400 effect), for word and LEET stimuli at frontal electrode sites (data collapsed across F3, Fz, and F4), central electrode sites (data collapsed across C3, Cz, and C4), and parietal electrode sites (data collapsed across P3, Pz, and P4). The unfilled rectangular boxes indicate the time window used to assess the N400 effect (400–600 ms after stimulus onset). Negative is plotted upward, and time zero represents stimulus onset. The baseline period was 200 ms prior to stimulus onset.

The N400 effects were similar for both words and LEET stimuli (−3.568 vs. –3.848 μV, respectively), F(1, 21) = 1.51, p = .2329, η p 2 = .03. Further t test analyses revealed that the N400 effect was significantly greater than zero for both words, t(21) = −12.88, p < .0001, and LEET stimuli, t(21) = −13.35, p < .0001. Although the overall N400 effect was much larger for the central sites (−4.050 μV) than for the frontal and parietal sites (−3.585 vs. –3.488 μV), F(2, 42) = 7.23, p < .01, η p 2 = .26, the difference in the N400 effects between words and LEET stimuli did not interact with electrode site, F < 1.0.

Discussion

In the present study, we used the N400 effect to examine whether semantic activation occurs for LEET stimuli (formed by letters and digits) just as strongly as for regular words. In each trial, participants made a category judgment on either the LEET stimuli or regular words (formed by uppercase letters), intermixed within blocks. An advantage of using a single-stimulus presentation in our study is that it eliminated unwanted effects of extraneous stimuli, such as the possible forward-masking interference produced by the priming paradigm (e.g., Forster, 1998). The critical finding was that the N400 effects at all three electrode sites (frontal, central, and parietal) were similar in magnitude for both LEET and regular words, suggesting that semantic activation occurs for LEET stimuli just as strongly as for regular words. Since it has been suggested that semantic activation is a result of lexical activation (e.g., Besner et al., 1990), we concluded that, in the presence of a top-down context (e.g., semantic categories), letter-like digits embedded in a word are able to activate both lexical and semantic information.

Another interesting finding of the present study was the apparent trend toward earlier N400 onsets for word than for LEET stimuli (see Fig. 3). Thus, even though the digits in LEET stimuli can be encoded in a letter-like manner and subsequently may activate semantic information, the regular words have faster access to the mental lexicon due to greater familiarity. The behavioral data (e.g., RTs) are also consistent with this claim, showing that faster semantic-relatedness judgments occur for words than for LEET stimuli. This finding seems to suggest that processing stages prior to semantic activation (e.g., encoding) might be delayed for LEET stimuli, but that semantic analysis of the encoded letter representations is not affected at all.

We noted earlier that the use of behavioral data (e.g., RTs) opens the door to numerous alternative explanations for the priming effect with LEET stimuli, such as prelexical processing or less-disruptive visual similarity between prime and target, rather than the lexical-activation interpretation favored by Perea at al. (2008). Likewise, our behavioral data may not provide clear evidence for semantic activation in LEET stimuli. We found a positive semantic-relatedness effect on RTs (slower for unrelated than for related stimuli), but a negative effect on errors (smaller error rates for unrelated than for related stimuli), indicating a speed–accuracy trade-off. This finding underscores another major limitation of behavioral measures—they are sensitive to response bias (e.g., Green & Swets, 1966). On the positive side, however, it also highlights one major advantage of N400 measures, as we used in the present study; that is, they more specifically reflect the buildup of semantic activation, with little constraint from decision making (e.g., Heil et al., 2004; Rolke et al., 2001; for a review, see Kutas & Federmeier, 2011).

Our N400 data provided clear evidence for semantic activation in LEET stimuli. Alternatively, the activation could be due to top-down expectancy, since participants already knew what category to look for in each block. Although we always presented a single stimulus (either a LEET stimulus or a regular uppercase word) in each trial, those stimuli were assessed in terms of their fit within a semantic category presented prior to each block. In this light, the finding that LEET stimuli elicited an N400 effect similar in magnitude to that for regular words suggests that the N400 effect is not an indicator of the perceptual integration of lexicality (because the mixture of digits and letters in LEET stimuli would have disrupted the encoding of lexical representations, with little semantic information being activated). Rather, it more likely reflects a postlexical process that is driven by integration of a context, such as categories (e.g., Brown & Hagoort, 1993; Kutas & Federmeier, 2011; but see Deacon, Hewitt, Yang, & Nagata, 2000, for a case in which the N400 effect occurred without conscious awareness of word identity).

One constraint of our study is that the effects of letter versus digit position in LEET stimuli and the number of digit replacements in the LEET stimuli may have played roles in semantic activation.Footnote 2 For instance, as was observed by Jordan, Thomas, Patching, and Scott-Brown (2003), when words were presented in passages of text, visually degrading the exterior letters of a word (the first and last positions, such as “d_ _k” for the word “dark”) slowed reading rates more than did visually degrading interior letters (all letter positions that lay between the first and last positions, such as “_ar_” for the word “dark”). They therefore argued that the exterior letters of words play a major role in visual word recognition (see also Jordan, Thomas, & Scott-Brown, 1999, with presentation of single words rather than passages). In the present study, about 55% of the LEET stimuli replaced at least one of the exterior letters with a digit (30% and 34%, respectively, of the first and last letters were replaced, and both positions were replaced for 9% of stimuli). If semantic access was restricted to digit replacements only in interior locations or only in exterior locations, one would expect the N400 effect elicited by the LEET stimuli to be smaller than the effect elicited by regular words. In contrast to this prediction, our results showed that the N400 effect for LEET stimuli was about 8% larger in magnitude than the effect for regular words. Thus, the present N400 effect for LEET stimuli was not due solely to an “exterior-letter effect.” The present LEET results, along with Jordan and colleagues’ earlier letter-position work and the case-mixing results of Allen et al. (2009) and Lien, Allen, and Crawford (2012), suggest that “coarse-scale” information in words (i.e., the cursory shape of the whole word; Jordan et al., 2003) is involved in visual word recognition in addition to individual letter identities. This appears to be the case, because all of these studies suggest that the physical identity of individual letters is not the only information used to encode words.

In sum, we have demonstrated that LEET stimuli gain lexical access and activate semantic information using electrophysiological measures (i.e., N400 effects). It is clear that the exact identity of the components in a word, such as digits instead of letters, does not hinder semantic activation, even though the encoding is slower for LEET than for regular words. Thus, we extended Perea et al.’s (2008) priming results to a category task and showed that the processing equivalence applies even to deeper levels of word processing (i.e., access to word meaning).