Assessing students' executive functions in the classroom: Validating a scalable group-based procedure

doi:10.1016/j.appdev.2017.03.003

Journal of Applied Developmental Psychology

Volume 55, March–April 2018, Pages 4-13

https://doi.org/10.1016/j.appdev.2017.03.003

Abstract

We describe and validate a novel, scalable, group-based assessment of executive functions (EFs) in a classroom setting using tablet computers. Relative to the conventional method of a more controlled, one-on-one individual assessment (IA), the group assessment (GA) can be administered quickly to many students, requires less training for assessors, and measures performance in a naturalistic classroom setting. In a socioeconomically and ethnically diverse sample of 269 students in third through fifth grade, we show that IA and GA scores for the same tasks were highly inter-correlated, equally reliable, and showed analogous associations with known EF covariates. IA and GA scores independently predicted teacher-rated self-regulated classroom behavior and standardized test scores. Further, only the GA score emerged as a unique predictor of academic achievement when controlling for prior achievement. We are sharing the tablet apps, source code, and supporting materials for this GA procedure at no cost under an open-source license.

Introduction

Executive function (EF) skills have been linked to various educational outcomes, including specific academic skills, school engagement, and self-regulated classroom behaviors (Diamond, 2013; Obradović et al., 2012). However, the conventional approach to EF assessment is to measure children's performance on standard EF tasks in a highly controlled, laboratory-like setting, typically with a ratio of one child to one assessor. This approach lacks the ecological validity of assessment in a classroom setting, where children practice and apply EF skills daily, and does not scale well for collecting data from a large number of students. We developed a new procedure to simultaneously assess EF skills in all students in a classroom using standard EF tasks administered on tablet computers. The goals of the current study are to validate this new assessment by: (1) examining convergent validity with conventional individual assessment procedures; (2) comparing students' EF performance across group and individual assessment settings; (3) comparing associations of EFs with known demographic and educational covariates across the two assessment settings; and (4) investigating the predictive validity of EF skills assessed in group versus individual assessment settings for teachers' reports of students' self-regulated classroom behaviors and their academic achievement on standardized tests.

EFs are a set of higher-order cognitive skills that enable children to inhibit their impulses, control inappropriate behaviors, ignore distractions, hold and manipulate information in the mind, and shift between competing rules or attentional demands. As such, EF skills are implicated in many aspects of school success. Over the last decade, researchers have linked direct assessments of EF skills to teachers' reports of students' self-regulated classroom behaviors, such as their ability to follow instructions, stay focused on tasks, and work collaboratively with peers (Ciairano et al., 2007; Diamond, 2013; Obradović et al., 2012; Rimm-Kaufman et al., 2009). However, most of these studies have been conducted in early childhood. Researchers working with this age group often employ a composite of EF tasks tapping into multiple EF components (Fuhs et al., 2015; Neuenschwander et al., 2012; Sasser et al., 2015). More research is needed to understand how similar direct assessments of EFs relate to self-regulated classroom behaviors in middle childhood.

In addition to their role in promoting self-regulated behaviors, EF skills also contribute directly to academic performance. For example, solving math problems requires children to flexibly shift attention between different strategies and to manipulate and update information in working memory (Blair, Ursache, Greenberg, Vernon-Feagans, & Family Life Project Investigators, 2015). Although empirical evidence is most robust for the association between working memory and math skills (Bull & Lee, 2014; Jacob & Parkinson, 2015), meta-analytic studies have demonstrated that direct assessments of inhibitory control (Allan, Hume, Allan, Farrington, & Lonigan, 2014), working memory (Friso-van den Bos, van der Ven, Kroesbergen, & van Luit, 2013), and cognitive flexibility (Yeniad, Malda, Mesman, van IJzendoorn, & Pieper, 2013) are all associated with children's performance on literacy and math achievement tests.

Researchers often assess children's EF skills in university-based laboratory settings using a battery of developmentally appropriate standard tasks administered by a highly trained research assistant (Carlson, 2005; Kochanska & Knaack, 2003). There are also many school-based studies, but these typically mimic a laboratory setting: researchers take children out of their classrooms to be assessed one-on-one in a quiet space such as a library room (Blair & Razza, 2007; Raver et al., 2011; Schmitt et al., 2015; Weiland et al., 2014). The assessor works closely with the child to explain the task instructions, provide guidance and feedback during practice trials, and ensure focused completion of the test trials. This context provides many external motivators (both intentional and inadvertent) for children to perform well on EF tasks, motivators that are not normally present in the classroom. Assessors are trained to establish good rapport with participants and express a caring and affirmative demeanor. They provide positive encouragement during practice trials, physical proximity during test trials, and praise after the task is completed. This individualized attention may motivate some children to try harder to perform well on the tasks and may contribute to artificially inflated EF performance that does not reflect the child's ability to engage EF skills in a more natural setting. Conversely, some children may be more comfortable in the classroom or better motivated by the presence of peers and teachers, and thus may underperform in a laboratory setting. Individual assessment minimizes the external distractions and interpersonal dynamics present in the classroom, and it provides controlled testing conditions that include constant monitoring and timed positive feedback (Silver, 2014), but it lacks ecological validity.

Ecological validity is an aspect of research design that refers to the similarity between the participants, materials, and settings used in a study and the real-world context under investigation (Shadish, Cook, & Campbell, 2002). By better aligning the assessment context with real-world conditions in which children employ their EF skills, researchers can improve the ecological validity of EF assessments (McCabe et al., 2000; Sbordone, 2001). Specifically, assessing EF skills in a classroom setting, with its naturally occurring distractors and motivators, will yield a more ecologically valid measure of EF skills. It may also improve the predictive validity of directly assessed EF skills for students' self-regulated classroom behavior and measures of academic achievement such as performance on standardized tests.

As educators and policymakers debate the merits of assessing student progress using measures of socioemotional learning (Campbell et al., 2016; Duckworth & Yeager, 2015; Ursache et al., 2012; West, 2016), researchers need to create valid, pragmatic, and cost-effective ways of assessing EFs at scale. Although teacher report on questionnaire measures of EF has consistently been found to predict children's academic achievement (Allan et al., 2014; McClelland et al., 2006), teacher report has several known limitations. First, teacher report of student behavior can be subject to a “halo” effect (Nisbett & Wilson, 1977), where the respondent's general impression of the child's overall functioning biases the report of specific skills. This may be exacerbated when teachers are required to rapidly evaluate and compare many students. There is also evidence of systematic racial and gender bias in teachers' reports (McKown & Weinstein, 2008; Ready & Wright, 2011). Moreover, when asked to consider students' self-regulation, teachers may find it difficult to differentiate between EFs and related constructs such as conscientiousness (Eisenberg, Duckworth, Spinrad, & Valiente, 2014). Further, questionnaire items tend to capture broad behavioral markers of self-regulation and composites tend to have positively skewed distributions, with many students scoring at or close to the scale maximum. As such, they are less sensitive than direct assessments in reflecting small differences in EF skills across students and incremental changes in EF skills over time. Finally, questionnaires require teachers to contribute considerable time and cognitive effort, which makes it difficult to gather information on all students in a classroom or to track changes throughout an academic year.

Direct assessment of EF skills addresses problems with objectivity and measurement precision (Silver, 2014) and is thus considered to be the “gold standard” of EF measurement. However, extant individual assessment procedures are prohibitively expensive for large-scale studies such as program evaluations. Further, taking children out of the classroom one at a time burdens teachers by reducing instructional time and disrupting students' attention and behavior. Understandably, teachers and district officials often object to this type of research design. In order to employ direct assessments of EF skills at scale, we need to develop a group-based assessment procedure that is pragmatic, cost-effective, and minimally disruptive.

Although researchers have recognized the need to measure EF skills in real-world settings (McCabe et al., 2000; Sbordone, 2001), they almost exclusively employ individual assessment procedures (Fuhs et al., 2013; Prager et al., 2016; Schmitt et al., 2015; Weiland et al., 2014). We were able to identify only one small pilot study (reported in a book chapter) in which EF data were collected in a group context. McCabe, Rebello-Britto, Hernandez, and Brooks-Gunn (2004) tested 44 preschoolers in a group administration procedure, where four familiar peers simultaneously completed modified laboratory-based tasks in a classroom setting with one administrator. The authors coded video recordings of group assessment and reported that children had a harder time controlling impulses during the Gift Wrap task when assessed in a peer group setting than during individual assessment, but otherwise did not compare children's EF performance across the two settings. Computerized tasks that automatically score accuracy and reaction time (RT) create an opportunity to extend this work and evaluate the feasibility of group assessment of EFs in middle childhood.

The main goal of the current study was to evaluate a new group assessment procedure that allows researchers to directly measure EF skills in all students at the same time. Our assessment procedures included a number of methodological innovations to obtain reliable and valid data while simultaneously reducing staff training requirements and the cost of data collection. We adapted developmentally appropriate, widely used EF tasks for administration on tablet computers. These tasks were selected to yield a broad measure of EFs, as represented by inhibitory control, working memory, and cognitive flexibility (see Measures for details). The computer-based tasks provided both accuracy and RT data, thus eliminating the need for video recording or coding of children's responses. Moreover, the portability of tablet devices and children's ease with the touch-screen interface enabled group assessment. Our procedure has the potential to significantly lower the costs and increase the widespread use of high quality direct assessments of EFs.
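To make the idea of automatic scoring concrete, the following is a minimal sketch of how logged trials from one task block could be reduced to an accuracy score and a mean RT. It is an illustration only, not the scoring code of the tablet apps described here: the `Trial` record, the `score_block` helper, the choice to average RT over correct trials only, and the example values are all assumptions made for this sketch.

```python
from dataclasses import dataclass
from statistics import mean
from typing import List, Optional, Tuple


@dataclass
class Trial:
    """One logged response (hypothetical record layout)."""
    correct: bool   # whether the response was correct
    rt_ms: float    # reaction time in milliseconds


def score_block(trials: List[Trial]) -> Tuple[float, Optional[float]]:
    """Return (proportion correct, mean RT over correct trials).

    Averaging RT over correct trials only is an assumption of this
    sketch; the mean RT is None when no trial was answered correctly.
    """
    if not trials:
        raise ValueError("block has no trials")
    accuracy = sum(t.correct for t in trials) / len(trials)
    correct_rts = [t.rt_ms for t in trials if t.correct]
    mean_rt = mean(correct_rts) if correct_rts else None
    return accuracy, mean_rt


# Invented example values, for illustration only.
block = [
    Trial(True, 640.0),
    Trial(False, 910.0),
    Trial(True, 700.0),
    Trial(True, 760.0),
]
accuracy, mean_rt = score_block(block)
print(accuracy, mean_rt)
```

Because each response is logged with its correctness and timing, a summary like this can be produced for every student in a classroom without video recording or manual coding.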

Our analyses compare the reliability and validity of this novel group assessment procedure with the reliability and validity of an analogous individual assessment procedure that was conducted in a quiet, highly controlled setting. We hypothesized that children's performance in the group assessment setting would show convergent validity with their performance in the more conventional individual assessment setting. Further, we expected that EF skills would show similar associations with known demographic and educational covariates across both assessment settings. Finally, since direct assessment of EFs in the classroom has greater ecological validity, we hypothesized that performance in the group assessment setting would have greater predictive validity for children's self-regulated classroom behaviors and academic achievement compared to performance in the individual assessment setting.
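One simple way to look at convergent validity between the two settings is to correlate each student's score from the individual assessment with the same student's score from the group assessment. The sketch below computes a Pearson correlation in plain Python. The `pearson_r` helper and the score lists are hypothetical and invented for illustration; they are not data or analysis code from this study, and the study's actual analyses may differ.

```python
from math import sqrt
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation between two paired lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two paired lists with at least two values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant scores")
    return sxy / sqrt(sxx * syy)


# Invented accuracy scores for the same five students in each setting.
ia_scores = [0.62, 0.75, 0.81, 0.90, 0.70]   # individual assessment
ga_scores = [0.58, 0.72, 0.85, 0.88, 0.66]   # group assessment
r = pearson_r(ia_scores, ga_scores)
print(round(r, 3))
```

A high positive correlation in such a comparison would be consistent with the two settings ranking students similarly, which is what the convergent validity hypothesis anticipates.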

Section snippets

Participants

Students and teachers in 33 classrooms across eight schools in the San Francisco Bay Area participated in a three-tiered, longitudinal study design. First, all but one parent agreed to let their child participate in classroom activities, including the group assessment (GA) of EF skills. During GA, 720 students completed at least one of the three EF tasks evaluated in the current study. Second, we received parental written consent for 71% of these students to access their school records data.

Individual executive function tasks

Correlations and descriptive statistics for the scores on the individual EF tasks are presented in Table 1. Only accuracy for the incongruent blocks on MSIT and H&F tasks showed an indication of ceiling effects. For the MSIT incongruent block, 20% of students attained perfect scores during IA, and 19% of students attained perfect scores during GA. For the H&F incongruent block, 38% of students attained perfect scores during IA, and 34% of students attained perfect scores during GA. Although

Discussion

In the current study, we demonstrated that a novel, group-based approach to measuring students' EF skills in a classroom context is a reliable and valid alternative to a conventional, individual assessment approach. One important advantage of the group-based assessment procedure over the conventional individual assessment approach is that it provides a way to simultaneously collect direct tests of EFs for all students in a class, making this a pragmatic and scalable method of data collection

Acknowledgements

This research was supported by a William T. Grant Foundation Scholar award (180826) to Jelena Obradović. The preparation of this manuscript is also supported by a William R. and Sara Hart Kimball Stanford Graduate Fellowship to Jenna Finch. The authors thank the children, teachers, and school administrators who participated and made this research possible, and many graduate and undergraduate students who helped collect and process the data. The findings, conclusions, and opinions here are those

References (65)

C.E. Cameron Ponitz et al.
Touch your toes! Developing a direct measure of behavioral regulation in early childhood
Early Childhood Research Quarterly
(2008)

S.B. Campbell et al.
Commentary on the review of measures of early childhood social and emotional development: Conceptualization, critique, and recommendations
Journal of Applied Developmental Psychology
(2016)

S.M. Carlson et al.
Inhibitory control and emotion regulation in preschool children
Cognitive Development
(2007)

M.C. Davidson et al.
Development of cognitive control and executive functions from 4 to 13 years: Evidence from manipulations of memory, inhibition, and task switching
Neuropsychologia
(2006)

I. Friso-van den Bos et al.
Working memory and mathematics in primary school children: A meta-analysis
Educational Research Review
(2013)

Y. Liu et al.
The typical development of posterior medial frontal cortex function and connectivity during task control demands in youth 8–19 years old
NeuroImage
(2016)

M.M. McClelland et al.
The impact of kindergarten learning-related skills on academic trajectories at the end of elementary school
Early Childhood Research Quarterly
(2006)

C. McKown et al.
Teacher expectations, classroom context, and the achievement gap
Journal of School Psychology
(2008)

R. Neuenschwander et al.
How do different aspects of self-regulation predict successful adaptation to school?
Journal of Experimental Child Psychology
(2012)

E. Oberle et al.
Relations among peer acceptance, inhibitory control, and math achievement in early adolescence
Journal of Applied Developmental Psychology
(2013)

J. Obradović
Effortful control and adaptive functioning of homeless children: Variable-focused and person-focused analyses
Journal of Applied Developmental Psychology
(2010)

E.O. Prager et al.
Executive function and magnitude skills in preschool children
Journal of Experimental Child Psychology
(2016)

T.R. Sasser et al.
Executive functioning and school adjustment: The mediational role of pre-kindergarten learning-related behaviors
Early Childhood Research Quarterly
(2015)

S.A. Schmitt et al.
Strengthening school readiness for Head Start children: Evaluation of a self-regulation intervention
Early Childhood Research Quarterly
(2015)

N. Yeniad et al.
Cognitive flexibility children across the transition to school: A longitudinal study
Cognitive Development
(2014)

N. Yeniad et al.
Shifting ability predicts math and reading performance in children: A meta-analytical study
Learning and Individual Differences
(2013)

N.P. Allan et al.
Relations between inhibitory control and the development of academic skills in preschool and kindergarten: A meta-analysis
Developmental Psychology
(2014)

P. Allison
Fixed effects regression models
(2009)

K. Bartolino et al.
Expanding the definition of student success: A case study of the CORE districts

J.R. Best et al.
A developmental perspective on executive function
Child Development
(2010)

C. Blair et al.
Relating effortful control, executive function, and false belief understanding to emerging math and literacy ability in kindergarten
Child Development
(2007)

C. Blair et al.
Multiple aspects of self-regulation uniquely predict mathematics but not letter–word knowledge in the early elementary grades
Developmental Psychology
(2015)

T.L. Blankenship et al.
Frontotemporal coherence and executive functions contribute to episodic memory during middle childhood
Developmental Neuropsychology
(2015)

K.C. Brocki et al.
Executive functions in children aged 6 to 13: A dimensional and developmental study
Developmental Neuropsychology
(2004)

R. Bull et al.
Executive functioning and mathematics achievement
Child Development Perspectives
(2014)

G. Bush et al.
The Multi-Source Interference Task: An fMRI task that reliably activates the cingulo-frontal-parietal cognitive/attention network
Nature Protocols
(2006)

California Department of Education
California STAR program - 2013 STAR test results (CA Dept of Education)

California Department of Education
Smarter balanced assessment system - testing (CA Dept of Education)

S.M. Carlson
Developmentally sensitive measures of executive function in preschool children
Developmental Neuropsychology
(2005)

S. Ciairano et al.
Executive inhibitory control and cooperative behavior during early school years: A follow-up study
Journal of Abnormal Child Psychology
(2007)

A. Diamond
Executive functions
Annual Review of Psychology
(2013)

A.L. Duckworth et al.
Measurement matters: Assessing personal qualities other than cognitive ability for educational purposes
Educational Researcher
(2015)
