In the last few decades, rates of Cesarean section (CS) have risen dramatically. In the United States, the CS rate was 32% in 2015, which is 11% higher than in 1996 [
This rise in CS rates is partly associated with factors that are difficult to change, such as increasing maternal age at first birth [
4]. However, there are other factors that may be easier to alter, such as maternal preference, medical models of care, or funding mechanisms that encourage more frequent intervention in birth. Indeed, after adopting a single, blended payment policy for uncomplicated CS and vaginal births, a decline of 0.27 percentage points per quarter in CS rate was observed in Minnesota, USA [
CS is often a life-saving intervention, but may result in adverse effects on maternal and child health. Women who underwent CS have been found to be at higher risk of miscarriage, stillbirth, placenta previa, placenta accreta and placental abruption in future pregnancies [
9] have been reported as well.
In children, health risks associated with CS are well-documented. Existing reviews and meta-analyses concluded that CS is associated with a higher risk of inflammatory bowel disease [
6]. Effects on children’s psychological development and behavior are less known, although there is emerging evidence suggesting that CS might impact child neurodevelopment due to modified hypothalamic–pituitary–adrenal axis programming [
19]. The extant literature in this area, however, is inconclusive, as the studies on this topic are scarce and conflicting in results. Some studies have revealed deficits in cognitive domains [
23], or a higher risk of mental health problems, such as autism and Attention-Deficit/Hyperactivity Disorder (ADHD) [
25] in children who were born by CS. Some studies, though, found no effects of CS on child psychological outcomes [
29] and some have reported positive outcomes [
31]. These discrepancies might be due to differences in methodology, such as the measurement of outcomes (e.g., parental report or direct child assessment), sample characteristics, the type of CS examined (planned for medical reasons or at maternal request, or emergency CS), or the confounders controlled for.
Beyond general expectations that CS may impact developmental progression, potentially delaying acquisition of developmental milestones, CS may also result in alterations in child temperament [
Surgency concerns individual differences in activity level, impulsivity, and pleasure in situations with high stimulus intensity;
Negative Affectivity involves demonstrations of sadness, fear, anger, frustration, or discomfort, and difficulties soothing;
Effortful Control refers to children’s capacities to plan, inhibit inappropriate approach responses, maintain attention, and enjoy low-intensity activities. Temperament has frequently been linked to adjustment in young children [
39]. Early forms of problems are often demarcated as internalizing (emotional problems including depression, anxiety and peer problems) and externalizing (conduct problems involving aggression, hyperactivity and inattention) [
The aim of this study was to examine the association between CS and developmental milestones, temperament, and internalizing and externalizing problems in children aged four years, using data from a birth cohort study. We hypothesized that children born via CS to healthy mothers with no pregnancy complications are at higher risk of developmental delays, behavioral problems, and associated temperament characteristics, compared to children born vaginally. Since those associations might be sex-specific [
42], we stratified our analyses for child sex.
Procedure and Participants
43]. At baseline, the women were asked to complete a questionnaire about their sociodemographic background. Data regarding labor, delivery and neonatal outcomes were extracted from medical records in collaboration with the hospitals. Children were followed up at the age of four. The mothers were asked to complete questionnaires about their child’s behavior and neurodevelopment, their own health and psychological status, and family sociodemographic background.
Data regarding labor and delivery and data about the sociodemographic background were collected from 1190 women. Of those women, 343 took part in the follow-up study four years postpartum. Participants were excluded for multiple pregnancy (n = 3), age < 20 or > 40 years (n = 4), gestational age at birth < 37 or > 42 weeks (n = 11), pregnancy complications (e.g., diabetes, hypertension) (n = 38), birth weight < 2500 g (n = 5), Apgar score at 5 min < 8 or hospitalization of the newborn in the maternity hospital for > 10 days (n = 5), and vaginal instrumental birth (n = 6). Mother–child pairs with missing values on any of the key study variables were also excluded (n = 15). The final sample thus consisted of 256 mothers and their children (see flow chart, Figure S1, Supplementary Material). A comparison of women who were included in the analyses with those who were not showed that women who were older, had higher education levels or had a spouse were more likely to take part in the follow-up study (Table S1, Supplementary Material).
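The sample flow described above can be checked with a short sketch. The counts are those reported in the text; the dictionary keys are illustrative labels rather than variable names from the study, and the sketch assumes that each excluded pair is counted under exactly one criterion.

```python
# Exclusion counts as reported in the text; key names are illustrative labels.
follow_up_n = 343
exclusions = {
    "multiple_pregnancy": 3,
    "maternal_age_outside_20_40_years": 4,
    "gestational_age_outside_37_42_weeks": 11,
    "pregnancy_complications": 38,
    "birth_weight_below_2500_g": 5,
    "low_apgar_or_long_neonatal_stay": 5,
    "vaginal_instrumental_birth": 6,
    "missing_key_variables": 15,
}


def final_sample_size(start, excluded):
    """Subtract every exclusion count from the starting sample.

    Assumes the counts do not overlap (each pair excluded only once).
    """
    return start - sum(excluded.values())


final_n = final_sample_size(follow_up_n, exclusions)
print(final_n)
```

Under that assumption, the result matches the reported final sample of 256 mother–child pairs.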
Exposure Variable: Mode of Birth
Children’s Developmental and Behavioral Outcomes at the Age of Four
44]. The ASQ-3 is a parent-completed questionnaire commonly used in clinical and research settings to screen for developmental delays across five domains of child development: Communication, Fine Motor, Gross Motor, Problem Solving, and Personal-Social. Parents evaluate whether the child has mastered a specific skill (‘yes’), is just beginning to master that skill or masters it only occasionally (‘sometimes’), or has not yet acquired that skill (‘not yet’); these responses are scored 10, 5 or 0, respectively. Each domain contains 6 items. The total score for each domain is the sum of the item points and ranges from 0 to 60. The higher the score, the better the skills and abilities in the given domain. The psychometric characteristics of the ASQ-3 are satisfactory, as reported by Schonhaut et al. [
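The ASQ-3 domain scoring described above can be illustrated with a minimal sketch. The function and constant names are hypothetical, and the sketch assumes all six items of a domain were answered; it does not implement any rule for unanswered items.

```python
# Points per response option, as described in the text.
ASQ_POINTS = {"yes": 10, "sometimes": 5, "not yet": 0}
ITEMS_PER_DOMAIN = 6


def asq_domain_score(responses):
    """Sum the item points for one ASQ-3 domain (possible range 0 to 60).

    Assumes a complete set of six responses; missing items are not handled.
    """
    if len(responses) != ITEMS_PER_DOMAIN:
        raise ValueError("each domain is expected to have 6 answered items")
    return sum(ASQ_POINTS[answer] for answer in responses)


# Hypothetical responses for one child in one domain.
example = ["yes", "yes", "sometimes", "not yet", "yes", "sometimes"]
print(asq_domain_score(example))
```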
35]. The CBQ-VSF is a widely used tool to assess young children’s reactivity and regulation; it contains 36 items divided into three subscales: Surgency, Negative Affectivity and Effortful Control. Each subscale consists of 12 items rated on a 7-point scale ranging from 1 to 7. The total score for each subscale represents the mean score of all scale items applicable to the child during the last 6 months. The authors reported satisfactory internal consistency for the CBQ-VSF scales [
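As a rough illustration of the CBQ-VSF subscale scoring described above, the sketch below averages the ratings of the items that apply to the child. The function name is hypothetical, `None` is used here to mark a "not applicable" item, and any reverse-keyed items are assumed to have been recoded beforehand.

```python
ITEMS_PER_SUBSCALE = 12


def cbq_subscale_mean(ratings):
    """Mean of the applicable item ratings (1 to 7) for one subscale.

    `None` marks an item rated as not applicable (an assumption of this
    sketch). Returns None when no item is applicable.
    """
    if len(ratings) != ITEMS_PER_SUBSCALE:
        raise ValueError("each subscale is expected to have 12 items")
    applicable = [r for r in ratings if r is not None]
    for rating in applicable:
        if not 1 <= rating <= 7:
            raise ValueError("ratings must lie between 1 and 7")
    if not applicable:
        return None
    return sum(applicable) / len(applicable)


# Hypothetical ratings: ten applicable items and two not applicable.
example = [4] * 10 + [None, None]
print(cbq_subscale_mean(example))
```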
41]. The SDQ is a widely used questionnaire for mental health screening that consists of scales measuring five dimensions (Emotional Symptoms, Conduct Problems, Hyperactivity/Inattention, Peer Relationship Problems, Prosocial Behavior), with each scale containing five items. In this study, we used 20 items of the SDQ divided into the Internalizing and Externalizing Problems subscales [
41]. The Internalizing Problems subscale consists of the Emotional Symptoms and Peer Relationship Problems dimensions. The Externalizing Problems subscale covers Conduct Problems and Hyperactivity/Inattention domains. The items are rated on a 3-point scale ranging from 0 to 2 points such that the informant (parent or teacher) evaluates whether the statement is ‘Not True’, ‘Somewhat True’ or ‘Certainly True’. Both Internalizing and Externalizing scores may range from 0 to 20, with the higher scores indicating more severe problems. The SDQ was reported to have satisfactory psychometric properties [
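The composition of the two SDQ subscales described above can be sketched as follows. The function names are hypothetical, and item scores are assumed to be already coded 0 to 2 in the problem direction (that is, any reverse-keyed items have been recoded beforehand).

```python
ITEMS_PER_SCALE = 5


def scale_sum(items):
    """Sum five item scores (each 0, 1 or 2) for one SDQ scale."""
    if len(items) != ITEMS_PER_SCALE:
        raise ValueError("each SDQ scale is expected to have 5 items")
    if any(score not in (0, 1, 2) for score in items):
        raise ValueError("item scores must be 0, 1 or 2")
    return sum(items)


def sdq_broadband(emotional, peer, conduct, hyperactivity):
    """Combine four SDQ scales into the two 20-point subscales."""
    return {
        "internalizing": scale_sum(emotional) + scale_sum(peer),
        "externalizing": scale_sum(conduct) + scale_sum(hyperactivity),
    }


# Hypothetical item scores for one child.
example = sdq_broadband(
    emotional=[1, 0, 2, 0, 1],
    peer=[0, 0, 1, 0, 0],
    conduct=[1, 1, 0, 0, 0],
    hyperactivity=[2, 1, 1, 0, 2],
)
print(example)
```

Prosocial Behavior is deliberately left out, since the 20 items used in the study cover only the four problem scales.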
Control Variables
49]. The BDI-II is a widely used questionnaire consisting of 21 items that are rated on a 4-point scale ranging from 0 to 3. The total score may range from 0 to 63, with higher scores indicating a higher level of depressive symptoms. In this study, a validated Czech version of the BDI-II showing high reliability (Cronbach’s alpha 0.92) was used [
Statistical Analyses
Descriptive statistics and chi-square tests were calculated on the maternal characteristics to assess the differences between women who were included in and those who were excluded from the present study. Descriptive statistics were also used to report maternal, childbirth, and child characteristics. Bivariate associations of mode of birth with maternal, childbirth and child characteristics were calculated with chi-square tests, and bivariate associations of mode of birth with measurement scores were calculated with Student's t-tests or ANOVA, where appropriate.
Univariate and multivariate linear regression analyses were conducted to further assess associations between CS and children’s developmental (ASQ subscales) and behavioral (CBQ-VSF and SDQ subscales) outcomes. Compliance with the assumptions of multivariate linear regression was evaluated. The unadjusted and adjusted associations between CS and children’s outcomes were reported as Beta coefficients with 95% confidence intervals (95% CI). The corresponding percentages of explained variance (R²) of all regression models were also calculated. In the multivariate model, the effect of CS was adjusted for artificial hormones, i.e., oxytocin/prostaglandin for induction of labor (yes/no), parity (nulliparous vs. multiparous women), gestational age at birth (continuous variable, weeks of gestation) and maternal depression (continuous BDI-II score). Additionally, the linear regression models were stratified for child sex. The statistical analyses were performed using SPSS Statistics version 23.0 (SPSS Inc., Chicago, IL, USA).
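The difference between an unadjusted and an adjusted coefficient can be illustrated with a minimal ordinary least squares sketch. This is not the SPSS procedure used in the study: the data are a synthetic toy example, only one covariate (gestational age) is included, the helper names are hypothetical, and confidence intervals and R² are omitted.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular design matrix")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def ols(rows, y):
    """Return [intercept, b1, ...] from the normal equations X'X b = X'y."""
    x = [[1.0] + list(r) for r in rows]
    k = len(x[0])
    xtx = [[sum(row[i] * row[j] for row in x) for j in range(k)] for i in range(k)]
    xty = [sum(row[i] * yi for row, yi in zip(x, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)


# Synthetic toy data (not study data): score = 10 + 5 * cs + 1 * gest_age.
cs = [0, 0, 1, 1, 0, 1]                # 1 = born via CS, 0 = vaginal birth
gest_age = [38, 40, 39, 41, 39, 40]    # weeks of gestation
score = [48, 50, 54, 56, 49, 55]

unadjusted = ols([[c] for c in cs], score)
adjusted = ols([[c, g] for c, g in zip(cs, gest_age)], score)
print("unadjusted CS coefficient:", round(unadjusted[1], 3))
print("adjusted CS coefficient:", round(adjusted[1], 3))
```

In this toy data set, the CS group has a slightly higher mean gestational age, so the unadjusted coefficient absorbs part of the gestational-age effect and differs from the adjusted one.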
The aim of this study was to investigate associations between Cesarean section (CS) and child behavioral and developmental outcomes in four-year-olds born healthy to mothers with no serious pregnancy complications. In analyses adjusted for parity, gestational age, child’s sex, induction of labor, and maternal depressive symptoms, we found a small but significant association suggesting that children born via CS demonstrated better problem solving abilities than children who had been born vaginally. Stratifying by child sex indicated that the effects of CS on problem solving were limited to boys. Girls, on the other hand, scored worse in the Gross Motor domain if they were born via CS rather than vaginally, and this effect was not observed in boys. No associations were found between mode of birth and other developmental domains, child temperament or behavioral difficulties.
52]. It is possible that CS diminished risks related to vaginal birth in boys more frequently in comparison to girls, who face fewer risks of vaginal delivery. However, our results need to be considered with caution as research on the effects of CS on child neurodevelopment is still at a very early stage. Moreover, previous studies focusing on the association between CS and cognitive development observed negative effects [
20] included only twins.
23] reported that both girls and boys were at a higher risk for worse outcomes in motor development in childhood and adolescence if born through either elective or non-elective CS compared to vaginal birth. The inconsistencies in the existing studies may be due to several factors, including the timing of measurement (i.e., the age of child assessment), the method of measurement (parent report or direct assessment), or the separation of the CS variable into subtypes (elective, non-elective).
54], the higher rate of internalizing problems in their children might be related to genetic and family environment factors rather than mode of birth.
The present study has several strengths. We used prospectively collected data to examine the effect of CS on a broad range of developmental domains. Data regarding labor and delivery were extracted from medical records in cooperation with maternity hospitals, rather than relying upon maternal recollection of intrapartum medication and interventions. Our analyses controlled for multiple relevant covariates, and our sample of healthy mother–child pairs with satisfactory perinatal outcomes diminished confounding by health complications that could lead to both CS and compromised child outcomes.
However, several limitations to our study need to be considered. First, the child developmental and behavioral outcomes were assessed by maternal report only, not by direct assessment of child development and behavior. Even though the ASQ, CBQ-VSF and SDQ are widely used, valid and reliable tools to assess child development and behavior, it is not clear to what extent they measure child behavior per se versus maternal perceptions of the child. Second, although common in longitudinal studies, the attrition rate was relatively high, with statistically significant differences found between women who dropped out of the study and those who did not. The women who were more likely to participate in the follow-up study were older, had higher educational status, and were living with a spouse, which might restrict the generalizability of our findings to women with higher socioeconomic status. Similarly, the exclusion criteria set to eliminate confounding by indication for CS resulted in a sample consisting of a relatively narrow population of healthy mother–child pairs, limiting generalization to mothers at higher risk for adverse outcomes. Also, the sample size was relatively small, which may have limited statistical power, especially in the analyses stratified by child sex. It is thus possible that some associations detectable in a larger sample were not identified in our study. Moreover, the relatively small sample size precluded us from assessing the effects of planned and emergency CS separately, which may have obscured their differential effects. The variance explained by some models was relatively low, indicating that other unobserved factors explain child outcomes. For example, we did not include maternal parenting competences or breastfeeding status in our models, although those variables might play a role in the association between CS and child outcomes.
Previous investigations have suggested possible adverse consequences of CS on child neurodevelopment, yet our study largely failed to support this association. Results of this study indicate some sex-specific effects of CS, with boys scoring better in the cognitive domain and girls worse in the gross motor domain. Nevertheless, more research is needed before any strong conclusions can be drawn, preferably using population-based samples and objective measures to assess child outcomes, distinguishing between planned and emergency CS, controlling the analyses for relevant confounders including indications for CS, and conducting sensitivity analyses for high- and low-risk populations. Also, research that might help to identify the mechanisms underlying any association between CS and child outcomes, examining both biological (cortisol response, gut microbiota composition, DNA methylation analysis) and psychosocial (maternal parenting competences, bonding) pathways, is warranted. As there is not sufficient research to draw a final conclusion regarding the effects of CS, a “precautionary principle” approach is encouraged, weighing the benefits and potential risks of the surgical birth intervention for each mother and child.
