Introduction

Breastfeeding has long been associated with better health, immunological advantages, and cognitive outcomes for children from infancy through early adulthood [1,2,3,4]. More recently, a growing interest in the potential benefits of breastfeeding on children’s behavioural development has emerged. While breastfeeding and externalising behaviours have been less well-understood, multiple hypotheses regarding the potential mechanisms for associations have been put forth including psychological mechanisms, neurological brain development, and genetic risk. For example, mother–infant bonding via early skin-to-skin touch, eye contact, and cradling during breastfeeding can create more secure attachment in infants [5, 6], independent of shared environment and genetic factors [7], consequently reducing the risk of future externalising problems. Relatedly, higher maternal responsiveness to child needs, marked by greater activation in the superior frontal gyrus, insula, precuneus, striatum, and amygdala, along with increased oxytocin response [8, 9], observed in mothers who breastfeed, have been associated with better child adjustment and reduced risk of externalising problems [10].

Brain development may be another mechanism for the proposed link between breastfeeding and externalising behaviour. The nutrients found in breast milk, in particular the long-chain polyunsaturated fatty acids, have been shown to positively impact on white matter growth, and abnormalities in white matter have been implicated in studies with children who exhibit conduct problems [11] and hyperactivity [12]. Further, white matter growth is particularly critical during the first 2 years, [13] which may support a linear dose–response association of longer durations of breastfeeding. It has also been suggested that breastfeeding may protect against the development of externalising behaviour for children who have an elevated genetic risk [14]. Research on both conduct problems and hyperactivity have consistently revealed a strong genetic link for predisposition to these behaviours [e.g., 15], making increasingly attractive a better understanding of the environmental factors that may protect against their emergence.

Despite the plausibility of the above-mentioned hypotheses, support for the link between breastfeeding and externalising problems across studies has been mixed. Breastfeeding has been associated with reduced externalising behaviours in some studies [16,17,18,19] but not in others [20, 21]. Moreover, the issue of causality remains open due to the high reliance on observational studies. For instance, mothers who breastfeed typically have higher levels of education and engage in less risky prenatal behaviours (e.g., smoking) [e.g., 22]. These same characteristics are also associated with children’s behavioural outcomes [23, 24]. Given these differences, lack of randomisation violates statistical assumptions of equivalence between groups, that is between children who were breastfed and children who were not. This raises the possibility that any benefits regarding behavioural outcomes within observational studies may be attributable to selection bias and unobserved confounding that have not been adequately controlled for, rather than to breastfeeding per se.

Indeed, statistical adjustment for confounding due to selection bias (i.e., mothers self-select into breastfeeding and mothers who breastfed may provide more optimal environments) often results in smaller and/or statistically insignificant ‘effects’. Identifying ‘effects’ without the use of randomised control trials is challenging. The best evidence to date on the effects of breastfeeding and children’s externalising behaviour stems from a cluster-randomised breastfeeding promotion trial, conducted in the Republic of Belarus (PROBIT) [21]. As Kramer et al. [21] note, while no statistically significant differences between treatment and control groups on children’s behavioural outcomes were found under the intention to treat analysis, it is possible that the differences between the two groups, regarding the intensity and duration of breastfeeding, were not large enough to detect any effects. This may particularly be the case for behavioural outcomes given that some previous studies have supported a linear dose–response association [25, 26]. Moreover, the PROBIT trial examined behavioural outcomes using dichotomised scores of the Strengths and Difficulties Questionnaire (i.e., abnormal scores in the upper 85th percentile), which have a strong predictive validity of future clinical diagnosis [27]. It is possible that clinical levels of externalising behaviour problems are less amenable to environmental protective factors, such as breastfeeding, particularly regarding shorter durations and partial breastfeeding.

Given the ethical challenges of randomised trials regarding breastfeeding, the use of quasi-experimental statistical approaches, such as propensity score matching, provides the next best approach to understanding any potential ‘effects’ of breastfeeding on children’s behavioural outcomes. Only a few studies to date have used quasi-experimental approaches while examining breastfeeding and behavioural outcomes [e.g., 16, 17], and results from these studies are suggestive of modest effects.

Objectives

We examined the potential effects of breastfeeding on children’s externalising behaviour from late childhood into adolescence using a quasi-experimental statistical approach, to address the inherent methodological challenges within observational cohort studies. Moreover, we examined whether the benefits of breastfeeding would differ by duration of breastfeeding. It was hypothesised that children who were breastfed would have lower externalising problems compared with children who were not breastfed, with larger effects observed for children who were breastfed for longer durations.

Methods

Participants included children and their families enrolled in the Growing Up in Ireland (GUI) Child Cohort. Recruitment occurred in the Republic of Ireland between September 2007 and June 2008 using a two-stage random selection process, (i.e., the National School System and random selection of eligible children within each school). The child level eligibility criterion was being born between November 1st 1997 and October 31st 1998. The total sample included 8570 9-year-old children, with a response rate of 82% at the school level and 57% at the individual level. For a detail description of the GUI child cohort, please see [28]. The current study used data collected when children were age 9 and 13. Eligibility criteria in this study included children who were born full-term and who had complete data on matching variables at age nine (N = 6013; 70.2%). Girls represented 51.9% (n = 3121). Attrition and missing outcome data over time resulted in N = 5342 at age 13. Demographic characteristics of this sample can be found in Table 1. Further, a comparison between families included in the current study and those initially recruited can be found in the online supplement. The Health Research Board’s Research Ethics Committee granted ethics approval, and both written consent and assent were collected from parents/guardians and children prior to all data collection.

Table 1 Comparison of family, maternal, child and medical characteristics: child cohort at 9 years

Measures

Children’s behaviours were assessed using both the parent version of the Strengths and Difficulties Questionnaire (SDQ) at ages 9 and 13 and the teacher version at age 9 only. The conduct problems (e.g., ‘often fights with other children or bullies them’) and the hyperactivity (e.g., restless, overactive, cannot stay still for long’) scales were used. Both scales are comprised of five items. Parents and teachers were asked to rate the applicability of the child’s behaviour on a 3-point scale (i.e., 0 = not true to 2 = certainly true), with possible scores ranging between 0 and 10. The SDQ has been well-validated [29]. Please see Table 2 for the means of maternal- and teacher-rated conduct problems and hyperactivity, along with the correlations between maternal and teacher ratings.

Table 2 Bivariate correlations between maternal and teacher SDQ scores and means/standard deviations of children’s outcomes at 9 and 13 years of age

Breastfeeding information was collected retrospectively via maternal report when children were age nine. Mothers were asked, “was the child ever breastfed even if only for a short time” and “for how many months or weeks was the study child breastfed”. Support for reliability of recall from 6 to 15 years after breastfeeding occurred has previously been established, with only a slight overestimation bias of 1 week [30, 31]. No information was collected regarding type of breastfeeding in the cohort. Thus, children were grouped into one of four categories: never breastfed (n = 2, 853), breastfed up to 25 weeks (n = 2, 461), breastfed between 26 and 50 weeks, (n = 461), and breastfed 51 weeks or more (n = 238), which included partial and exclusive breastfeeding. The three groups of children who were breastfed were treated as mutually exclusive and compared against those who were not breastfed.

Confounders and self-selection into breastfeeding have been suggested to partially account for previous associations found between breastfeeding and child outcomes. Thus, children (breastfed, never breastfed) were matched on 16 factors previously associated with breastfeeding and children’s behavioural outcomes. At the family level, these included social class (professional/managerial, other non-manual/skilled manual, semi-skilled/unskilled, no valid social class), medical card status (free medical care, free general practitioner care, no free medical care), and both maternal and paternal parenting style (authoritative, authoritarian, permissive, neglectful; the Parenting Style Inventory II) [32]. At the maternal level, factors included age (33 years or younger, 34–38 years, 39–43 years, 44 years or older), education (less than high school diploma, high school diploma, college diploma, university degree, graduate degree), partner status (yes/no), employment status (employee, self-employed, farmer, student, state training, unemployed, long-term sickness/disability, home duties/retired), Irish born (yes/no), smoking during pregnancy (yes/no), drinking during pregnancy (yes/no) and type of delivery (vaginal, caesarean). At the child level, factors included birth weight (2500 g or more, yes/no), having neonatal intensive care (yes/no), sex (boy/girl) and sibling status (yes/no).

Statistical analysis

While experimental designs that randomise breastfeeding conditions are not feasible, statistical techniques such as propensity score matching (PSM) can help overcome selection bias by matching children who were breastfed to those who were not based on their measured characteristics, consequently diminishing differences between groups [33]. Nearest neighbour 1 to 1 matching techniques were used. Nearest neighbour matches groups sequentially, using the closest match between groups once the sample has been randomly ordered. Randomly sorting the sample prior to matching reduces possible bias that could arise from the matching procedure. To ensure the most optimal matches, we imposed a caliper between matches (i.e., a match could only occur if the propensity score fell within a tenth of a standard deviation of each other). Given the low frequency of children who were breastfed between 26 and 50 weeks, and 51 weeks or more, we allowed matching with replacement to ensure a better quality of matches [34]. Balance checks were conducted for each model to ensure bias between the groups was substantially reduced. After matching, remaining bias on individual confounders ranged between 0.0 and 17.9% and the overall mean model bias ranged between 2.4 and 8.2%, indicating successful matching [33]. After matching, all families fell within the area of common support, with the exception of one family whilst examining children who were breastfed 51 weeks or more. Support refers to observations being excluded from the analysis due to not finding a match within the specified caliper. We report the average treatment effect on those who were breastfed. All reported results in the tables have been adjusted for multiple testing using the Holm–Bonferroni method. All statistical analyses were conducted using Stata 13 software.

Results

Prior to matching, children who were breastfed for up to 25 weeks were rated lower on both maternal- and teacher-rated conduct problems and hyperactivity at age 9, and maternal ratings at age 13, compared to children who were never breastfed. These differences were statistically significant. However, after matching none of the differences remained statistically significant. Results are displayed in Table 3. For children who were breastfed between 26 and 50 weeks, statistically significant differences were found between those who were never breastfed and those breastfed between 26 and 50 weeks for both conduct problems and hyperactivity at age 9 (maternal reports), hyperactivity at age 9 (teacher reports), and hyperactivity at age 13 (maternal reports). All results were in the expected direction (i.e., lower conduct problems and hyperactivity for children who were breastfed between 26 and 50 weeks). These statistically significant differences at age nine, for maternal- and teacher-rated hyperactivity, remained statistically significant after matching and correcting for multiple testing. That is, children who were breastfed between 26 and 50 weeks scored between 0.48 and 0.51 points lower than children who were not breastfed, indicative of a small effect. Maternal-reported conduct problems at age nine did not remain statistically significant after matching. Further, maternal-reported hyperactivity at age 13 did not remain statistically significant after matching. For children who were breastfed 51 weeks or more, statistically significant differences were found prior to matching on maternal-reported hyperactivity (age 9 and 13) and conduct problems (age 9) only. After matching and correction for multiple testing, no statistically significant differences remained.

Table 3 Breastfeeding and children’s externalising behaviours at 9 and 13 years of age: pre- and post-matching results

Comment

Our results contribute to the emerging body of work examining breastfeeding and children’s externalising difficulties in late childhood and adolescence. As the post-matching results are the best estimate of causal effect, we refer only to these. Also note, our results apply only to infants born full-term. In this cohort, we find some statistical support for the positive benefits of breastfeeding in reducing children’s hyperactivity at age nine, as reported by both mothers and teachers, for children who were breastfed between 26 and 50 weeks only. This finding may partially support the duration recommendation of the WHO, suggesting that longer durations, of at least 6 months of breastfeeding, may provide more optimal benefits for children’s development and well-being; in particular relating to their behavioural adjustment (i.e., hyperactivity) into late childhood. We did not find any statistically significant differences for hyperactivity at age nine, in favour of children who were breastfed for extended periods lasting beyond 51 weeks. This may suggest that in the current sample, there was a nonlinear dose–response association of breastfeeding on hyperactivity, similarly to the results of Borra and colleagues, using the Avon Longitudinal Study of Parents and Children [16]. The comparable results between these two studies may be reflective of the comparably low rates of breastfeeding in both Ireland and the UK. Alternatively, this may also reflect the small number of mothers in the current sample, less than 4%, who were still breastfeeding at 51 weeks. With the emergence of more studies using quasi-experimental approaches to examine breastfeeding and behavioural outcomes, replication in cohorts with a greater percentage of women who have breastfed for extended periods (i.e., beyond 12 months) may help in our understanding of whether the association with dose–response for behavioural outcomes is indeed non-linear.

Given that no information was collected on direct breastfeeding versus expressed milk, we were unable to further test the exact mechanism responsible for the reduced hyperactivity (i.e., the physical contact occurring during breastfeeding that helps to create more secure attachment or the nutrients found in breast milk which may impact on reduced hyperactivity via white matter growth). Alternatively, the idea of reverse causation is another potential explanation which merits consideration. In particular, it may be that difficult infants, who display early symptoms of hyperactivity, may be more challenging to breastfeed, especially for longer durations. Studies that are better able to disentangle the question of mechanisms are warranted and would help to advance the current state of knowledge. However, while statistically significant differences between children breastfed between 26 and 50 weeks and those who were never breastfed were found, the effect sizes for both maternal- and teacher-rated hyperactivity at age nine were small. Further, the results did not hold at age 13. This would suggest that these benefits were not maintained into adolescence, a time when environmental factors, such as peer deviance, can exert a more influential role on behavioural problems [35].

While the strengths of this study include the use of a large national developmental dataset from Ireland, a country with one of the lowest breastfeeding rates in Europe [36], the use of a multi-informant approach, coupled with a quasi-experimental approach for minimising selection bias, the use of repeated measures of behaviour, and a larger number of matching confounders compared to other studies using PSM, there remains notable limitations. First, despite previous support for the reliability of breastfeeding recall [25, 26, 37], bias may still be present. Relatedly, given the time elapse between breastfeeding and when mothers were retrospectively asked to recall on their breastfeeding engagement, we categorised breastfeeding into ‘less than 6 months, 26–50 weeks, and 51 weeks or more’ rather than looking at shorter intervals of breastfeeding. In the same vein, given the lower reliability of recall regarding the timing of introduction to complementary solids/liquids, exclusivity was not examined in the current study. There was also no data collected on the method of feeding (i.e., expressed breast milk, direct breastfeeding), impeding our ability to better understand the specific mechanism responsible for the observed benefits. Finally, while the use of PSM is an innovative statistical approach designed to minimise selection bias, PSM is only able to match groups on observable characteristics. It remains possible that unobservable characteristics which impact on both the decision to breastfeed and child outcomes contribute to the statistically significant benefits of breastfeeding found in the current study, which was not accounted for. For example, we had no information on maternal IQ, maternal personality, and postpartum maternal mental health, all of which may be potential contributing characteristics. Future studies would do well to build upon the current work through the inclusion of additional matching variables related to these maternal characteristics. Despite these limitations, our findings add to the current literature by providing some statistical support for the positive impacts of breastfeeding, for at least 6 months, on children’s behavioural development into late childhood, as reported by both mothers and teachers. In a practical context, these advantages remain modest given the small magnitude of effect. Regardless, it should be acknowledged that our results do not take away from any of the numerous medical benefits of breastfeeding afforded to both mothers and infants.