Introduction

Young women with breast cancer have a worse prognosis than middle-aged women [17], partly explained by diagnosis at a later stage [24, 6, 8] and by a higher proportion of unfavorable tumor characteristics. Young women more often have high grade, hormone receptor-negative, Her2-positive tumors, and also more often multifocality, high proliferation, and lymphovascular invasion (LVI) [2, 3, 6, 913]. Young women have a higher proportion of intrinsic breast cancer subtypes [14] associated with a worse prognosis: the triple-negative, Her2-positive, and Luminal B subtypes [5, 13, 1518]. Recently, the prognostic significance of young age has been shown to differ between the different subtypes. Whether young age is an independent prognostic marker for poorer survival even when taking subtype into account remains controversial [5, 10, 15, 18].

In a previous large registry-based cohort study, we found women <35 years to have a worse survival than middle-aged women [3]. Stage at diagnosis was a major explanatory factor; however, the excess risk of breast cancer death seen in younger women was only present in early disease, most pronounced in women with small tumors. After correction for stage and tumor characteristics, young age remained an independent risk factor for death.

As it is not likely that young age in itself confers a worse prognosis, but rather this reflects other associations we had not been able to correct for in our registry-based study, we continued with in-depth studies on a large subpopulation from the original cohort. We collected detailed data from the medical records (tumor characteristics, heredity, parity, and treatment), re-evaluated slides (grade, LVI), and collected tumor tissue for TMA providing us with an immunohistochemical (IHC) surrogate of the intrinsic breast subtypes for this population-based cohort study with almost complete and long-term follow-up to study the independent effect of young age on breast cancer-specific survival (BCSS), distant disease-free survival (DDFS), and locoregional recurrence-free survival (LRFS).

Methods

Study design

Through the regional breast cancer registries in two of Sweden’s six health-care regions, a population-based cohort of 22,017 women with a primary invasive breast cancer diagnosed from 1992 to 2005 at 69 years of age or younger were identified. All women <35 years at diagnosis (n = 471) were compared to random sampled groups of women aged 35–39 years (n = 200), 40–49 years (n = 200), and 50–69 years (n = 300) (Flow chart Fig. S1). The sample size was set after power calculations based on the effect sizes from the registry-based study [3]. To reach a power of 80 % at a 95 % significance level, we needed 326 individuals to detect a difference in BCSS and 262 individuals to detect a difference in LRFS.

Information on patient and tumor characteristics, including the treatments given and follow-up until the end of 2012 or until death was collected from the medical records. For women with synchronous bilateral breast cancer, the largest tumor was chosen as the index cancer. Staging was performed using the American Joint Committee on Cancer, 7th edition [19]. The study conforms to the STROBE and REMARK guidelines [20, 21].

Tumor material

Archival haematoxylin and eosin stained sections and corresponding formalin-fixed and paraffin-embedded tumor blocks were retrieved and histologically reviewed for grade [22] and LVI. Re-sectioning and re-staining were carried out when archival sections were missing. When histological review was not possible, data on grade and LVI were extracted from pathology reports. The presence of multifocality (defined as two or more invasive tumor foci separated by at least 1 cm) and ductal carcinoma in situ (DCIS) with extensive growth (defined as >25 % of the tumor consisting of DCIS with intraductal component also beyond the edge of the invasive tumor) was extracted from the original pathology reports.

TMAs were generated for protein expression profiling using IHC. TMA production, IHC staining, slide scanning, and evaluation of outcome were performed in accordance with strategies and standards used in the Human Protein Atlas project [23, 24]. All patients with tumor material available, 983/1120 (88 %), were included in the set of TMAs. For IHC, the following primary antibodies were used: ER (estrogen receptor) 1:150 (M7047, Dako, Glostrup, Denmark), PR (progesterone receptor) 1:1000 (M3569, Dako), Ki67 1:200 (M7240, Dako), and Her2 1:1000 (A0485, Dako). IHC was performed as previously described [25]. In brief, 4 µm sections of the TMA blocks were cut and automated IHC was done using a Lab Vision Autostainer 480 (Thermo Fisher Scientific). The IHC-stained and mounted TMA slides were scanned at ×20 magnification with a ScanScope XT system (Aperio Technologies, Vista, USA). The high-resolution digital images of each tissue core were annotated with respect to the outcome of IHC staining. ER was defined as positive when >1 % of the tumor cell nuclei were positive and PR as positive when >25 % of the tumor cell nuclei were positive. Ki67 was considered high when >20 % of the tumor cell nuclei were positive [26, 27]. Her2 was annotated using Her2 ASCO guidelines [28]. Membrane staining intensity of 3+ was considered positive, while 2+ was further verified through chromogenic in situ hybridization (CISH) to determine Her2-gene amplification [2830]. CISH was performed on an automated Ventana BenchMark ULTRA IHC/ISH Staining Module (Ventana Medical Systems, Inc Tuscon, AZ, USA) using the INFORM HER2 Dual ISH DNA Probe Cocktail. CISH-stained slides were examined under the microscope and the amount of positive Her2 signals scored in tumor cell nuclei. The outcome was scored as Her2 amplified (>6 dots or clusters of positive signal) or non-Her2 amplified (≤6 dots per nuclei).

To define the intrinsic breast cancer subtypes, we used surrogate definitions based on central IHC re-evaluation of ER, PR, Ki67, and Her2 according to the St Gallen consensus statement [27]. Luminal A was defined as ER+, PR+, Her2−, and Ki67 low, Luminal B as ER+, PR+, Her2−, and Ki67 high or ER+, PR−, Her2−, and any Ki67, Luminal-Her2 as ER+ and Her2+, any PR or Ki67, Her2-positive (non-luminal) as ER−, PR− and Her2+, any Ki67 and triple-negative as ER−, PR− and Her2−, any Ki67.

Statistical analysis

Endpoints were BCSS, DDFS, and LRFS. BCSS was calculated using time from diagnosis to death from breast cancer censoring for end of follow-up. DDFS was estimated using time from diagnosis to distant recurrence or death from breast cancer, whichever came first censoring for the end of follow-up. LRFS was calculated using time from diagnosis to locoregional recurrence as first event. Kaplan–Meier curves were used to estimate survival time [31] as death from other causes than breast cancer was uncommon in this population. Survival curves were compared using log-rank test [32]. Cox proportional-hazards models were used to estimate the univariate and multivariate hazard ratios (HR) and 95 % confidence intervals (95 % CI) [33]. All statistical tests were two-sided and p values < 0.05 were deemed significant. All calculations were performed using IBM SPSS Statistics v22.0 (SPSS Inc. Illinois, USA).

Results

Population characteristics

Data on patient and tumor characteristics divided by age group are shown in Table 1. Women <35 years had larger tumors and more often involved lymph nodes than women aged 50–69 years. Fewer women <35 years presented with stage I disease. Women <35 years more often had tumors that were grade III, hormone receptor negative, Her2-positive, and high Ki67. This translates to a lower proportion of the luminal subtypes and a higher proportion of the triple-negative and Her2-positive subtypes among younger women. Multifocal disease, LVI, and the presence of extensive DCIS were more common in women <35 years. Altogether, characteristics in women 35–39 years were similar to those in women <35 years, whereas the characteristics in women 40–49 years group together well with those aged 50–69 years.

Table 1 Patient- and tumor characteristics for women with primary breast cancer stage I–III diagnosed 1992–2005, by age at diagnosis (N = 1120)

Treatment was performed according to the national guidelines for each time period, closely following international practice. Data on treatment by age group are shown in Table 2 and time trends of systemic treatment in relation to age, tumor size, lymph node status, grade, and subtype are presented in Fig. S2.

Table 2 Given treatment for women with primary breast cancer stage I–III diagnosed 1992–2005, by age at diagnosis (N = 1120)

Median follow-up time was 10 years (range 0–20). In the group aged <35 years, 90 of 445 had a locoregional recurrence as first event. The corresponding figures were for women 35–39 years 37 of 190, 40–49 years 27 of 192, and 50–69 years 22 of 293. Distant disease occurred in 169 of 445 women <35 years, in 59 of 190 women aged 35–39 years, in 47 of 192 women aged 40–49 years, and in 42 of 293 women aged 50–69 years.

Univariate analysis

Univariate analyses of risk factors for breast cancer death stratified by age are shown in Table 3. The increased risk of breast cancer death in young versus middle-aged women was significant during the earlier part of the studied period and mainly noted in tumors with favorable characteristics, namely: small tumor size, low grade, Her2-negativity, and no LVI.

Table 3 Univariate analysis of risk factors for breast cancer death by age for women with stage I–III breast cancer diagnosed 1992–2005 (N = 1120)

At 10-year follow-up, the BCSS was for women <35 years 69 % (HR 2.75, 95 % CI 1.93–3.94), for women 35–39 years 76 % (HR 2.33, 95 % CI 1.54–3.52), for women 40–49 years 84 % (HR 1.53, 95 % CI 0.97–2.39), and women 50–69 years 89 % (HR = 1.00 reference) (Fig. 1).

Fig. 1
figure 1

Breast cancer-specific survival by age in a population-based cohort of 1120 women with primary breast cancer stage I–III diagnosed 1992–2005 divided by age <35 years, 35–39 years, 40–49 years, and 50–69 years. Proportion of women surviving at 5, 10, and 15 years from diagnosis. Hazard ratios (HR) of breast cancer death are given with their 95 % confidence intervals (95 % CI). Survival curves are compared by log-rank test

Figure 2 shows BCSS by tumor characteristics and age. Women aged <40 years had a statistically significantly worse survival than women ≥40 years in stages I and IIa (HR 3.03, 95 % CI 1.65–5.57 and HR 2.08, 95 % CI 1.16–3.74), irrespective of tumor grade (grade I; HR 12.25, 95 % CI 1.35–111.17, grade II; HR 1.82, 95 % CI 1.15–2.87 and grade III; HR 1.50, 95 % CI 1.01–2.23), and in the Luminal B subtype (HR = 1.79, 95 % CI = 1.15-2.78). In women <40 years, the best survival was seen in those with Luminal A tumors (10-year BCSS 92 %) while it was markedly worse in the other subtypes (Luminal B 75 %, Her2-positive 68 % (in this analysis Luminal-Her2 and Her2-positive combined), and triple-negative 67 %).

Fig. 2
figure 2

Breast cancer-specific survival by age, stage, grade, and subtype for women with primary breast cancer stage I-III diagnosed 1992–2005 (N = 1120) divided by age <40 years and ≥40 years. Hazard ratios (HR) are given with their 95 % confidence intervals (95 % CI). Survival curves are compared by log-rank test. Proportion of women surviving at 5, 10, and 15 years from diagnosis

Multivariate analysis

In the multivariate analysis (Table 4), successively correcting for year of diagnosis, stage at diagnosis, detection mode, grade, subtype, and systemic treatment, young age (<35 years and 35–39 years) was an independent risk factor in LRFS (HR 2.13, 95 % CI 1.21–3.76 and HR 1.97, 95 % CI 1.06–3.68) but not in DDFS or BCSS.

Table 4 Multivariate analysis of prognostic factorsa affecting breast cancer death, distant disease, and locoregional recurrence by age

To focus on the subpopulation of women where the survival analyses indicated substantial differences between women aged <40 and ≥40 years (Luminal Her-2 negative breast cancer stage I-IIa), we performed a separate multivariate analysis (Fig. 3). Age <40 years was a statistically significant independent risk factor in DDFS (HR 1.87, 95 % CI 1.03–3.44) and in LRFS (HR 4.10, 95 % CI 2.20–7.66), but not in BCSS (HR 1.47, 95 % CI 0.72–3.02).

Fig. 3
figure 3

Forest plot of multivariate Cox regression of risk of event for women with stage I–IIa, estrogen receptor positive, and Her2-negative breast cancer (N = 389) by age <40 years (n = 152) versus reference ≥40 years (n = 237). BCSS Breast cancer-specific survival, DDFS distant disease-free survival, LRFS locoregional recurrence-free survival. Open square Crude, filled square adjusted for diagnostic period, tumor size, lymph node status, grade, subtype (Luminal A or Luminal B), endocrine therapy, and chemotherapy

Discussion

This population-based cohort study included 1120 women with breast cancer stage I–III of which 445 were <35 years at diagnosis with a median follow-up of 10 years. Women aged <35 years and 35–39 years had more advanced stage at diagnosis and a higher proportion of Her2-positive and triple-negative subtypes and less common Luminal A subtype. Women <35 years and 35–39 years received more intense treatment reflecting their stage and subtype distribution. Women <40 years had a worse BCSS compared to women ≥40 years in stage I and IIa, in all tumor grades and in the Luminal B subtype. At multivariate analysis, age remained an independent risk factor in LRFS but not significantly in BCSS or DDFS. In women with luminal early-stage disease, young age was an independent risk factor also of DDFS.

Treatment was given according to national guidelines and best international practice at that time. The number of women <35 years and 35–39 years is large, with detailed data on patient, tumor, and treatment characteristics and follow-up extracted from medical records. The long-term follow-up is nearing completion.

In a central pathology review, we re-evaluated grade and LVI and re-analyzed prognostic markers with modern methods at one single laboratory. Using IHC methods to separate Luminal A from Luminal B tumors has limitations [34]. In this study, we performed new IHC-analyses on archival material to avoid the effects of low intra- and inter-laboratory reproducibility and different antibodies for testing. To validate our results with regard to the arbitrarily set cutoffs, we performed a sensitivity analysis using alternative subtype definitions; grade instead of Ki67, Ki67 cutoff 14 %, ER-positive cutoff >10 % stained nuclei, which did not change the results.

During the study period of 14 years, treatment regimes have changed, and the time trends have not been the same in the compared age groups. More intense treatment was offered to young women and modern regimes were introduced earlier, which might have led to an underestimation of age-related survival differences in the multivariate analyses.

Many studies have shown young women with breast cancer to have a worse prognosis compared to their older counterparts. Our findings demonstrate that the differences in BCSS between age groups diminished over time, and lost significance during the last part of the studied period. In a recent Canadian study, outcomes for young breast cancer patients across two time periods were compared to determine whether the poor prognosis persists in the context of modern adjuvant therapies. There was an improvement in breast cancer outcome over time for all subgroups, but age <40 continued to predict inferior survival despite modern therapies [18]. Published data from The Surveillance, Epidemiology, and End Results program showed improved outcomes for young women with breast cancer over time, however restricted only to women with ER-positive disease [35].

The difference in prognosis between age groups has consistently been reported to be particularly evident in young women with ER-positive tumors [3640]. More recently, the prognostic significance of young age has been shown to be most prominent in the Luminal B subtype [5, 12, 15, 18, 41] even though some reports have indicated an increased risk compared with older women also among young with triple-negative [15, 16] and Her2-positive subtypes [42, 43]. In the present study, women aged <40 had a significantly worse survival only in the Luminal B subtype. Thus, the effect of age seems to vary within tumor subtypes.

Morrison et al. found Luminal B tumors among young women to demonstrate more aggressive features, with significantly lower ER and PR levels, higher Ki67, and p53 overexpression, than in older women with the same subtype. The high proliferation and p53 level, coupled with low ER and PR expression in young women, suggests that these tumors may originate from less-differentiated luminal cells [13].

Using genomic expression analysis, Azim and colleagues could, even after adjustment for subtype, observe remaining genetic differences by age with enrichment of processes related to immature mammary epithelial cells, growth factor signaling, and down-regulation of apoptosis-related genes [5]. Johnson et al. studied age-related gene expression differences within and across breast cancer subtypes. After adjustment for subtype, four key genes for proliferation, invasion, and metastasis persisted, some of which predicted inferior disease-free survival in younger women [43]. Also Liao et al. demonstrated unique genomic signatures differentiating premenopausal breast cancer from postmenopausal breast cancer, with the differences being limited to ER-positive tumors [44].

Whether the age-related biological differences within subtypes fully can explain the worse outcome for young women, or if treatment also plays a major role here, remains unclear. Except for age-related differences in the given treatment, one must also consider age-related differences in compliance to and effect of treatment. In the present study, all women were undertreated by today’s standards, with chemotherapy given to only 76 % of women <35 years and endocrine treatment to those with hormone receptor-positive disease in only 70 %. Ovarian suppression was offered to one-third of the youngest women with hormone receptor-positive tumors. Some authors have found young women to be less compliant with endocrine treatment [4547]. Women with Luminal B breast cancer derive less benefit from endocrine therapy compared to those with Luminal A breast cancer [48], and likewise less benefit from paclitaxel and doxorubicin-containing preoperative chemotherapy compared with HER2-enriched and basal-like breast cancers [4951]. Studies on neoadjuvant chemotherapy in women with luminal tumors have shown women <40 years to have a higher rate of pathological complete response than women >50 years also with positive effect on survival [52]. However, survival differences between young and older women with luminal tumors have been demonstrated also in untreated cohorts [5, 39].

To conclude, the effect of age is modified by tumor subtype. Despite correction for biology and more intense treatment in the young, young age is an independent risk factor for systemic disease in women with early-stage luminal tumors, with a two-fold risk of distant disease. However, current prognostic markers cannot reliably discriminate the young women benefitting from more intense systemic therapy and studies on prognostic markers relevant in the young population, and especially for the Luminal B subtype, are urgently needed. Age remains an important variable in treatment decisions until new relevant predictive markers are found.