Introduction

Cardiorespiratory fitness, assessed as maximal oxygen consumption (VO2max), is a key factor in physical performance (Bassett and Howley 2000) and a strong independent predictor of health and longevity (Blair et al. 1989; Kodama et al. 2009). VO2max is determined during maximal physical effort by indirect calorimetry using special laboratory equipment. The methodology is time consuming, expensive, and dependent on physiological expertise. Furthermore, the VO2max measurements require a maximal effort from an individual, which is not always suitable in the general non-athlete population. Even if the maximal exercise test per se is a relatively safe procedure, there are also a number of contraindications that limits the possibility to evaluate VO2max in many settings (American Thoracic and American College of Chest 2003). To enable some evaluation of cardiorespiratory fitness in a population, in which the direct determination of VO2max is not possible, several tests, such as different step tests (Brouha et al. 1943; McArdle et al. 1972; Bennett et al. 2015), walk tests (Kline et al. 1987; Solway et al. 2001), treadmill (Swank et al. 2001), and cycle ergometer tests (Åstrand and Ryhming 1954; Legge and Bannister 1986; Golding et al. 1989; Hartung et al. 1993; Beekley et al. 2004), have been developed to estimate VO2max from submaximal performance.

In 2012, a submaximal cycle ergometer test (the original Ekblom-Bak test, EB-test2012) was presented, which estimates VO2max based on sex, age, and heart rate (HR) difference between a standard, low standard work rate, and a high individually chosen work rate (Ekblom-Bak et al. 2014). In a mixed population with regard to sex, age, and physical activity status, there was a strong correlation between estimated and actually measured VO2max, r = 0.91, with a corresponding coefficient of variation (CV) of 9.3 %. This was a significantly improved precision level compared with one of the most commonly used submaximal cycle ergometer tests and the Åstrand test (Åstrand 1960), reported by the authors to have a CV of 15 %. This is similar to other validation reports (Jessup et al. 1977; Siconolfi et al. 1982; Ekblom et al. 2007) and to the values found in our study population in the 2012 publication (r = 0.68 and CV 18.1 %) (Ekblom-Bak et al. 2014).

Although the relatively high precision of estimating VO2max by the EB-test2012, there was a strong, significant correlation between the error of measured and estimated VO2max and absolute VO2max level (spearman ρ = 0.42 in the total population, ρ = 0.51 in women, and ρ = 0.81 in men). This means that individuals with high VO2max were underestimated and individuals with low VO2max overestimated with the EB-test2012. Moreover, during the development of the EB-test2012 prediction equation, we found that the size of the study population limited the possibility of sex-specific equations, something that probably would enhance the prediction equation as sex was included as independent variable in the prediction equation. In addition, by including a sample having a greater range of VO2max (currently for the EB-test2012 prediction equation 1.56–3.73 L min−1 in women and 2.75–4.49 L min−1 in men) and age (currently 21–65 years), the test would be applicable on a greater proportion of the general population.

Therefore, in the present paper, we aimed to further develop the EB-test2012 prediction equation by including additional participants to the study population on which the 2012 prediction equation was developed. This enabled us to develop sex-specific prediction equations and to expand the valid age and VO2max range for the test. By identifying individual characteristics and physiological responses associated to VO2max, we also aimed to reduce the estimation bias seen with high VO2max level in the original test. The new prediction equation (EB-testnew) was then validated internally in the model group and in an external cross-validation sample and was also compared with the Åstrand and the EB-test2012.

Materials and methods

Participants

After public announcement and word-of-mouth in the region of Stockholm, Sweden, we included 74 additional participants to the 2012 model group (Ekblom-Bak et al. 2014). This additional sample consisted of 52 men and 22 women, mean age 58.8 (20–86) years, with a mean VO2 max of 3.10 (1.33–5.97) L min−1 and 39.2 (18.0–76.4) mL kg−1 min−1. Inclusion criteria for this group were an age, and VO2max mainly identified to be outside the valid range for the 2012 prediction equation (Ekblom-Bak et al. 2014). These participants were thereafter pooled to the model group of the 2012 prediction equation. The pooled sample was used to create the new prediction equation for the EB-test. The same recruitment method as for the participants to the model group was applied for the cross-validation sample. Recruitment of the latter sample consisted of a mixture of men and women of different ages and fitness levels, to enable cross validation in both the total sample and in subgroups. Inclusion criteria were men and women above 20 years of age, with no known diseases or disabilities. The characteristic of both the new model group (n = 217) as well as the cross-validation sample (n = 115) is presented in Table 1.

Table 1 Characteristics of the study samples (means ± standard deviations)

All participants were free from all the types of diseases that limit the physical work capacity and stated themselves as healthy on the test day. Furthermore, they were not taking any medications that could influence the relationship between HR and VO2. Exclusion criteria were smoking, snuff use, and medication with beta blockers or asthmatic medicine. Participants visited the test laboratory on one occasion to perform the submaximal EB-test and a maximal treadmill test to assess actual VO2max. Before the visit, the participants were asked to refrain from smoking and vigorous physical activity the day before and on the test day, and to not consume a heavy meal less than 3 h before the test. All participants were fully informed about the details of the study and provided written consent.

Submaximal and maximal test

All tests were performed in climate-controlled laboratory environment. When arriving to the test centre, body mass (measured while wearing in light-weight clothes to the nearest 0.1 kg) and height (to the nearest 0.1 cm) were measured. The participants were informed about the test procedure and equipped with a HR monitor (Polar Electro, Kempele, Finland). After individual adjustments of the seat and handlebar of the cycle ergometer and an introduction of the Borg´s scale of perceived exertion (RPE) (Borg 1970), the participant performed an EB-test according to the original 2012 test procedure (Ekblom-Bak et al. 2014). The test was performed on a mechanically braked cycle ergometer (Monark model 828E, Varberg, Sweden). Test procedure included 4 min of cycling on a standard and low work rate of 0.5 kilopond (kp) with a pedal frequency of 60 rpm (≈30 W when 1 W = 6.116 kpm/min), directly followed by 4 min of cycling on a higher individually chosen work rate (aiming at a RPE of ≈14 on the Borg scale). Mean steady-state HR during the last minute on the low and high work rates, respectively, was recorded by taking the mean of the observed HR at 3:15, 3:30, 3:45, and 4:00 min at each work rate. In addition, VO2max was also estimated by the Åstrand test method by applying the work rate and HR of the high work rate to the Åstrand nomogram (Åstrand and Ryhming 1954) and associated age-correction factors (Åstrand 1960). The same way of obtaining Åstrand test results from the EB-test procedure was used in the original publication of the first EB-test prediction equation, and is further described and discussed in the previous article (Ekblom-Bak et al. 2014). Direct measurement of VO2 during the submaximal cycle test was conducted in a subsample (n = 110) in the model group, using a computerised metabolic system (Jaeger Oxycon pro, Hoechberg, Germany) connected to a face mask worn by the participant. Before each test, ambient temperature, humidity, and barometric pressure were measured with built-in automatic procedures and a handheld instrument (HygroPalm, Rotronic, Bassersdorf, Schweiz). Gas analyzers and inspiratory flowmeter were calibrated with the metabolic system’s built-in automatic procedures, where high-precision calibration gases (15.00 ± 0.01 % O2 and 6.00 ± 0.01 % CO2, Air Liquid, Kungsängen, Sweden), and ambient indoor air was used for the gas analyses.

After a short rest, a 5 min warm-up on the treadmill preceded a graded maximal treadmill test to measure VO2max. The individually designed protocol for the VO2max test started off at 1° incline and a velocity corresponding to approximately 60–65 % of the participant’s estimated VO2max (usually the speed that the participant felt comfortable with during the warm-up). The speed increased 1 km/h during the first 3 to 4 min of the test, and thereafter, there was an increase in incline with +1° every minute until voluntary exhaustion. For some of the well-trained participants, running to an incline of 5°–6°, there was an additional increase in speed (+1 km h−1 per minute) to avoid too steep inclination on the treadmill. Direct measurements of VO2 were obtained during the test with the same computerised system as mentioned above (Jaeger Oxycon pro). Criteria for acceptance of the VO2max measurement were levelling off of VO2 despite an increase in speed or incline, a respiratory exchange ratio >1.1, RPE above 16, work time above 6 min, supported by a maximal HR within ±15 beats min−1 (bpm) from age-predicted maximal HR (ref Åstrand Rodahl). A test was accepted as VO2max when a minimum of three out of the five criteria was achieved. In the model group, nine participants were tested but later excluded due to non-fulfilling the requirements for acceptance of test (five participants failed the VO2max test and four participants had non-valid EB test). The corresponding values in the cross-validation group were four excluded participants in total, two with non-valid VO2max test and two with non-valid EB-test.

VO2max (L min−1) and maximal HR (bpm) were recorded into 30 and 5 s epochs, respectively. We have previously shown that there is no mean difference and a small variation (CV: 2.7 %) between test–retest of VO2max according to the above procedure in a mixed population (Ekblom-Bak et al. 2014), indicating no need for a second VO2max test on a separate test day to verify the first accepted measurement.

Development of the new EB-test prediction equation

In the EB-test2012 prediction equation, VO2max is predicted by entering sex (women = 0, men = 1) and age (years), and the difference in HR between the high and low work rate (ΔHR) divided by the difference in work rate between the high and low work rate (ΔPO) into the equation VO2max = 4.98196 − 2.88618 (ΔHR/ΔPO) + 0.65015 (sex) − 0.01712 (age). For the development of ΔHR/ΔPO, see the paper of the EB-test2012 (Ekblom-Bak et al. 2014). In line with the development of the EB-test2012 equation, but now based on the extended model group population, linear regression modelling was used to develop the EB-testnew prediction equation. Apart from performing separate prediction equations for men and women, the natural logarithm of VO2max (ln VO2max) was used for a better fit and for avoiding extrapolation to infinity in both ends of the regression. Since both the ΔPO variable (representing the high work rate) and the steady-state HR obtained on the low, the standard work rate was identified as highly associated with VO2max in both men and women in the EB-test2012 equation (for ΔPO, r = 0.76 in women and 0.79 in men; for HR at the standard work rate, r = −0.57 in women and r = −0.26 in men; p < 0.001 for all); we chose to include these as separate variables in the new model. The model construction is described in detail in the statistical analysis section below.

Validation, cross validation, and comparison groups

The EB-testnew prediction equation presented in the results section was internally validated in the model group as well as in the external cross-validation sample. Moreover, the validity and precision of the estimated VO2max by the EB-testnew was also compared with both the EB-test2012 and estimated VO2max derived from the Åstrand test method. This group is called the comparison group and consisted of only participants from the cross-validation group. Since both the EB-test2012 and the Åstrand test method have narrower valid age and VO2max ranges compared with the new prediction equation, only those fulfilling the prerequisites (i.e., participants aged 20–65 years with an HR on the high work rate applicable to the Åstrand nomogram, namely, 120–170 bpm) for all three tests were included in these comparison analyses (see Table 1 for the characteristics of participants in the validation group, cross-validation group, and comparison group).

VO2max estimated by the EB-test2012 was obtained by entering the corresponding values for each participant into the equation presented above. Results are also reported for quartiles in fitness level for absolute (L min−1) as well as relative (mL kg−1 min−1) VO2max.

Statistical analysis

A multiple linear regression with the forward method (probability of F = 0.05 for entry, and 0.10 for removal) was used, identifying age, ΔHR/ΔPO, ΔPO, and HR at the low standard work rate to be included in the final model as independent predictors of ln VO2max. Since we found significant gender-HR and gender-∆HR/∆PO interaction effects (both p < 0.001) for the prediction of ln VO2max, regressions were performed for men and women separately. The final model was checked for homoscedasticity and equal variance. Tolerance values >0.2 for all independent variables indicated low probability of multicollinearity. The 95 % confidence intervals (95 % CI) for the regression coefficients are given. Paired t tests were used to study the difference between measured and estimated VO2max. Bland–Altman plots, including limits of agreement (LoA), were produced for the cross-validation sample. The coefficient of variation variable (Tables 2, 3, 4) was calculated by dividing the standard deviation (SD) of the difference between measured and estimated VO2max with the mean of the measured and estimated VO2max. The adjusted R 2 and standard errors of the estimate (SEE) were obtained by linear regression, entering measured VO2max as the dependent variable and calculated VO2max as the independent variable. Two-sided statistical significance was set to p < 0.05 for all analyses. The SPSS statistical software version 21.0 (SPSS Inc., Chicago, IL, USA) was used for the statistical analyses.

Table 2 Validity of the EB-testnew equation (in the model group)
Table 3 Cross validation of the EB-testnew equation (in the external cross-validation group)
Table 4 Comparison of the three tests (in the comparison group)

Results

Subject characteristics in the model group, cross-validation group, and the comparison groups are shown in Table 1. The age and VO2max ranges for the participants in the model group were 21–86 years and 1.33–3.94 L min−1 (18.9–61.9 mL kg−1 min−1) in women, respectively, and 20–84 years and 1.67–5.97 L min−1 (23.5–76.4 mL kg−1 min−1) in men, respectively.

New equation based on data from the model group

The final sex-specific regression models (with 95 % CI for the independent variables) for EB-testnew were

Men: ln VO2max = 2.04900 (95 % CI 1.83517–2.26282) − 0.00858 (95 % CI −0.00987 to 0.00728) (age) − 0.90742 (95 % CI −1.11676 to −0.69808) (ΔHR/ΔPO) + 0.00178 (95 % CI 0.00127–0.00228) (ΔPO) − 0.00290 (95 % CI −0.00438 to −0.00141) (HR at standard work rate).

Women: ln VO2max = 1.84390 (95 % CI 1.53151–2.15628) − 0.00673 (95 % CI −0.00812 to 0.00534) (age) − 0.62578 (95 % CI −0.81368 to −0.43789) (ΔHR/ΔPO) + 0.00175 (95 % CI 0.00056–0.00295) (ΔPO) − 0.00471 (95 % CI −0.00674 to −0.00268) (HR at standard work rate).

After entering the corresponding values into the equation, VO2max (in L min−1) was estimated by putting in the obtained value (x) as an exponent in the natural logarithm. R 2 adjusted for the final models were 0.86 in men and 0.83 in women.

Table 2 presents the validity of the new equation in the model group. The systematic error (the difference between measured and estimated VO2max) was assessed in the full sample, as well as in subgroups for age and fitness level (presented as the quartiles for relative as well as absolute VO2max). The systematic error in the different subgroups ranges from an underestimation at the most of 0.14 L min−1 to an overestimation of 0.10 L min−1. The coefficient of variation was 8.7 % in the full sample, ranging between 6.5 and 11.0 % in the subgroups. In addition, the explained variance of the measured VO2max by the estimated value was 91 % in the full sample, and the SEE values were 0.28 L min−1 for the full sample and ranging between 0.20 and 0.31 L min−1 in the subgroups.

The differences between measured and estimated VO2max by the EB-testnew equation in the model group were not associated with maximal HR in women (Spearman ρ = 0.10, p = 0.33) or men (ρ = 0.14, p = 0.13), and not with deviation for age-predicted maximal HR (ρ = −0.17, p = 0.08) in women, but in men (ρ = −0.33, p < 0.001). In both women and men, the differences were associated with VO2max level, ρ = 0.40, p < 0.001, and ρ = 0.25, p = 0.006, respectively.

Cross validation

In the cross-validation sample, the estimation of VO2max by the EB-testnew equation was analysed in the full sample and thereafter stratified into the same subgroups as the model group. Data were found to be homoscedastic. Results are shown in Table 3. The cross-validation analyses showed similar results as in the internal validation sample, with a non-significant mean systematic difference of 0.02 (95 % CI −0.04 to 0.08) and a similar variation, 9.4 %, in the full sample, ranging from −0.09 to 0.16 L min−1 and 7.7 % to 12.8 %, respectively, in the different subgroups. Bland–Altman plots of the estimated and measured VO2max (in L min−1 and mL kg−1 min−1) are given in Figs. 1 and 2, respectively. LoA for absolute values were −0.54 to 0.76 L min−1 in men and −0.61 to 0.44 L min−1 in women. The corresponding values for relative VO2max were −7.2 to 10.3 mL kg−1 min−1 in men, and −9.7 to 7.1 mL kg−1 min−1 in women, respectively. SEE for the absolute values were 0.33 L min−1 for men and 0.27 min−1 for woman, and the corresponding relative values were 4.17 and 4.26 mL kg−1 min−1, respectively.

Fig. 1
figure 1

Bland Altman plot, including limits of agreement, for estimated and measured VO2max (L min−1) in the cross-validation group. Filled dots women. Transparent dots men. Black line mean difference. Dashed line ±1.96 × SD

Fig. 2
figure 2

Bland Altman plot, including limits of agreement, for estimated and measured VO2max (mL kg−1 min−1) in the cross-validation group. Filled dots women. Transparent dots men. Black line mean difference. Dashed line ±1.96 × SD

Comparison group

There were overall larger systematic errors and variations in estimations of VO2max by the EB-test2012 equation and the Åstrand method, compared with the EB-testnew equation (Table 4). Mean error was significantly lower for the EB-testnew compared with the EB-test2012 (p < 0.001). The adjusted R 2 between estimated and measured values was significantly higher in EB-testnew compared with the Åstrand test. Compared with the new test equation, the larger coefficient of variation and SEE and lower R 2 adjusted indicated a larger variability in the EB-test2012 and Åstrand test, respectively. In the full sample and in most subgroups, the difference between measured and estimated VO2max was lower for the new test equation compared with the other two tests.

In addition, difference between measured and estimated VO2max for the EB-testnew and the EB-test2012 equation was correlated with measured VO2max level in the comparison group. The associations between estimation error and VO2max level for the EB-test2012 were ρ = 0.37 in the full sample, and ρ = 0.44 and ρ = 0.83 (all p < 0.01) for women and men, respectively. For the EB-testnew, these correlations were lower and only significant in woman; the corresponding values were ρ = 0.074 (p = 0.53) in the full sample, and ρ = 0.43 (p = 0.004) and ρ = 0.36 (p = 0.069) in women and men, respectively.

Discussion

In the present paper, we have further developed a test, the EB-test presented in 2012, for the estimation of VO2max from submaximal work rates on cycle ergometers by including sex-specific prediction equations and expanding the test’s validity in respect to both VO2max and age range. Compared with the internal validation for the EB-test2012 (9.3 % and 0.30 L min−1), the variation between estimated and measured VO2max and corresponding SEE was lower for the EB-testnew; 8.7 % and 0.28 L min−1. Cross validation in an external sample showed similar improved results. Subgroup analyses in both the internal validation sample and the cross-validation sample showed consistency in accuracy and precision between sexes, age-groups, and quartiles of VO2max. The correlation between the estimation error and VO2max level seen for the EB-test2012 was lower for the EB-testnew in the full sample and especially among men. This reduction may be partly explained by the inclusion of two new variables in the prediction equation, ΔPO variable, and HR at standard work rate.

The objective of the EB-testnew, just as with the EB-test2012, is to provide a simple, time-effective and low-risk way to enable estimation of VO2max in settings outside the physiological laboratory. With the increased valid range for the test, it is now applicable on healthy individuals with wide ranging physical performance capacity. Another advantage of the EB-testnew is the further improved precision of the estimated VO2max compared with measured VO2max, with low variation between sexes and different age and VO2max groups (see Table 4). For example, by applying the variation between measured and estimated VO2max reported in the model group (8.4 % for men and 9.2 % for women), 95 out of 100 male individuals performing the EB-testnew with an actual VO2max of 3.0 L min−1 will be predicted within ±0.49 L min−1 and females within ± 0.54 L min−1. For a male or female with an absolute VO2max of 2.0 L min−1, the corresponding estimated VO2max is within 0.33 and 0.36 L min−1, respectively. However, these results from the EB-testnew rely on that the data for calculation are obtained in line with the description for test method and with the correct equipment (i.e., cycle ergometer), as described earlier. The same test manual and protocol should be applied for the EB-testnew as for the EB-test2012 (Ekblom-Bak et al. 2014), which is already in use. It is only the prediction equation that has been updated. The new equation is sex specific and with a wider age and VO2max ranges, with the inclusion of two additional test variables (the ΔPO variable and HR at the standard work rate) to reduce the estimation error and increase the precision of the estimation of VO2max.

There are a number of already existing submaximal tests for estimation of VO2max, where the participant performs an amount of submaximal work in the form of a step-up exercise, walking, running, or pedalling on a cycle ergometer (Brouha et al. 1943; Åstrand and Ryhming 1954; McArdle et al. 1972; Legge and Bannister 1986; Kline et al. 1987; Golding et al. 1989; Hartung et al. 1993; Swank et al. 2001; Solway et al. 2001; Bennett et al. 2015). Compared with previously described methods, the EB-testnew test has higher R2 than many step tests, which range from 0.22 to 0.90 (Perroni et al. 2013; Åstrand and Ryhming 1954; Santo and Golding 2003; McArdle et al. 1972; Francis and Culpepper 1989; Knight et al. 2014; Chatterjee et al. 2004). Compared with other studies on the validity of the Åstrand test, the EB-testnew test has similar R 2 of 0.90 (Hartung et al. 1993). Validity is slightly higher for maximal treadmill running tests for the determination of VO2max, e.g., Balke test protocol, R 2 0.85 or Bruce test protocol, R 2 0.77 (Pollock et al. 1976). However, maximal tests include an all-out performance, in which VO2max is only one part. Furthermore, maximal running tests are not applicable in many situations, such as when testing older people, patients with orthopedic diagnoses, obesity or people unaccustomed to intense running. Furthermore, relative estimates or agreement, such as the correlation coefficient, are highly dependent on absolute range. In this present paper, we have the same absolute range in VO2max, making R 2 comparisons meaningful. In the comparison group, the adjusted R 2 for EB-testnew was 0.83, and the corresponding value for the Åstrand test method was 0.50.

Absolute measures of agreement indicate similar or slightly better validity for the EB-testnew (SEE: 4.2 mL kg−1 min−1 for men and 4.3 mL kg−1 min−1 for women, respectively) compared with other tests. The previous studies report an SEE of 8.9 mL kg−1 min−1 for the submaximal YMCA cycle ergometer test (Beekley et al. 2004) and for the Åstrand test method SEE from 4.3 mL kg−1 min−1 (Hartung et al. 1993) to 5.7 mL kg−1 min−1 (Cink and Thomas 1981). In the comparison group in the present paper, SEE for relative values based on the Åstrand test method was 5.6 mL kg−1 min−1, compared with the EB-testnew 4.1 mL kg−1 min−1 (data not shown).

The commonly found low accuracy and precision with the run- walk and step tests could be due to the inter-individual variance in morphology, body mass, gait, and mechanical efficiency, which may be induced when individuals perform these types of weight-bearing activities. The use of the cycle ergometers for submaximal testing diminishes some of the mechanical variations in performance, as the previous research has shown rather constant mechanical efficiency in mixed populations (Åstrand and Rodahl 1970).

Another factor influencing the accuracy of submaximal tests is the use of age-predicted maximal HR in the calculations, a postulation where there is large individual variations (Engels et al. 1998). For example, the Åstrand test uses a one-point work rate methodology combined with the above-mentioned assumption regarding maximal HR, which partly explains some of the measurement error (in the present study population r = 0.71 and CV 18.1 %, respectively). In the present test, there are no calculations of maximal HR; therefore, this potential source of error is eliminated. We believe that the inclusion of the ΔPO variable and HR at the low standard work rate in the EB-testnew equation explains some of the reduced variation in the low and high ends of the VO2max range, resulting in a higher precision throughout the valid VO2max span.

Although showing better accuracy and precision compared with the EB-test2012, there is still an estimation error left in the EB-testnew equation with increased VO2max level, albeit non-significant. One possible source of error could be the usage of the same and rather low standard work rate (≈30 W) in a population with wide ranging physical performance capacity. This rate of work may be too low for obtaining full stroke volume and thereby contributes to a variation in HR response at the standard work rate, especially in highly trained subjects (Blomqvist and Saltin 1983). To analyse this, we examined the correlation of percentage of VO2max on the standard work rate and measured VO2 during submaximal cycling (n = 110) and the estimation error. The correlation was ρ = 0.18 (p = 0.061). The percentage of VO2max on the standard work rate was therefore ruled out as a main source of the estimation error. However, other factors, such as blood flow distribution and blood pressure, related to high VO2max level, may influence the estimation error.

Another potential source of error that may influence on the precision of the submaximal test is the reliance on the existence of a linear relationship between VO2 and power output, as previously shown by Åstrand and Rodahl (Åstrand and Rodahl 1970). This notion has been questioned due to later findings of a non-linear relation between VO2 and power output (Zoladz et al. 1995). This non-linearity may affect the ability of a submaximal test to predict VO2max. However, the non-linearity of this relationship seems to be found at high intensities above the anaerobic threshold (Zoladz et al. 1998; Majerczak et al. 2012). Nonetheless, the issue of non-linearity may well be an important factor for the proper execution of the test, stressing the importance of not choosing an individual work rate that is too high.

Strengths and limitations

A strength of the EB-testnew equation is the consistent high accuracy and precision in the external cross-validation sample, as well as in the different subgroups (men and women, different age-groups and a wide range of VO2max levels). This is of particular interest with regard to the fact that the new equation is based on a relatively large and heterogeneous sample, with a wide variation in age and VO2max. We, therefore, believe that the test now is suitable for most non-diseased individuals. The use of sex specific prediction equations, rather than controlling for sex within an equation, has undoubtedly contributed to better precision.

A common issue in studies involving maximal testing is the selection bias, often including more fit individuals than in the general population. This may limit the accuracy of the prediction equation in the general population. One way to analyse the representativeness of the present model group for the general population (with special regard to age), is to compare the decrease in VO2max with age between the model group and previously reported values from general population samples. The age-related decline per decade in measured VO2max for the model group sample in this study was −9.1 % per 10-year, compared with -6.5 % in 10,973 men and women, where VO2max was obtained from maximal testing on cycle ergometer (Eriksen et al. 2015) and −6.9 % in 3678 men and women, where VO2max was obtained from treadmill testing (Loe et al. 2013). The difference in slope may be due to a higher mean fitness among the young participants in the present paper, and may thus express a limited degree of bias. The implication of this bias on the validity of the test may be regarded as low.

Moreover, a submaximal test only estimates VO2max based on variables obtained during the submaximal exercise. Hence, individuals who deviate in physiological characteristics from the individuals included in the model group, for example with an extremely high or low cardiorespiratory fitness, exceptional work efficiency or abnormal HR response, may obtain an estimated VO2max further from their actual VO2max than expected. Furthermore, any medications that may have an influence on HR, also may affect the results from the test. This prediction equation is based on ∆HR, and the ∆HR relation to power output, whereas medications, such as beta block, may alter these relationships. Future research should focus on the feasibility of the EB test in clinical populations. Another scope of interest is the ability of the EB test to detect a change in actual VO2max over time, for example as a consequence of a training intervention. The use of the test in this situation has not yet been evaluated. To date, the recommendation is to keep the same high individually chosen work rate when monitoring an individual over time.

Conclusion

In the present paper, we have further developed an easy administered, non-expensive, and accurate submaximal ergometer test for the estimation of VO2max. The EB-testnew estimates VO2max throughout a wide range of ages and fitness levels, and can be used in health screenings and in research studies in large populations and in the general population.