Introduction

The important factors for decision-making in the treatment of distal radial fractures include the patient’s vitality, fracture classification, fracture dislocation and soft tissue condition. Distal radial fracture deformities are quantified by X-ray measurements which are also used to classify the result after reduction (e.g., good or fair result) [1, 2]. However, the clinical usefulness of X-ray measurements is hampered by their limited reproducibility [3, 4]. It is also recognized that the modest reproducibility of X-ray deformations is one of the keystones for the lack of reproducibility in the treatment of distal radial fractures [35]. Clearly, when measurements cannot be made reliably, a straightforward clinical comparison of treatment techniques or reduction results is impossible, making it difficult to come to level 1 evidence-based treatment results and advices.

Other studies that have evaluated sources of variance in the assessment of the quantitative X-ray measurements have reported difficulties with respect to the determination of the radial axis in the anterior–posterior (AP) view, which is essential to assess the radial height and the radial inclination. This is obviously much less a problem in the lateral view, where the higher reproducibility value was found [6]. A second source of inaccuracy might be the use of the existing normal values for radial length (11 mm), radial angle (22°) and volar angulation (10°) [711], despite the normal variation that exists within the population [12].

To circumvent these difficulties, we designed a technique to assess fracture deformities of the distal radius that is based on the comparison of the injured side to the uninjured side. The approach is based on research findings that indicate that the patients’ left and right side are comparable [13]. The aim of the study was to evaluate the reproducibility of the template assessment technique as compared with the direct quantitative measurement technique. We also examined whether the new technique resulted in a more reliable classification of reduction results for distal radius fractures.

Materials and methods

Inclusion criteria

Patients aged 18 years or above with a unilateral dislocated distal radial fracture with the history of neither a distal radial fracture nor a distal radial operation were included after informed consent was taken. A stratification for the AO fracture types was performed to reflect the prevalence in clinical practice at a ratio of 5:2:3 for the AO type A, B and C distal radial fractures. The standard post-reduction (postero-anterior (PA) and lateral) X-rays and standard distal radius X-rays of the unaffected side of 30 consecutive patients visiting the emergency department were included after stratification for AO fracture type.

Eight experienced observers, 4 trauma surgeons, 3 orthopedic surgeons and 1 trauma radiologist, independently measured the X-rays and subsequently classified the reduction results using the Lidström score (Fig. 1).

Fig. 1
figure 1

Anatomical radiological classification for distal radial fractures according to Lidström

In the first direct measurement round, the x-rays were only measured quantitatively. The radial length and the radial inclination were measured on the posterior–anterior plain X-ray film, while the dorsal/volar angulation was measured on the lateral X-ray film. The measurements were performed with a goniometer and a ruler. The normal values for radial length (11 mm), radial inclination (22°) and volar angulation (10°) were used to assign a Lidström score (Fig. 2).

Fig. 2
figure 2

Normal value measurement of the distal radius

During the second round, scheduled 2 months later, the template measurement technique of comparing the fractured side with the unaffected side was performed by the same observers measuring the X-rays of the same patients. The observers were instructed to assign a Lidström score using the X-rays of the unaffected distal radius as a template for the fractured side (Fig. 3). The templates of the unaffected side were made by the investigators on transparent sheets, by tracing the distal radius, ulna and first carpal row from the unaffected side plain PA X-Ray film on a overlaid transparent sheet. For the lateral film, only the distal radius and the lunate bone from the unaffected side were drawn on to the overlaid transparent sheet. The reference points on the AP templates were the ulna and/or the first carpal row (Fig. 3). On the lateral template, the reference points were the radial shaft/and or the lunatum. The measurements again were done with a goniometer and a ruler. And again the results were used to assign a Lidström score.

Fig. 3
figure 3

Measurements of distal radial fractures using the unaffected side as template

Statistical analysis

Overall inter-observer reproducibility of quantitative X-ray measurements for the difference in radial length, radial inclination and volar angulation for the eight observers was estimated using intraclass correlation coefficients (case 2, 1) [14]. The reproducibility of the ordinal Lidström scoring was quantified using a weighted kappa coefficient (Kw). Since intraclass correlations (ICC) and Kw are computationally equivalent [15], both were interpreted using the arbitrary classification proposed by Landis and Koch [16]: <0.00 poor; 0.00–0.20 slight; 0.21–0.40 fair; 0.41–0.60 moderate; 0.61–0.80 substantial and 0.81–1.00 almost perfect agreement. Calculation of the reproducibility coefficients were carried out considering the pair-wise ratings of all possible combinations of observers. For example, observer 1 and 2, observer 1 and 3, observer 1 and 4, and so on to observer 1 and 8. Then observer 2 and 3, observer 2 and 4, observer 2 and 5 and so on to observer 2 and 8. This was done for each observer to the last observers 7 and 8. For 8 observers and 30 X-rays this resulted in 28 (8 × 7/2) possible pairs times 30 X-rays, totalling 840 pairs of observations. Sample size and number of observers were chosen to accommodate the precision of the reproducibility estimates we wanted to achieve. When the number of observations is 840, a 2-sided 95% confidence interval (CI) for a reproducibility coefficient arbitrarily set at 0.40 will extend 0.05 around the point estimate (from 0.35 to 0.45). Statistical evaluations were carried out using SPSS 11.5 and StatXact 3.02 for Windows.

Results

Thirty patients with distal radial fractures were included. Of these patients, 15 fractures were of AO classification group A, 6 AO classification group B and 9 of AO classification group C.

Table 1, shows the reproducibility coefficients for the direct quantitative X-ray measures and the template assessment technique. Slight and insignificant increases in ICCs (95% CI) were observed for the radial length from 0.53 (0.48/0.57) to 0.54 (0.49/0.59) and volar angulation from 0.60 (0.56/0.64) to 0.64 (0.60/0.68). Marked improvements in reproducibility were observed for the radial inclination measurements. Intraclass correlations increased from 0.36 (0.30/0.41) with the old technique to 0.49 (0.43/0.55) with the template technique. The overall kappa for the Lidström score in the old technique was 0.37 (0.31/0.43) and improved in the template technique to 0.59 (0.52/0.63).

Table 1 Reproducibility coefficients for the X-ray measurements: old direct measurement technique versus template technique (N = 840 observations)

Discussion

Compared to the traditional quantitative technique, the template assessment technique resulted only in an improved inter-observer reproducibility for the radial inclination. There were slight improvements in reproducibility of the radial length and volar angulation. However, there was a notable improvement in the reproducibility of reduction results as assessed using the Lidström score.

The results of the quantative measurements of radial length and radial inclination are comparable to those reported by Kreder et al. They reported an inter-observer reliability coefficient of 0.49 for the radial length, and 0.32 for the AP radial angle [6]. As in our study, a better reproducibility was found for the volar angulation on the lateral view.

In this comparative study we had the same observers and the same patients with a 2-month interval and X-rays presented in another sequence order to overcome possible bias. The patients’ sample was chosen to reflect the daily practice in which the AO type A fractures are more frequent than the Type B and C.

One could argue that the use of normal values for the traditional quantitative measurement technique would negatively affect its reproducibility since there is variation in anatomical proportions between individuals in the population. For example, the ulnar length varies within 30% of the population [12]. Therefore, the use of normal values may lead to inaccurate classification of the reduction result. However, in our study, the comparison with the normal values did not negatively influenc the reproducibility, since all observers used the same values. By doing so, the only source of error left was the direct measurement of the fractured side. When the normal side is used to compare with in the classical measurement way, there would be measurement errors of the fractured side and of the uninjured side.

The improved reproducibility of the fractures deformity classification as expressed by the Lidström score (from 0.37 to 0.59) was primarily due to improvement in measurement of the radial inclination in the template technique. By using the template technique for the Lidström score, one could expect the most improvements in both the radial length and radial inclinations. The volar angulation would not be expected to improve much since it is measured independently from the other side (Fig. 1).

Surprisingly, we could not demonstrate an influence of the template technique for the radial length in this study, although it is partly dependent on the position of the radial axis on the PA view and could therefore benefit from templating. An explanation for this phenomenon could be that for measuring the difference in radial height (as shown in Fig. 3), there is still an axis needed. This becomes more eminent as the radial styloid is displaced more laterally (or rotated) and both tips of the radial styloid are not in line with the radial axis any more. Differences in placing the axis could be a source for variance, leading to a limited improvement in the Kappa value. In contrast, the radial inclination on the PA view in the template technique is basically independent of the radial axis and therefore more easy to measure with the template.

A limitation of the template technique is that it may not be applicable in patients with a bilateral fracture or patients with a history of a distal radial fracture or deformity on the other side. In this study, only patients with a unilateral distal radial fracture were included, reflecting the most common occurrence of distal radial fractures.

Another limitation of this study was that, in order to avoid learning curves, only experienced observers were used, which may not always be the case in daily practice. Also, the templates of the unaffected side were made by the same investigator. In daily practice, making the template could also be a source of variation since the correct position of the templates may be more difficult in case of severe dislocation or with slight differences in projection of the bones due to the angle of the X-ray beam. Perhaps the use of digital subtraction may be helpful to overcome this.

In conclusion, by using the template technique for X-Ray measurement we improved the reproducibility of the classification of radiological dislocation of distal radius fractures which is a step to come to evidence based distal radial fracture treatment as outlined in two recent Cochrane reviews [4, 5]. This procedure can also be performed for other radiological parameters such as ulnar variance. Future research should focus on the improvement of the template technique and the use of digital measurement—subtraction techniques/software modules in this time of filmless X-Ray pictures.