Abstract
Background
The Multiple Sclerosis Walking Scale (MSWS-12) was developed to measure the impact of multiple sclerosis on walking. Many other disabling neurological conditions affect patients’ ability to walk, and a generic measure of walking could provide valuable insights into patients’ perceptions in clinical trials and epidemiological studies as well as routine clinical practice.
Objective
To evaluate the clinical usefulness and psychometric properties of the Walking Impact Scale (Walk-12), a modified version of the MSWS-12, in patients with neurological conditions.
Design
A prospective, observational study of 120 consecutive patients admitted for rehabilitation. The Walk-12 was used to measure the impact of neurological disability on walking. Traditional psychometric methods (data quality, scaling assumptions, targeting, reliability, validity and responsiveness) were used to assess the Walk-12. Transition questions were used on discharge to measure perception of change. Outcome was also measured using the timed walk test (TWT), Barthel Index (BI) and Functional Independence Measure (FIM).
Results
For the total group, missing data were few, scaling assumptions were satisfied, and internal consistency was 0.94. Correlations between the Walk-12 and TWT, BI and FIM motor score were moderate (r = −0.58, −0.26, −0.31). Responsiveness of the Walk-12 was high (effect size = 1.12). Relationships between effect size and patients’ and physiotherapists’ opinion of change in walking demonstrated good concordance. Preliminary subgroup analyses indicate satisfactory psychometric properties across different neurological conditions; however, sample numbers in these analyses are small.
Conclusions
In this sample of neurologically disabled patients the Walk-12 was clinically useful and satisfied standard psychometric criteria. This provides preliminary evidence that it may be suitable as a generic measure of walking ability.
Similar content being viewed by others
References
Bergner M, Rothman ML (1987) Health status measures: an overview and guide for selection. Annu Rev Public Health 8:191–210
Bohannon RW, Andrews AW (1990) Correlation of knee extensor muscle torque and spasticity with gait speed in patients with stroke. Arch Phys Med Rehabil 71:330–333
Bohannon RW, Horton MG, Wikholm JB (1991) Importance of four variables of walking to patients with stroke. Int J Rehabil Res 14:246–250
Campbell DT (1959) Recommendations for APA test standards regarding construct, trait, or discriminant validity. Am Psychol:546–553
Cohen J (1992) A Power Primer. Psychol Bull 112:155–159
Collin C, Wade DT, Davies S, Horne V (1988) The Barthel ADL Index: a reliability study. Int Disabil Stud 10:61–63
Cortina JM (1993) What is coefficient alpha? An examination of theory and applications. J Appl Psychol 78:98–104
Cronbach LJ (1951) Coefficient alpha and the internal structure of tests. Psychometrika 16:297–334
Fitzpatrick R, Davey C, Buxton MJ, Jones DR (1998) Evaluating patient-based outcome measures for use in clinical trials. Health Technol Assess 2:1–74
Freeman JA, Hobart JC, Playford ED, Undy B, Thompson AJ (2005) Evaluating neurorehabilitation: lessons from routine data collection. J Neurol Neurosurg Psychiatry 76:723–728
Freeman JA, Playford ED, Nicholas RS, Thompson AJ (1996) A neurological rehabilitation unit: audit of activity and outcome. J R Coll Physicians Lond 30:21–26
Granger CV, Hamilton BB, Keith RA, Zielezny M, Sherwin FS (1986) Advances in functional assessment for medical rehabilitation. Topics Geriatric Rehabil 1:59–74
Hobart JC, Cano SJ, Thompson AJ (2002) What sample sizes for reliability and validity studies. Quality Life Res 11:636
Hobart JC, Freeman JA, Greenwood RJ, McLellan DL, Thompson AJ (1998) Responsiveness of outcome measures: beware the effect of different effect sizes. Ann Neurol 44:519A
Hobart JC, Freeman JA, Lamping DL (1996) Physician and patient-oriented outcomes in progressive neurological disease: which to measure? Curr Opin Neurol 9:441–444
Hobart JC, Lamping DL, Freeman JA, Langdon DW, McLellan DL, Greenwood RJ, Thompson AJ (2001) Evidence-based measurement. Which disability scale for neurologic rehabilitation? Neurology 57:639–644
Hobart JC, Riazi A, Lamping DL, Fitzpatrick R, Thompson AJ (2005) How responsive is the Multiple Sclerosis Impact Scale (MSIS-29)? A comparison with other self-report scales. J Neurol Neurosurg Psychiatry 76: 1539–1543
Hobart JC, Riazi A, Lamping DL, Fitzpatrick R, Thompson AJ (2003) Measuring the impact of MS on walking ability: the 12-Item MS Walking Scale (MSWS-12). Neurology 60:31–36
Hobart JC, Thompson AJ (2001) The five item Barthel index. J Neurol Neurosurg Psychiatry 71:225–230
Holmes WC, Shea JA (1997) Performance of a new, HIV/AIDS-targeted quality of life (HAT-QoL) instrument in asymptomatic seropositive individuals. Quality Life Res 6:561–571
Kazis LE, Anderson JJ, Meenan RF (1989) Effect sizes for interpreting changes in health status. Med Care 27:S178-S189
Korner-Bitensky N, Wood-Dauphinee S, Siemiatycki J, Shapiro S, Becker R (1994) Health-related information postdischarge: telephone versus face-to-face interviewing. Arch Phys Med Rehabil 75:1287–1296
Liang MH (2000) Longitudinal construct validity: establishment of clinical meaning in patient evaluative instruments. Med Care 38:II84–II90
Likert RA (1932) A technique for the measurement of attitudes. Arch Psychol 140:5–55
McGuigan C, Hutchinson M (2004) Confirming the validity and responsiveness of the Multiple Sclerosis Walking Scale-12 (MSWS-12). Neurology 62:2103–2105
McHorney CA, Ware JE Jr., Lu JF, Sherbourne CD (1994) The MOS 36-item Short-Form Health Survey (SF-36): III. Tests of data quality, scaling assumptions, and reliability across diverse patient groups. Med Care 32:40–66
Medical Outcomes Trust (2002) Assessing health status and quality of life instruments. Quality Life Res 11:193–205
Middel B, Stewart R, Bouma J, van Sonderen E, van den Heuvel WJ (2001) How to validate clinically important change in health-related functional status. Is the magnitude of the effect size consistently related to magnitude of change as indicated by a global question rating? J Eval Clin Pract 7:399–410
Nunnally JC (1970) Introduction to psychological measurement. McGraw- Hill, New York
Nunnally JC, Bernstein IH (1994) Psychometric Theory. McGraw-Hill, New York
O’Connor RJ, Cano SJ, Thompson AJ, Hobart JC (2004) Exploring rating scale responsiveness: does the total score reflect the sum of its parts? Neurology 62:1842–1844
Turner-Stokes L, Turner-Stokes T (1997) The use of standardized outcome measures in rehabilitation centres in the UK. Clin Rehabil 11:306–313
Wade DT (1992) Measurement in neurological rehabilitation. Oxford University Press, Oxford
Ware JE, Harris WJ, Gandek B (1997) MAP-R for windows: multitrait/multi-item analysis program: revised user’s guide. Health Assessment Lab, Boston
Ware JE, Snow KK, Kosinski M, Gandek B (1993) SF-36 Health Survey manual and interpretation guide. Nimrod Press, Boston (MA)
Wolfson L, Whipple R, Amerman P, Tobin JN (1990) Gait assessment in the elderly: a gait abnormality rating scale and its relation to falls. J Gerontol 45:M12–19
Acknowledgements
We are grateful to the staff at the Neurorehabilitation Unit who routinely collect outcome data. ROC was supported by grants from the UCLH Trustees. During the writing of this paper JH was on secondment to the School of Education, Murdoch University, Perth, Western Australia. This research attachment was supported by the Royal Society of Medicine through an Ellison-Cliffe Travelling Fellowship and the MS Society of Great Britain and Northern Ireland.
Author information
Authors and Affiliations
Corresponding author
Additional information
Received in revised form: 9 February 2006
Appendices
Appendix 1: Walking impact scale (Walk-12)
These questions ask about limitations to your walking due to your illness during the past two weeks. For each statement, please circle the one answer that best describes your degree of limitation. Please answer all questions even if some seem rather similar to others, or seem irrelevant to you. If you cannot walk at all, please tick this box: □
In the past two weeks, how much has your illness ... | Not at all | A little | Moderately | Quite a bit | Extremely |
---|---|---|---|---|---|
Limited your ability to walk? | 1 | 2 | 3 | 4 | 5 |
Limited your ability to run? | 1 | 2 | 3 | 4 | 5 |
Limited your ability to climb up and down stairs? | 1 | 2 | 3 | 4 | 5 |
Made standing when doing things more difficult? | 1 | 2 | 3 | 4 | 5 |
Limited your balance when standing or walking? | 1 | 2 | 3 | 4 | 5 |
Limited how far you are able to walk? | 1 | 2 | 3 | 4 | 5 |
Increased the effort needed for you to walk? | 1 | 2 | 3 | 4 | 5 |
Made it necessary for you to use support when walking indoors (e.g. holding on to furniture, using a stick, etc)? | 1 | 2 | 3 | 4 | 5 |
Made it necessary for you to use support when walking outdoors (e.g. using a stick, a frame, etc)? | 1 | 2 | 3 | 4 | 5 |
Slowed down your walking? | 1 | 2 | 3 | 4 | 5 |
Affected how smoothly you walk? | 1 | 2 | 3 | 4 | 5 |
Made you concentrate on your walking? | 1 | 2 | 3 | 4 | 5 |
Appendix 2: Analysis plans
Data quality
The Walk-12 data were examined for percentage missing items and the percentage of the sample for whom total scores could be calculated. For responders with missing items, a total score can be calculated if at least 50% of the items (n ≥ 6) have been completed. Each missing item is replaced with an imputed score, the patient specific mean score, which is the mean score across completed items for that patient [35]. The analyses of percentage missing items did not include imputed items.
Scaling assumptions
It has been proposed by others [24, 30], and is generally accepted [27] that a series of criteria should be satisfied for a set of items to be summed, legitimately, to form a total score. We tested the Walk-12 against these criteria, which are:
-
1.
Items should be roughly parallel, that is, measure at the same point on the scale and have similar variance, otherwise they do not contribute equally to the variance of the total score and should be standardised before combination [26]. A set of items is considered parallel when their item response option frequency distributions, and their item mean scores and standard deviations are roughly similar [24].
-
2.
Items should measure the same underlying construct, otherwise it is not appropriate to combine them to generate a total score. A set of items is considered to be measuring the same construct when each item’s corrected item-total correlation, which is the correlation between each item and the total score computed from the remaining items in that scale, exceeds 0.30 [26, 30].
-
3.
Items in the scale should contain a similar proportion of information concerning the construct being measured. This criterion is considered satisfied the corrected item-total correlations exceed 0.30 [34].
Targeting
Targeting refers to the match between the distribution of walking problems in the sample and the range of walking problems measured by the scale. The better this match, the greater the potential for precise measurement. Targeting was evaluated by examining score distributions, skewness statistics and floor and ceiling effects. Floor effects are the percentage of patients scoring 100 (greatest impact on walking) and ceiling effects are the percentage of patients scoring zero (least impact on walking). It is recommended that floor and ceiling effects should be less than 20% each [20].
Reliability
Scales should generate reliable estimates of the construct being measured (internal consistency). Cronbach’s alpha coefficient was used to determine this criterion [8]. Although a range of minimum values has been suggested, it is widely accepted that Cronbach’s alpha should exceed 0.70 [27] or 0.80 for group comparison studies [30].
Validity
Convergent and discriminant construct validity [4] were evaluated by examining the extent to which Spearman’s correlations between the Walk-12 and the velocity and cadence of walking, the BI, and the FIM motor and cognitive scores were consistent with expectations. It was predicted that correlations between the Walk-12 and the velocity and cadence, BI and FIM motor score would be moderate (between 0.3 and 0.7) and exceed the correlation with the FIM cognitive score. It was also expected that the correlations between the Walk-12 and the velocity and cadence (indicators of walking ability) would be greater than the correlations with the BI and FIM motor score (indicators of overall disability).
Responsiveness
Responsiveness was evaluated for the outcome measures using the effect size statistic, calculated as the mean change from admission to discharge divided by the standard deviation of the admission score [21]. Larger effect sizes indicate greater change measured by the scale. Effect sizes were interpreted using Cohen’s criteria, where 0.20 is a small effect, 0.50 is moderate and 0.80 is large [5]. The significance of change scores for the Walk-12 was also assessed using Wilcoxon Signed Ranks test.
Effect sizes were calculated for the sample as a whole, for each of the main diagnostic subgroups, and for patients in each response category of the change in walking ability transition question [28]. This relates the degree of change measured by the scale with patients’ own opinion of their change in walking ability. Similarly, we compared change detected by the Walk-12 with physiotherapists’ opinions of change in walking ability at discharge.
Rights and permissions
About this article
Cite this article
Holland, A., O’Connor, R., Thompson, A. et al. Talking the talk on walking the walk. J Neurol 253, 1594–1602 (2006). https://doi.org/10.1007/s00415-006-0272-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00415-006-0272-2