| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
(Stroke. 2004;35:721.)
© 2004 American Heart Association, Inc.
Original Contributions |
From the School of Occupational Therapy, College of Medicine, National Taiwan University, Taipei (I-P.H., C-L.H.); Department of Psychology, National Chung Cheng University (W-C.W.), Chia-Yi, Taiwan; and Department of Psychology, DePaul University, Chicago, Ill (C-F.S.).
Correspondence to Ching-Lin Hsieh, PhD, School of Occupational Therapy, College of Medicine, National Taiwan University, 7 Chung-Shan S Rd, Taipei 100, Taiwan. E-mail mike26{at}ha.mc.ntu.edu.tw
| Abstract |
|---|
|
|
|---|
Methods A total of 245 patients at 1 year after stroke participated in this study. The BI and FAI were administered to the patient and/or the patients main caregiver by interview.
Results The initial Rasch analysis indicated that the middle scoring points for many items of the BI and FAI could be collapsed to allow only dichotomous response categories. All but 2 items of the FAI, social occasions and walking outside, fitted the models expectations rather well. These 2 items were excluded from further analysis. A factor analysis performed on the residuals of the Rasch-transformed scores recovered no dominant component. These results indicate that the combined 23 dichotomous items of the BI and FAI assess a single unidimensional ADL function.
Conclusions A clinically useful assessment of the comprehensive ADL function of patients at or later than 1 year after stroke can be obtained by combining the items of the BI and FAI (excluding 2 FAI items) and simplifying the responses into dichotomous categories. It is also demonstrated that the items of the new scale measure comprehensive ADL function as a single unidimensional construct when assessed at 1 year after stroke.
Key Words: activities of daily living cerebrovascular accident disability evaluation
| Introduction |
|---|
|
|
|---|
Previous studies using factor analysis found that the Barthel Index (BI) (measuring basic ADL) and the Frenchay Activities Index (FAI) (measuring IADL) assessed different constructs in patients with stroke.4,5 Because the BI and FAI supplemented each other with a minimum of overlap in content, it was proposed that they be combined to assess comprehensive ADL function.4,5 However, the main question in combining items of ADL and IADL indices is whether these items together contribute to a unidimensional construct, which is necessary to justify the summation of scores.8 Furthermore, factor analysis operates on interval scale scores,9 whereas the BI and FAI scores are ordinal by nature. Thus, the results of studies using factor analysis to examine whether items of both indices could be combined to measure a single construct were inconclusive.
Rasch analysis transforms ordinal scores to the logit scale and thus to an interval-level measurement.10,11 Furthermore, the Rasch model can examine whether items from a scale measure a unidimensional construct.10,11 If items do not fit the models expectation, unidimensionality is not preserved.10,11 Therefore, Rasch analysis is a useful method to validate the unidimensional construct of all the items of the BI and FAI and to obtain an objective, interval scale.10,11 Appropriateness of scoring points refers to whether or not participants can be differentiated by their responses as clearly as the points allow.12 The items of the BI are either dichotomous or on a 3- or 4-point scale; those of the FAI are on a 4-point scale. Recent studies13,14 have shown that a higher number of scoring points may not lead to finer differentiation of the participants. Given that the BI has been found to have a notable ceiling effect4,5 and that the FAI demonstrated a floor effect,4,5 it is of interest to examine whether or not the scoring points of these 2 indices are appropriate. Such an investigation can also be conducted by performing a Rasch analysis on the scores of these indices.12 This study used Rasch analysis to (1) investigate the appropriateness of the scoring points of the BI and FAI and (2) examine whether all the items of the BI and FAI contribute to a unidimensional construct in patients with stroke living in the community.
| Subjects and Methods |
|---|
|
|
|---|
Procedure
Data were collected at 1 year after stroke. An occupational therapist administered the BI and FAI to the patient and/or the patients main caregiver by face-to-face or telephone interview. The therapist tried to obtain the actual performance of the patients in daily life from all the informants available.
Measures
Two dichotomous, six 3-point, and two 4-point items constitute the BI.16 The FAI, which was developed to assess social activities or lifestyle after stroke,17 consists of fifteen 4-point items. Stroke severity at admission to the hospital was determined by the Canadian Neurological Scale.18 All 3 scales have been shown to be valid and reliable.1822
Data Analysis
The analysis consisted of 2 parts. First, the appropriateness of the scoring points in each item of the BI and FAI was investigated with the use of the Rasch partial credit model.10 The scoring points of the items of both indices were reorganized if they were found to be inappropriate. Second, the unidimensionality of the combined BI and FAI scale with appropriate categories was examined by Rasch analysis. The WINSTEPS computer program23 was used to perform the Rasch analysis.
The partial credit model can be used for polytomous items such as those of the BI and FAI. We examined the order of the step difficulties of each item of both indices to see whether their scoring points could be simplified.
To examine unidimensionality of the combined scale, infit and outfit statistics were used to examine whether the data fit the models expectation. The infit mean square (MNSQ) is sensitive to unexpected behavior affecting responses to items near the persons proficiency measure; the outfit MNSQ is sensitive to unexpected behavior by persons on items far from the persons proficiency level.10,24 MNSQ can be transformed to a t statistic, termed the standardized Z value (ZSTD), which follows approximately the t or standard normal distribution when the items fit the models expectation. In this study items with both infit and outfit ZSTD beyond ±2 were considered poor fitting.
When items fit the models expectation, the residuals (observed scores minus expected scores) should be randomly distributed. A factor analysis was conducted to verify whether any dominant component existed among the residuals. The unidimensionality assumption held if no dominant component was found.25
The sufficiency of the BI and FAI combined was verified by examining whether there were substantial gaps between the item difficulties along the item hierarchy.
| Results |
|---|
|
|
|---|
|
Appropriateness of Level of Scaling
All but 2 items (social occasions and walking outside) fitted the models expectation fairly well, indicating that these items measured a single construct for patients at or after 1 year after stroke. Table 2 shows that all but 2 of the 23 polytomous items of both indices exhibited disordering of the step difficulty (ie, that the difficulty of a higher step was lower than that of its adjacent lower step), indicating that the middle categories were never the most likely responses of any patient and were thus redundant. Accordingly, the response categories were collapsed into a dichotomy. The highest score category of the items in the BI was rescored as 1, indicating full independence in the activity. The rest of the categories were rescored as 0, indicating dependence in the activity. For the FAI, all but the lowest score category (0) of the items were collapsed into a category and rescored as 1, indicating participation in the activity. The lowest score category of the items in the FAI remained 0, indicating no participation in the activity.
|
Unidimensionality of Simplified Scale
When the Rasch dichotomous model was applied to examine the 25 dichotomous items together, 2 misfit items (social occasions and walking outside) had outfit ZSTD statistics of 2.9 (MNSQ=9.90 and 4.76, respectively) and infit ZSTD statistics of 3.4 (MNSQ=1.37) and 5.3 (MNSQ=1.86), respectively, which were statistically significant at the 0.05 level. These 2 items were excluded from the rest of the analyses. Table 3 shows that none of the outfit ZSTD was statistically significant for the remaining 23 items. The infit ZSTD had slightly more variations (SD=1.7) than the outfit ZSTD (SD=0.5), indicating that there was some degree of unexpected behavior affecting responses to items near the persons ability measure. However, because they were not very far away from the critical point ±2 or ±3 (more liberal) and their corresponding outfit ZSTD statistics were not significant, those items with slightly extreme infit ZSTD were not treated as poor-fitting items. A factor analysis on the residuals of Rasch-transformed scores showed no dominant factors. The first and second factors accounted for only 12.3% and 8.3% of the residual variance, respectively. These results indicated that there was a good model-data fit and that the assumption of unidimensionality held for these 23 dichotomous items when assessed at 1 year after stroke.
|
We further examined the psychometric properties of the 23-item scale. The person measures (ranging from -9.46 to 6.04) had a mean of -1.17 and a SD of 3.94. The person reliability was 0.94 (which can be similarly interpreted as Cronbachs
), indicating that these items yielded very precise estimates for the patients. The person reliability was practically identical to that under the partial credit modeling (0.93), ie, collapsing categories did not reduce the person reliability. The person measures under the Rasch dichotomous model were also highly correlated with those under the partial credit model (r=0.97), indicating that collapsing the categories did not substantially alter ADL score in the patients.
The Figure shows the map of persons and items along the continuum. The item difficulties were spread out across the patients, indicating that these items could well differentiate these patients. The Figure also shows that there were obvious gaps between items BI3 (grooming) and BI8 (transfer), between BI9 (mobility) and BI4 (dressing), between BI2 (bathing) and BI10 (stairs), and between FAI6 (local shopping) and FAI10 (driving a car/bus travel).
|
A conversion table showing combined BI and FAI raw scores (23 items) and their Rasch-transformed scores (ie, person measure in logit) is shown in Table 4. It allows prospective users to easily derive the Rasch-transformed scores from the raw scores.
|
| Discussion |
|---|
|
|
|---|
There are some advantages to using the combined scale. The items of the BI are located in the lower part of item difficulty, indicating easier activities, and those of the FAI are located in the upper part, indicating more difficult activities. The total score of the combined scale is useful in showing whether the subject has basic ADL problems or IADL problems. A conversion table was compiled showing combined BI and FAI raw scores (0 to 23) and their Rasch-transformed scores. The Rasch-transformed scores can be viewed as interval-level measurements,24 whereas the combined BI and FAI raw scores are ordinal data. Since most statistical techniques assume that the data are at least on an interval scale, the Rasch-transformed scores of the combined scale are recommended for future applications. In addition, both the original BI and FAI and their scoring guidelines19,22 as used in this study can be employed as usual. To use the new scale proposed in this article, the users can first obtain the combined ADL score for a patient by adding the dichotomized scores of all but the 2 poor-fitting items and then use the conversion table to look up the corresponding Rasch-transformed interval score.
There have been few empirical investigations of the appropriateness of scoring points of the BI and FAI. The items in both indices contain 2 to 4 scoring points. We found that the middle categories of both indices provide little information about the patients and are redundant. Thus, the response categories of all the items were recoded as dichotomous in this study. Given that the psychometric properties of the dichotomous items were equivalent to those of the polytomous items and that a scale with only dichotomous items is much more convenient and efficient to administer, the dichotomous categories are recommended for use in both indices for patients at or after 1 year after stroke.
Tennant et al26 also used Rasch analysis to examine the dimensionality of the BI and found that the item "bladder control" did not fit the models expectation. The discrepancy between their findings and our findings may be explained by the different scoring guidelines of the BI used in the respective studies. We adopted the scoring guidelines suggested by Wade and Collin,22 whereas they used those from Shah et al.27 Therefore, the results from this study should be interpreted with caution when other scoring guidelines are used.
Cultural or environmental factors may explain why the items "social occasions" and "walking outside" did not fit the model well.28,29 In addition, all the patients were assessed at 1 year after stroke and had been discharged from the hospital >6 months previously. Our results may not be broadly applicable to patients with different characteristics (eg, patients at a subacute stage or those who have just been discharged from the hospital).
The combined scale can be further improved by generating items to fill the gaps among pairs of current items (eg, grooming and transfer). These new items will be able to differentiate the ability levels of patients falling in the gaps of the current scale. Future studies to examine utility and other psychometric properties (eg, interrater reliability and responsiveness) of the combined scale in stroke patients are needed. Furthermore, it may be promising to extend the BI and FAI into an item bank and to use computerized adaptive testing to further elaborate ADL measurement.
In summary, this study provides good evidence for a unidimensional construct of the dichotomized and combined BI and FAI, with the exclusion of 2 FAI items. The results imply that these 2 indices can be combined to assess comprehensive ADL function in patients after 1 year after stroke.
| Acknowledgments |
|---|
Received August 12, 2003; revision received November 23, 2003; accepted December 2, 2003.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
Y.-W. Hsieh, I-P. Hsueh, Y.-T. Chou, C.-F. Sheu, C.-L. Hsieh, and G. Kwakkel Development and Validation of a Short Form of the Fugl-Meyer Motor Scale in Patients With Stroke Stroke, November 1, 2007; 38(11): 3052 - 3054. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. F Marchetti and S. L Whitney Construction and Validation of the 4-Item Dynamic Gait Index Physical Therapy, December 1, 2006; 86(12): 1651 - 1660. [Abstract] [Full Text] [PDF] |
||||
![]() |
I-P. Hsueh, W.-C. Wang, C.-H. Wang, C.-F. Sheu, S.-K. Lo, J.-H. Lin, and C.-L. Hsieh A Simplified Stroke Rehabilitation Assessment of Movement Instrument Physical Therapy, July 1, 2006; 86(7): 936 - 943. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. van Hartingsveld, C. Lucas, G. Kwakkel, and R. Lindeboom Improved Interpretation of Stroke Trial Results Using Empirical Barthel Item Weights Stroke, January 1, 2006; 37(1): 162 - 166. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Wang, I-P. Hsueh, C.-F. Sheu, and C.-L. Hsieh Discriminative, Predictive, and Evaluative Properties of a Trunk Control Measure in Patients With Stroke Physical Therapy, September 1, 2005; 85(9): 887 - 894. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Stroke Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 2004 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |