Journal Information
Vol. 24. Issue 2.
Pages 161-166 (01 March 2020)
Download PDF
More article options
Vol. 24. Issue 2.
Pages 161-166 (01 March 2020)
Original Research
DOI: 10.1016/j.bjpt.2019.02.009
Full text access
Inter-day reliability of the Upper Body Test for shoulder and pelvic girdle stability in adults
Alan dos Santos Fontesa,
Corresponding author

Corresponding author at: Av. Gonçalo Rolemberg Leite, 1960, Ed. Gama, apt. 1201, CEP: 49050-370 Aracaju, Sergipe, Brazil.
, Marta Silva Santosa, Marcos Bezerra de Almeidaa, Pedro J. Marínb, Danilo Rodrigues Pereira da Silvaa, Marzo E. Da Silva-Grigolettoa
a Physical Education Department, Universidade Federal de Sergipe (UFS), São Cristóvão, Sergipe, Brazil
b CYMO Research Institute, Valladolid, Spain

  • Assessing the stability of the body may help to determine risk of injuries.

  • The Upper Body Test (UBT) presented good inter-day reliability.

  • Previous familiarization to the UBT is essential for the analysis.

Article information
Full Text
Download PDF
Figures (4)
Show moreShow less
Tables (1)
Table 1. Values of the tests (day 2 and 3), intraclass correlation coefficient (ICC), coefficient of variation (CV), standard error of measurement (SEM) and minimum detectable difference (MDD95%) between the second and third assessment of the Upper Body Test.

There are a limited number of tests for the assessment of shoulder and pelvic girdle stability. Reliable instruments are important to evaluate movement dysfunction at these joints in order to provide more objective parameters.


To evaluate the inter-day reliability of the Upper Body Test in young adults.


A reliability study was carried out with three assessments of the shoulder and pelvic girdle stability within 48-h intervals (Monday, Wednesday, and Friday). The OctoBalance® platform was used to perform the Upper Body Test in 31 active young adults (24.5±8 years). Intraclass Correlation Coefficient (ICC2,2) two-way mixed model, Coefficient of Variation, and Bland–Altman plots were used to verify the reliability of the test. The standard error of measurement (SEM) and the minimum detectable difference (MDD95%) were calculated for clinical applicability.


The Intraclass Correlation Coefficient ranged from 0.87 to 0.94 – Featuring a mean difference of 0.89 (95%CI=−0.19–1.97) to left and 0.95 (95%CI=−0.38–2.27) for the right side, with a low variation coefficient (3.31–5.91%) between the second and third days of assessment. There was a statistically significant difference between the first assessment day and the other test sessions. The Bland–Altman analyses revealed low bias with scores within the limits of agreement. Minimum detectable difference scores were between 4.02 and 5.10, and standard error of measurement between 1.75 and 2.72, depending on the movement side.


The Upper Body Test presented good inter-day reliability for assessing the stability of the shoulder and pelvic girdle in young active healthy adults.

Core stability
Physical tests
Physical therapy
Full Text

Closed kinetic chain exercises for the upper extremity challenge the individuals in terms of strength, speed, mobility and stability.1,2 Stability is the ability of the musculoskeletal structures to maintain a position or to move despite the presence of intrinsic or extrinsic disturbances to the body during the movement.3

Assessing the stability of the shoulder and pelvic girdle may help therapists to determine risk of injuries, and to prescribe exercise programs to individuals with shoulder dysfunction.1,4 Few tests for the assessment of functional movements and stability of the shoulder girdle in closed kinetic chain have been described in literature.5,6 The One-arm Hop test, the Closed-Kinetic Chain Upper Extremity Stability and the Upper Quarter Y-Balance tests have a base of support that limits joint mobility, challenging the assessment of trunk and shoulder girdle stability.1 The Upper Quarter Y-Balance Test guides the trajectory of the movement with plastic tubes. These sets of characteristics limit the observation of compensatory movements, which might have a direct effect on the occurrence of injuries. Therefore, these tests could mislead interpretation. The One-arm Hop test and the Closed-Kinetic Chain Upper Extremity Stability require that the individual quickly reach both left and right body sides, combining closed and open kinetic chain activities. These characteristics are relevant when assessing the plyometric ability of the upper limbs.7

The Upper Body Test (UBT) has been created to meet the need for more accurate assessment of the shoulder girdle stability.2 To apply the UBT, a base of support with two points is provided to the individual, increasing the difficulty of the test and allowing the evaluation of the lumbo-pelvic segment.8

However, the literature lacks studies on the reliability of the UBT for assessing stability of the shoulder and pelvic girdle. The objective of the present study was to evaluate the inter-day reliability of the UBT in the assessment of the shoulder and pelvic girdle stability. We hypothesized that the UBT is a reliable inter-day measurement test to evaluate shoulder and pelvic stability.

MethodsExperimental design

This is a methodological study carried out to test the reliability of the UBT. Measurements were taken in 3 days with intervals of 48h.


Based on a pilot study, sample size was calculated using the G*power program and considering α=0.05, β=0.20, power correlation ratio to null hypothesis of 0.35 and power correlation ratio to alternative hypothesis of 0.80.9,10 At least 29 participants were needed for the study. Considering a potential loss of 20%, 35 university students were recruited. Of those, four missed the second test day, due to personal problems. Therefore, 31 individuals (9 men and 22 women) composed the final sample and completed the three-day assessment.

Inclusion criteria were participants aged 18–45 years, and physically active measured by the International Physical Activity Questionnaire short form.11 Exclusion criteria were: history of orthopedic, neurological or cardiac problems; upper or lower limbs amputation; history of vestibular or physical disorders that could influence joint mobility and muscle tone; under medication that could inhibit the central nervous system; and failure to attend one of the sessions and if the measurement was assessed in a time interval over 48h. The present study was approved by the ethical committee of the Universidade Federal de Sergipe (HU-UFS), São Cristóvão, Sergipe, Brazil under the number 053820/2017 and participants signed a free and informed consent for participation in the study.


Body mass and height were obtained to calculate the body mass index (BMI, kg/m2) and characterize the sample. The OctoBalance® was used to assess the UBT. The device provides direction for performance of movements through colored arrows and a flexible metric scale. These features facilitate the performance of the movement as well as the range of motion achieved in each pattern of movement. The fixed platform is connected through magnets to a mobile part, which contains a scale that measures the distance in millimeters. These magnets are sensitive to changes in movement direction and get detached as it occurs. From the magnet detachment we can infer that the participant performed compensatory movements or changed movement direction. This feature helps understand the movement pattern.

Before the tests were carried out, the length of the upper limbs was obtained with a tape measure graded in millimeters and with the participant in the orthostatic position, shoulders flexed to 90°, elbows extended and wrists hyperextended. The measurement was taken from the acromion of the scapula to the first joint line, after the styloid process of the ulna. Upper limbs length was used to normalize the measurements obtained with the OctoBalance®, providing an assessment of the range achieved adjusted by limb length.1 Following, the participants received instructions about the assessments.

In order to familiarize with the UBT procedures, the participant performed three movements in the superior-medial and inferior-lateral direction to the left and right sides, with a 30-s interval between movements. After one minute of rest, the individuals repeated the same movements but at this time the distance obtained in each movement pattern was recorded.7 Thus, each participant performed 12 movements for each limb (24 total) per assessment day (Fig. 1).

Figure 1.

(A) Illustration of the OctoBalance®; (B) Initial position for assessment (quadruped); (C) Final position for assessment of the left superomedial pattern; (D) Final position for assessment of the left inferolateral pattern.


A physical therapist, with three years of experience, conducted the experiment, and was blinded to scores that were recorded on a form by a research assistant. Prior to the start of each trial, the measuring tape was placed 20cm from the platform of the OctoBalance®. The initial position of the test was standardized with the individual in a quadruped position (Fig. 1B) with the following recommendations: (a) individual in the quadruped position, supported in four points, with hands on the platform of the OctoBalance®; (b) metacarpophalangeal joints positioned on the arrows that were perpendicular to the body of the individual; (c) knees over a thin layer of foam, for protection; (d) hip, knees, ankles and shoulders flexed at 90°.

The participant performed specific movements supported in two points (knee and contralateral upper limb). To evaluate the left superior-medial pattern (Fig. 1C), the individual was asked to perform hip and knee extension on the left side, followed by movement of the right upper limb pushing the movable platform in the direction indicated by the yellow arrow. After this movement, the individual returned to the quadruped position and then, the assessment of the left inferior-lateral pattern (Fig. 1D) was conducted: elevation of the left lower limb followed by right upper limb pushing the platform in the direction indicated by the blue arrow. The assessments followed a specific order, starting with the left side followed by the right side, using the antero-medial and postero-lateral directions of the OctoBalance® (Fig. 1A).

The individuals were instructed to maintain their posture during the assessments. They were asked to avoid trunk rotation, flexion of the elevated lower limb, partial dislocation of the shoulder joint, flexion of the elbow of the supporting limb, and to maintain normal breathing and voluntary activation of the abdominal muscles to ensure trunk position.

Tests were discarded in the following cases: (A) abrupt pushing of the mobile platform; (B) intermittent displacement of the platform; (C) loss of balance during the test or failure to return to the initial supporting position; (D) diagonal displacement of the mobile platform in relation to the standard movement, (E) movement between the fixed base and the movable platforms of the OctoBalance®; (F) exaggerated elbow flexion (approximately 15°) of the supporting limb; (G) loss of hip extension; and (H) elevation of the ipsilateral lower limb to the side that moves the mobile platform.

The index of the UBT for each pattern of movement was calculated by dividing the mean value obtained in each direction by the length of the corresponding limb, and then multiplied by 100.12 The formula is showed below:

where r1, r2, r3 are the scores obtained by the individuals in each direction; and UL the length of the upper limb. The index was obtained for the left and right sides of the body and expressed as percentages in relation to the length of the upper limb.

Statistical analysis

The Shapiro–Wilk test was used to check the normality of the data. Descriptive statistics were expressed as mean and standard deviation (SD). After checking the assumptions of parametric statistics, the scores of the shoulder and pelvic girdle stability in the different assessment days were compared by repeated-measures Analysis of Variance (one-way ANOVA), followed by Bonferroni post hoc test. For the reliability, intraclass correlation coefficient (ICC2,2) two-way mixed model, with 95% confidence intervals and coefficient of variation (CV) were calculated. ICC values <0.25 were considered poor correlation, between 0.26–0.49 low correlation, 0.50–0.69 moderate correlation, 0.70–0.89 high correlation and >0.90 very high correlation.13 Bland–Altman plots were used to verify the agreement between measurements.14 Pearson correlation test was used for trend analyses. The standard error of measurement (SEM) was calculated using the equation: SEM = SD × 1−ICC, where SD corresponds to the standard deviation from day two. The minimum detectable difference (MDD95%) with 95% confidence interval was obtained with the equation MDD95% = 1.96 × 2×SEM.15 All analyses assumed statistical significance at p0.05. Statistical procedures were performed with SPSS® software version 22.0.


The mean age of the participants was 24.8±5 years, body mass of 66.6±13.5kg, height of 1.65±0.08m and body mass index of 24.4±4kg/m2. Statistically significant differences in the superior medial pattern of the left and right sides were observed between the first and second days of assessment (Fig. 2). However, these differences were not observed between the second and third assessment days.

Figure 2.

Upper BodyTest indexes in the three assessment days. Note. (*) Statistical difference (p0.05) in relation to day 1.


As there were significant differences between the first assessment compared to the second and third assessments, only the second and third measures are described in the following analysis. ICCs, CVs, SEM and MDD95% between the second and third assessments are presented on Table 1. High and very high correlation values (ICC 0.87–0.94) and low CV's were observed for all movements. The indicators of reliability and clinical relevance for the global indices of the left and right sides were: MDD95% sides Left 4.02 and Right 5.10.

Table 1.

Values of the tests (day 2 and 3), intraclass correlation coefficient (ICC), coefficient of variation (CV), standard error of measurement (SEM) and minimum detectable difference (MDD95%) between the second and third assessment of the Upper Body Test.

Movements  Day 2  Day 3  Day 3–2  Day 2 and 3
  Mean±SD  Mean±SD  MD (CI 95%)  ICC (CI 95%)  CV  SEM  MDD95% 
L superomedial  92.60±7.7  93.62±8.1  1.02 (−0.89–2.93)  0.87 (0.73–0.94)  5.6%  2.59  7.18 
L inferolateral  84.67±8.3  85.43±9.3  0.76 (−1.08–2.59)  0.91 (0.82–0.96)  5.9%  2.46  6.82 
R superomedial  91.74±7.8  93.09±8.2  1.35 (−0.58–3.28)  0.88 (0.75–0.94)  5.7%  2.70  7.48 
R inferolateral  86.55±7.6  87.10±8.4  0.55 (−1.44–2.53)  0.87 (0.74–0.94)  6.2%  2.72  7.54 
L side  88.63±6.0  89.52±6.5  0.89(−0.19–1.97)  0.94 (0.88–0.97)  3.3%  1.45  4.02 
R side  89.15±6.4  90.09±6.6  0.95(−0.38–2.27)  0.92 (0.83–0.96)  4.0%  1.84  5.10 

Note: L, left; R, right; SD, standard deviation; MD, mean difference.

The agreement between the measurements between the second and third days (Fig. 3) and body sides (Fig. 4) are presented below. A bias close to zero and a range of acceptable agreement between standards were observed. The best indicators (low bias and values within the limits of agreement) were observed for the global analysis for the left and right body sides, with most individuals within the limits of agreement. Only one participant was above the limits of agreement of 95% for the left side (Fig. 4A).

Figure 3.

Bland–Altman plots of differences and averages between the assessed patterns obtained in the UBT: (A) left superomedial (LSM); (B) left inferolateral (LIL); (C) right superomedial (RSM); (D) right inferolateral (RIL).

Figure 4.

Bland–Altman plots for visualization of the differences and means between the sides obtained by the UBT: (A) Left Side (LS); (B) Right Side (RS).


The results indicate that the UBT is a reliable instrument to assess stability of the shoulder girdle and lumbo-pelvic segment. However, familiarization to the UBT is essential before measures are taken.

High ICC values were found between the second and third assessments. The values were higher for the left and right sides of the body. Bland–Altman plots confirmed the reliability of the measures, showing a bias close to zero for most of the participants. Only one individual extrapolated the limits of agreement to the left side. The other individuals are within the limits of agreement. Thus, it is possible to suggest that there is good agreement between the attempts. However, we were unable to compare the values obtained in the present study with other data, because reliability studies with the UBT are scarce.

Similar studies assessing the functional performance of the upper limbs, shoulder and pelvic girdle stability using the Upper Quarter Y-Balance Test in active adults presented reliable results.1,7 However, the minimum differences detected in those studies (medial 8.1; superolateral 6.4; inferolateral 6.1)1 were similar than those found with the UBT – measures lower than 7.54 for movement patterns, and lower for the body sides (4.02 for the left and 5.10 for the right side). These results indicate greater sensitivity of the UBT, permitting observation of slight changes between two assessments. It is important to highlight the limitations to visualize compensatory movements in the Upper Quarter Y-Balance Test due to the presence of a plastic tube.

MDD95% values differ from reproducibility tests but are important for clinical practice, since indicate the minimum difference detected between two assessments that are not attributable to measurement error.16,17 Thus, if the UBT is used on a post-intervention and the scores found were below the MDD95% presented on Table 1, the interpretation would be no clinical improvement because the value would be within the margin error of the instrument. On the other hand, scores higher than the MDD95% suggested in this study means functional improvement.17

The Closed-Kinetic Chain Upper Extremity Stability Test measures movements of the body sides simultaneously, limiting the comparison between sides.7,18 The method used in the present study permits free and full range of motion, the detection of compensations by synergistic muscles and independent comparisons between the performance of the left and right sides.19 These attributes associated with the results of the present study favor the selection of UBT.

Stability tests aim to differentiate levels of physical performance (school, university and at professional levels) among individuals, and identify thresholds for risk of injuries.20 For health professionals, it may help in the prescription of exercises for stabilization and reducing the mechanical overload on the spine.20 In addition, it has been shown that adolescent athletes who reported shoulder pain had deficits in stability tests.21 Therefore, upper limbs test in closed kinetic chain with the UBT characteristics may potentially function as a differential method between an injured limb and an asymptomatic limb. This type of comparison is necessary to evaluate rehabilitation programs, to provide indicators for the return to the activity, or to suggest biomechanical adjustments to improve performance.4 Thus, future studies with UBT should include healthy and symptomatic individuals to provide parameters for treatment of injured upper limbs.

Finally, the familiarization measure is necessary to reduce the learning effects of the practice. Previous studies22,23 have shown that this need is greater for exercises with high complexity, and high activation of the core muscles, as in the case of the movements used by UBT.

One limitation of the study was the impossibility to test the criterion validity of the instrument due to the lack of a gold standard measure. In addition, the results are limited to other populations, like older adults, children and people with injuries, emphasizing the need for studies in other populations. Lastly, future investigations should also use anchor methods to determine the clinical relevance to the UBT.


The Upper Body Test presented satisfactory reliability for the assessment of shoulder and pelvic girdle stability in asymptomatic and physically active adults. However, we recommend at least two test sessions for good reliability results.

Conflicts of interest

The authors declare no conflicts of interest. Except Pedro J. Marín who is the creator of the OctoBalance instrument.

P.P. Gorman, R.J. Butler, P.J. Plisky, K.B. Kiesel.
Upper Quarter Y Balance Test: reliability and performance comparison between genders in active adults.
J Strength Cond Res, 26 (2012), pp. 3043-3048
G.S. Bullock, N. Brookreson, A.M. Knab, R.J. Butler.
Examining fundamental movement competency and closed-chain upper-extremity dynamic balance in swimmers.
J Strength Cond Res, 31 (2017), pp. 1544-1551
F.J. Vera-García, D. Barbado, V. Moreno-Pérez, S. Hernández-Sánchez, C. Juan-Recio, J.L.L. Elvira.
Core stability. Concepto y aportaciones al entrenamiento y la prevención de lesiones.
Rev Andal Med Deporte, 8 (2015), pp. 79-85
S. Contemori, A. Biscarini, F.M. Botti, D. Busti, R. Panichi, V.E. Pettorossi.
Sensorimotor control of the shoulder in professional volleyball players with isolated infraspinatus muscle atrophy.
J Sport Rehabil, 12 (2017), pp. 1-29
J. Cramer, M. Quintero, A. Rhinehart, et al.
Exploration of score agreement on a modified upper quarter y-balance test kit as compared to the upper quarter y-balance test.
Int J Sports Phys Ther, 12 (2017), pp. 117-124
V.M. De Oliveira, A.C. Pitangui, V.Y. Nascimento, H.A. Da Silva, M.H. Dos Passos, R.C. De Araújo.
Test–retest reliability of the Closed Kinetic Chain Upper Extremity Stability Test (CKCUEST) in adolescents: reliability of CKCUEST in adolescents.
Int J Sports Phys Ther, 12 (2017), pp. 125-132
R.B. Westrick, J.M. Miller, S.D. Carow, J.P. Gerber.
Exploration of the y-balance test for assessment of upper quarter closed kinetic chain performance.
Int J Sports Phys Ther, 7 (2012), pp. 139-147
M.P. García-Vaquero, J.M. Moreside, E. Brontons-Gil, N. Peco-González, F.J. Vera-Garcia.
Trunk muscle activation during stabilization exercises with single and double leg support.
J Electromyogr Kinesiol, 22 (2012), pp. 398-406
M.M. Shoukri, M.H. Asyali, A. Donner.
Sample size requirements for the design of reliability study: review and new results.
Stat Med, 13 (2004), pp. 251-271
G. Shieh.
Sample size requirements for the design of reliability studies: precision consideration.
Behav Res Methods, 46 (2014), pp. 808-822
S.M. Matsudo, T. Araújo, V.R. Matsudo, et al.
Questionário Internacional de Atividade Física (IPAQ): estudo de validade e reprodutibilidade no Brasil.
Rev Bras Ativ Fís Saúde, 6 (2001), pp. 5-18
H. Myers, M. Poletti, R.J. Butler.
Difference in functional performance on the upper-quarter y-balance test between high school baseball players and wrestlers.
J Sport Rehabil, 26 (2017), pp. 253-259
E. Domholdt.
Physical Therapy Research.
W.B. Saunden Company, (1993),
J.M. Bland, D.G. Altman.
Statistical methods for assessing agreement between two methods of clinical measurement.
Lancet, 327 (1986), pp. 307-310
C.B. Terwee 1, S.D. Bot, M.R. de Boer, et al.
Quality criteria were proposed for measurement properties of health status questionnaires.
J Clin Epidemiol, 60 (2007), pp. 34-42
S.M. Haley, M.A. Fragala-Pinkham.
Interpreting change scores of tests and measures used in physical therapy.
Phys Ther, 86 (2006), pp. 735-743
M. Paço, E. Cruz.
Fiabilidade intra-observador, erro de medida e mudança mínima detectável do weight-bearing lunge-test e do teste de deslizamento posterior do astrágalo em indivíduos com história de entorse do tornozelo.
ifisioonline, 2 (2011), pp. 25-31
T.G. Goldbeck, G.J. Davies.
Test–retest reliability of the closed kinetic chain upper extremity stability test.
J Sport Rehabil, 9 (2000), pp. 35-45
O. Gonzalo-Skok, J. Serna, M.R. Rhea, P.J. Marín.
Relationships between functional movement tests and performance tests in young elite male basketball players.
Int J Sports Phys Ther, 10 (2015), pp. 628-638
R.J. Butler, C. Southers, P.P. Gorman, K.B. Kiesel, P.J. Plisky.
Differences in soccer players’ dynamic balance across levels of competition.
J Athl Train, 47 (2012), pp. 616-620
V.M.A. Oliveira, A.C.R. Pitangui, M.R.A. Gomes, H.A.D. Silva, M.H.P.D. Passos, R.C. Araújo.
Shoulder pain in adolescent athletes: prevalence, associated factors and its influence on upper limb function.
Braz J Phys Ther, 21 (2017), pp. 107-113
R.M. Williams, M.A. Corvo, K.C. Lam, T.A. Williams, L.K. Gilmer, T.C. McLeod.
Test–retest reliability and practice effects of the stability evaluation test.
J Sport Rehabil, 26 (2017),
M. Moya-Ramón, C. Juan-Recio, D. Lopez-Plaza, F.J. Vera-Garcia.
Dynamic trunk muscle endurance profile in adolescents aged 14–18: normative values for age and gender differences.
J Back Musculoskelet Rehabil, 31 (2018), pp. 155-162
Copyright © 2019. Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia
Brazilian Journal of Physical Therapy

Subscribe to our newsletter

Article options
en pt
Cookies policy Política de cookies
To improve our services and products, we use "cookies" (own or third parties authorized) to show advertising related to client preferences through the analyses of navigation customer behavior. Continuing navigation will be considered as acceptance of this use. You can change the settings or obtain more information by clicking here. Utilizamos cookies próprios e de terceiros para melhorar nossos serviços e mostrar publicidade relacionada às suas preferências, analisando seus hábitos de navegação. Se continuar a navegar, consideramos que aceita o seu uso. Você pode alterar a configuração ou obter mais informações aqui.