Development of the Scientific, Transparent and Applicable Rankings (STAR) tool for clinical practice guidelines.
- Author:
Nan YANG
1
;
Hui LIU
2
;
Wei ZHAO
3
;
Yang PAN
4
;
Xiangzheng LYU
5
;
Xiuyuan HAO
6
;
Xiaoqing LIU
7
;
Wen'an QI
8
;
Tong CHEN
9
;
Xiaoqin WANG
10
;
Boheng ZHANG
11
;
Weishe ZHANG
12
;
Qiu LI
13
;
Dong XU
14
;
Xinghua GAO
15
;
Yinghui JIN
16
;
Feng SUN
17
;
Wenbo MENG
18
;
Guobao LI
19
;
Qijun WU
20
;
Ze CHEN
1
;
Xu WANG
21
;
Janne ESTILL
1
;
Susan L NORRIS
22
;
Liang DU
23
;
Yaolong CHEN
1
;
Junmin WEI
24
Author Information
- Publication Type:Journal Article
- MeSH: Reproducibility of Results; Surveys and Questionnaires; Practice Guidelines as Topic; Humans
- From: Chinese Medical Journal 2023;136(12):1430-1438
- CountryChina
- Language:English
-
Abstract:
BACKGROUND:This study aimed to develop a comprehensive instrument for evaluating and ranking clinical practice guidelines, named Scientific, Transparent and Applicable Rankings tool (STAR), and test its reliability, validity, and usability.
METHODS:This study set up a multidisciplinary working group including guideline methodologists, statisticians, journal editors, clinicians, and other experts. Scoping review, Delphi methods, and hierarchical analysis were used to develop the STAR tool. We evaluated the instrument's intrinsic and interrater reliability, content and criterion validity, and usability.
RESULTS:STAR contained 39 items grouped into 11 domains. The mean intrinsic reliability of the domains, indicated by Cronbach's α coefficient, was 0.588 (95% confidence interval [CI]: 0.414, 0.762). Interrater reliability as assessed with Cohen's kappa coefficient was 0.774 (95% CI: 0.740, 0.807) for methodological evaluators and 0.618 (95% CI: 0.587, 0.648) for clinical evaluators. The overall content validity index was 0.905. Pearson's r correlation for criterion validity was 0.885 (95% CI: 0.804, 0.932). The mean usability score of the items was 4.6 and the median time spent to evaluate each guideline was 20 min.
CONCLUSION:The instrument performed well in terms of reliability, validity, and efficiency, and can be used for comprehensively evaluating and ranking guidelines.