1.Psychometric properties of self-report questionnaires in evaluating blended learning in health science university students: A systematic review
Valentin C. Dones III ; Maria Teresita B. Dalusong ; Donald G. Manlapaz ; Juan Alfonso S. Rojas ; Ma. Bianca Beatriz P. Ballesteros ; Ron Kevin S. Flores ; Kaela Celine C. Hor ; Jose Angelo D. Monreal ; Audrey Marie A. Narselles ; Jose Joaquin R. Reyes ; Lianna Andrea B. Sangatanan
Acta Medica Philippina 2025;59(Early Access 2025):1-14
BACKGROUND
Due to the COVID-19 outbreak, schools had to switch online. The sudden transition to blended teaching and learning (BTL) poses challenges for students and teachers, especially in health science programs that require hands-on practical experience. The validity, reliability, and responsiveness of the self-report questionnaires (SRQs) used to evaluate BTL should be established to ensure that their results are accurate and reflect what each SRQ is intended to measure.
OBJECTIVES
This study critically appraised, compared, and summarized the psychometric properties of SRQs evaluating BTL among health science university students. This review determined the SRQs' reliability, internal consistency, various forms of validity (content, criterion, construct), and responsiveness.
METHODS
Following a 10-step procedure based on COSMIN guidelines, we conducted a systematic review of SRQs used by health science university students to evaluate blended teaching and learning. Studies were eligible if they reported psychometric properties of SRQs related to blended learning among university health science students; exclusions included studies focusing on perceptions, attitudes, self-efficacy, and satisfaction, as well as articles such as biographies, editorials, and conference materials. Searches covered multiple electronic databases until April 26, 2023, including PubMed, EMBASE, Web of Science, MEDLINE (OVID), PsycInfo, CINAHL, EBSCOHOST, ERIC, Scopus, Science Direct, Google Scholar, JSTOR, Acta Medica Philippina, Philippine Journal of Health Research and Development, and HERDIN, managed through Zotero. Two independent reviewers performed database searches, title and abstract screening, and full-text evaluations, with a third reviewer resolving any disputes. The COSMIN Risk of Bias Checklist was employed to evaluate included studies on the development and various measurement properties of SRQs. The reviewers assessed SRQ standards, including validity, reliability, internal consistency, measurement error, responsiveness, interpretability, and feasibility. Data extraction and result tabulation were independently completed, with content comparison by two health education experts. This evaluation categorized the SRQs into three quality and validity levels.
RESULTS
The study examined five articles; four were rated as 'doubtful' and one as 'inadequate' in the overall development of SRQ. All four 'doubtful' studies demonstrated questionable content validity when university students were asked about the questionnaire's relevance, comprehensiveness, and comprehensibility. Only half of these studies achieved an 'adequate' rating for content validity based on expert opinions on relevance and comprehensiveness. All but one study scored from 'very good' to 'adequate' in structural validity. Three out of the four studies scored a very good rating for internal consistency, while one was deemed 'inadequate' in internal consistency, cross-cultural validity, and reliability. Three out of four studies scored 'very good' on construct validity, but all overlooked criterion validity and responsiveness. Conducted in various locations, including Australia, Romania, Turkey, and Taiwan, these studies highlighted both common characteristics and limitations in questionnaire development according to the COSMIN guidelines. Four studies were deemed reliable and valid for BTL constructs (Category A); Wu et al. requires further validation (Category B). Study limitations included heterogeneity in populations, settings, and questionnaire versions, potential subjective bias in SRQ content comparison, and the evolving nature of SRQs in blended learning contexts.
CONCLUSION
The systematic review reports the development and evaluation of SRQs for BTL while identifying gaps in their applicability to health science programs. The Blended Learning Scale (BLS) of Lazar et al. and the Blended Learning Questionnaire (BLQ) of Ballouk et al. showed an 'adequate' rating for content validity. BLS revealed very good structural validity, internal consistency, and adequate content validation. Although the BLQ lacked Confirmatory Factor Analysis, it yielded valuable constructs for evaluating health sciences students' experiences in BTL. Both tools require improvements on recall period, completion time, interpretability, and feasibility. The review underscores the necessity for continuous assessment and enhancement of such instruments in BTL, advocating a rigorous scale development process. Furthermore, it encourages the customization of teaching and learning evaluation tools to suit specific institutional contexts while promoting further validation of these questionnaires across different populations in future research.
Human ; Psychometrics ; Checklist ; Self Report ; Universities ; Health Education
2.Psychometric properties of self-report questionnaires in evaluating blended learning in health science university students: A systematic review.
Valentin C. DONES III ; Maria Teresita B. DALUSONG ; Donald G. MANLAPAZ ; Juan Alfonso S. ROJAS ; Ma. Bianca Beatriz P. BALLESTEROS ; Ron Kevin S. FLORES ; Kaela Celine C. HO ; Jose Angelo D. MONREAL ; Audrey Marie A. NARCELLES ; Jose Joaquin R. REYES ; Lianna Andrea B. SANGATANAN
Acta Medica Philippina 2025;59(16):79-92
BACKGROUND
Due to the COVID-19 outbreak, schools had to switch online. The sudden transition to blended teaching and learning (BTL) poses challenges for students and teachers, especially in health science programs that require hands-on practical experience. The validity, reliability, and responsiveness of the self-report questionnaires (SRQs) used to evaluate BTL should be established to ensure that their results are accurate and reflect what each SRQ is intended to measure.
OBJECTIVES
This study critically appraised, compared, and summarized the psychometric properties of SRQs evaluating BTL among health science university students. This review determined the SRQs' reliability, internal consistency, various forms of validity (content, criterion, construct), and responsiveness.
METHODS
Following a 10-step procedure based on COSMIN guidelines, we conducted a systematic review of SRQs used by health science university students to evaluate blended teaching and learning. Studies were eligible if they reported psychometric properties of SRQs related to blended learning among university health science students; exclusions included studies focusing on perceptions, attitudes, self-efficacy, and satisfaction, as well as articles such as biographies, editorials, and conference materials. Searches covered multiple electronic databases until April 26, 2023, including PubMed, EMBASE, Web of Science, MEDLINE (OVID), PsycInfo, CINAHL, EBSCOHOST, ERIC, Scopus, Science Direct, Google Scholar, JSTOR, Acta Medica Philippina, Philippine Journal of Health Research and Development, and HERDIN, managed through Zotero. Two independent reviewers performed database searches, title and abstract screening, and full-text evaluations, with a third reviewer resolving any disputes. The COSMIN Risk of Bias Checklist was employed to evaluate included studies on the development and various measurement properties of SRQs. The reviewers assessed SRQ standards, including validity, reliability, internal consistency, measurement error, responsiveness, interpretability, and feasibility. Data extraction and result tabulation were independently completed, with content comparison by two health education experts. This evaluation categorized the SRQs into three quality and validity levels.
RESULTS
The study examined five articles; four were rated as 'doubtful' and one as 'inadequate' in the overall development of SRQ. All four 'doubtful' studies demonstrated questionable content validity when university students were asked about the questionnaire's relevance, comprehensiveness, and comprehensibility. Only half of these studies achieved an 'adequate' rating for content validity based on expert opinions on relevance and comprehensiveness. All but one study scored from 'very good' to 'adequate' in structural validity. Three out of the four studies scored a very good rating for internal consistency, while one was deemed 'inadequate' in internal consistency, cross-cultural validity, and reliability. Three out of four studies scored 'very good' on construct validity, but all overlooked criterion validity and responsiveness. Conducted in various locations, including Australia, Romania, Turkey, and Taiwan, these studies highlighted both common characteristics and limitations in questionnaire development according to the COSMIN guidelines. Four studies were deemed reliable and valid for BTL constructs (Category A); Wu et al. requires further validation (Category B). Study limitations included heterogeneity in populations, settings, and questionnaire versions, potential subjective bias in SRQ content comparison, and the evolving nature of SRQs in blended learning contexts.
CONCLUSION
The systematic review reports the development and evaluation of SRQs for BTL while identifying gaps in their applicability to health science programs. The Blended Learning Scale (BLS) of Lazar et al. and the Blended Learning Questionnaire (BLQ) of Ballouk et al. showed an 'adequate' rating for content validity. BLS revealed very good structural validity, internal consistency, and adequate content validation. Although the BLQ lacked Confirmatory Factor Analysis, it yielded valuable constructs for evaluating health sciences students' experiences in BTL. Both tools require improvements on recall period, completion time, interpretability, and feasibility. The review underscores the necessity for continuous assessment and enhancement of such instruments in BTL, advocating a rigorous scale development process. Furthermore, it encourages the customization of teaching and learning evaluation tools to suit specific institutional contexts while promoting further validation of these questionnaires across different populations in future research.
Human ; Psychometrics ; Checklist ; Self Report ; Universities ; Health Education
3.Development and Initial Validation of the Multi-Dimensional Attention Rating Scale in Highly Educated Adults.
Xin-Yang ZHANG ; Karen SPRUYT ; Jia-Yue SI ; Lin-Lin ZHANG ; Ting-Ting WU ; Yan-Nan LIU ; Di-Ga GAN ; Yu-Xin HU ; Si-Yu LIU ; Teng GAO ; Yi ZHONG ; Yao GE ; Zhe LI ; Zi-Yan LIN ; Yan-Ping BAO ; Xue-Qin WANG ; Yu-Feng WANG ; Lin LU
Chinese Medical Sciences Journal 2025;40(2):100-110
OBJECTIVES:
To report the development, validation, and findings of the Multi-dimensional Attention Rating Scale (MARS), a self-report tool crafted to evaluate six-dimension attention levels.
METHODS:
The MARS was developed based on Classical Test Theory (CTT). A total of 202 highly educated healthy adult participants were recruited for reliability and validity tests. Reliability was measured using Cronbach's alpha and test-retest reliability. Structural validity was explored using principal component analysis. Criterion validity was analyzed by correlating MARS scores with the Toronto Hospital Alertness Test (THAT), the Attentional Control Scale (ACS), and the Attention Network Test (ANT).
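For readers unfamiliar with the internal-consistency statistic named above, the following is a minimal sketch of how Cronbach's alpha is conventionally computed from a respondents-by-items score table. The function name and the response data are hypothetical illustrations; they are not the MARS data or the authors' analysis code.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Return Cronbach's alpha for a respondents-by-items score table.

    scores: list of rows, one per respondent, each holding k item scores.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    # Sample variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*scores)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses: 5 respondents x 4 items on a 1-5 scale.
responses = [
    [4, 5, 4, 5],
    [2, 3, 2, 2],
    [3, 3, 4, 3],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
]
print(round(cronbach_alpha(responses), 3))
```

Alpha rises as items covary more strongly relative to their individual variances, which is why it is read as an index of internal consistency rather than of stability over time.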
RESULTS:
The MARS comprises 12 items spanning six distinct dimensions of attention: focused attention, sustained attention, shifting attention, selective attention, divided attention, and response inhibition. As assessed by six experts, the content validation index (CVI) was 0.95, the Cronbach's alpha for the MARS was 0.78, and the test-retest reliability was 0.81. Four factors were identified (cumulative variance contribution rate 68.79%). The total score of MARS was correlated positively with THAT (r = 0.60, P < 0.01) and ACS (r = 0.78, P < 0.01) and negatively with ANT's reaction time for alerting (r = -0.31, P = 0.049).
CONCLUSIONS:
The MARS can reliably and validly assess six-dimension attention levels in real-world settings and is expected to be a new tool for assessing multi-dimensional attention impairments in different mental disorders.
Humans ; Adult ; Male ; Attention/physiology* ; Female ; Middle Aged ; Reproducibility of Results ; Young Adult ; Psychometrics
4.Application of Assessment Scales in Palliative Care for Glioma: A Systematic Review.
Zhi-Yuan XIAO ; Tian-Rui YANG ; Ya-Ning CAO ; Wen-Lin CHEN ; Jun-Lin LI ; Ting-Yu LIANG ; Ya-Ning WANG ; Yue-Kun WANG ; Xiao-Peng GUO ; Yi ZHANG ; Yu WANG ; Xiao-Hong NING ; Wen-Bin MA
Chinese Medical Sciences Journal 2025;40(3):211-218
BACKGROUND AND OBJECTIVE:
Patients with glioma experience a high symptom burden and have diverse palliative care needs. However, the assessment scales used in palliative care remain non-standardized and highly heterogeneous. We evaluated the application patterns of the scales currently used in palliative care for glioma in order to identify gaps and assess the need for disease-specific scales.
METHODS:
We conducted a systematic search of five databases (PubMed, Web of Science, Medline, EMBASE, and CINAHL) for quantitative studies that reported scale-based assessments in glioma palliative care. We extracted data on scale characteristics, domains, frequency, and psychometric properties. Quality assessments were performed using the Cochrane ROB 2.0 and ROBINS-I tools.
RESULTS:
Of the 3,405 records initially identified, 72 studies were included. These studies contained 75 distinct scales that were used 193 times. Mood (21.7%), quality of life (24.4%), and supportive care needs (5.2%) were the most frequently assessed domains, together exceeding half of all scale applications. The Distress Thermometer (DT) was the most frequently used tool for assessing mood, while the Short Form-36 Health Survey Questionnaire (SF-36) was the most frequently used tool for assessing quality of life. The Mini Mental Status Examination (MMSE) was the most common tool for cognitive assessment. Performance status (5.2%) and social support (6.8%) were underrepresented. Only three brain tumor-specific scales were identified. Caregiver-focused scales were limited and predominantly burden-oriented.
CONCLUSIONS:
There is significant heterogeneity, domain imbalance, and a lack of validation in the current use of assessment scales for patients with glioma receiving palliative care. The scale selected for use should be comprehensive and user-friendly.
Humans ; Glioma/psychology* ; Palliative Care/methods* ; Quality of Life ; Psychometrics ; Brain Neoplasms/psychology*
5.Validation and cultural adaptation of the Japanese version of the Self-Care Inventory across different research settings: a cross-sectional study.
Atsushi TAKAYAMA ; Shiho KOIZUMI ; Yoshihito KATO ; Tatsuya ISOMURA ; Tatsuyuki HOSOYA ; Koji KAWAKAMI
Environmental Health and Preventive Medicine 2025;30():85-85
BACKGROUND:
Self-care is increasingly recognized as the foundation of person-centered healthcare and a key driver for simultaneously improving population health outcomes and reducing healthcare expenditures. While the Self-Care Inventory (SCI) has been validated in several languages, Japan lacks a standardized instrument for assessing self-care in the general adult population. Moreover, it remains unclear whether the SCI reflects culturally specific self-care behaviors and retains its psychological measurement properties in non-Western contexts. Addressing both aspects, this study aimed to evaluate the Japanese version of the SCI (JSCI) in terms of its psychometric properties and its association with concrete health behaviors.
METHODS:
We adapted the JSCI following COSMIN guidelines using forward/backward translation, expert review, and cognitive debriefing. Psychometric evaluation was based on two samples: a nationwide web-based survey (n = 504) and a community-based paper survey (n = 75). Structural validity was examined via CFA; internal consistency via Cronbach's alpha and McDonald's omega; and test-retest reliability via ICCs. Convergent and criterion validity were assessed through correlations with relevant psychological constructs. Measurement invariance and DIF across modes were tested, and associations with five external self-care behaviors were evaluated using AUC.
RESULTS:
The hypothesized three-factor structure of the JSCI was supported across both administration modes (CFI = 0.926-0.942; SRMR < 0.06), although some subscales had elevated RMSEA. Internal consistency was acceptable to high (α = 0.75-0.85; ω = 0.81-0.92). ICCs indicated moderate to good temporal stability. JSCI scores correlated with self-care efficacy and other related constructs, supporting convergent and criterion validity. Configural invariance was confirmed, and no significant DIF was detected across modes. JSCI scores modestly discriminated individuals engaging in concrete self-care behaviors such as physical activity, strength training, Helicobacter pylori testing, and having a regular primary or dental care provider (AUCs = 0.62-0.80).
CONCLUSIONS:
The JSCI demonstrated satisfactory psychometric properties and structural validity across diverse research settings. Its observed associations with a range of meaningful self-care behaviors support the scale's ecological and practical relevance in the Japanese context. The JSCI may serve as a reliable tool for evaluating and promoting self-care in both research and population health initiatives.
Humans ; Japan ; Self Care/statistics & numerical data* ; Psychometrics ; Male ; Female ; Adult ; Cross-Sectional Studies ; Middle Aged ; Reproducibility of Results ; Surveys and Questionnaires ; Young Adult ; Aged ; Health Behavior ; Translations ; East Asian People
6.Sinicization and psychometric validation of the German Pelvic Floor Questionnaire for Pregnant and Postpartum Women.
Liping ZHU ; Chengyu ZHOU ; Xuhong LI ; Qiao HOU ; Shuo YANG
Journal of Central South University(Medical Sciences) 2025;50(1):72-80
OBJECTIVES:
Pelvic floor dysfunction is common among pregnant and postpartum women and significantly impacts quality of life. This study aims to translate the German Pelvic Floor Questionnaire for Pregnant and Postpartum Women into Chinese and to evaluate its reliability and validity in the Chinese population.
METHODS:
The questionnaire was translated using the Brislin model. A cross-sectional study was conducted among pregnant and postpartum women to assess the content validity, construct validity, Cronbach's α coefficient, test-retest reliability, and split-half reliability of the Chinese version.
RESULTS:
A total of 72 women were included, with 6.9% being pregnant and 93.1% postpartum; the mean age was 32.3 ± 3.6 years. The Chinese version of the questionnaire contains 4 dimensions and 45 items. The content validity index of individual items ranged from 0.833 to 1.000, with a scale-level content validity index of 0.977 and intraclass correlation coefficients (ICCs) exceeding 0.90. The overall Cronbach's α coefficient was 0.891, with subscale coefficients ranging from 0.732 to 0.884 (all ICCs>0.70). The test-retest reliability of the total scale was 0.833, and for the 4 dimensions (bladder, bowel, prolapse, and sexual function) the values were 0.776, 0.579, 0.732, and 0.645, respectively. The split-half reliability was 0.74.
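The item-level and scale-level content validity indices reported above are commonly computed as follows: the item-level index (I-CVI) is the proportion of experts who rate an item as relevant, and the scale-level index in its averaging form (S-CVI/Ave) is the mean of the I-CVIs. The sketch below assumes a 4-point relevance scale on which ratings of 3 or 4 count as relevant; the abstract does not state the rating scale, the number of experts, or which S-CVI variant was used, so the function names and ratings are hypothetical.

```python
def item_cvi(ratings, relevant_min=3):
    """Proportion of experts rating one item at or above relevant_min."""
    return sum(1 for r in ratings if r >= relevant_min) / len(ratings)


def scale_cvi_ave(rating_matrix, relevant_min=3):
    """Average of the item-level CVIs (the S-CVI/Ave variant)."""
    item_values = [item_cvi(row, relevant_min) for row in rating_matrix]
    return sum(item_values) / len(item_values)


# Hypothetical ratings: 2 items, each rated by 6 experts on a 1-4 scale.
expert_ratings = [
    [4, 4, 3, 4, 3, 4],  # all six experts rate the item as relevant
    [4, 3, 2, 4, 4, 3],  # five of six experts rate the item as relevant
]
for row in expert_ratings:
    print(round(item_cvi(row), 3))
print(round(scale_cvi_ave(expert_ratings), 3))
```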
CONCLUSIONS:
The Chinese version of the questionnaire demonstrated good reliability and validity, indicating its applicability in assessing pelvic floor dysfunction and associated risk factors during pregnancy and postpartum.
Humans ; Female ; Surveys and Questionnaires ; Pregnancy ; Adult ; Postpartum Period ; Psychometrics ; Pelvic Floor Disorders/diagnosis* ; Cross-Sectional Studies ; Quality of Life ; Pelvic Floor/physiopathology* ; Reproducibility of Results ; China ; Translations ; Young Adult
7.The modified Chinese version of Wong and Law Emotional Intelligence Scale for measurement of emotional health: revision and psychometric evaluation.
Journal of Southern Medical University 2025;45(10):2191-2198
OBJECTIVES:
To revise and evaluate the psychometric properties of the Chinese version of the Wong and Law's Emotional Intelligence Scale (WLEIS).
METHODS:
Eleven items of the original WLEIS were modified to form the WLEIS-CR, with the Generalized Anxiety Disorder Scale (GAD-7), 9-item Patient Health Questionnaire (PHQ-9), and Flourishing Scale (FS) as the validity criteria. A total of 1546 adult participants were evaluated using all these scales, and a retest was conducted among 192 college students to assess the item discrimination, reliability, validity and measurement invariance of the modified WLEIS-CR.
RESULTS:
All 16 items of the modified WLEIS-CR demonstrated good discriminative power (r=0.570-0.764, P<0.001). The structural equation model from a confirmatory factor analysis showed excellent fit indices (χ²/df=4.610, GFI=0.965, PGFI=0.674, RMR=0.028, NFI=0.975, CFI=0.980, RMSEA=0.048). The criterion-related validity of the modified WLEIS-CR with FS, GAD-7, and PHQ-9 was 0.674, -0.347, and -0.368, respectively (P<0.001). The internal consistency (Cronbach's α) was 0.913 for the total scale and ranged from 0.867 to 0.916 for the subscales. The split-half reliability was 0.956 for the total scale and 0.865-0.924 for the subscales. Test-retest reliability was 0.701 for the total scale and 0.610-0.684 for the subscales. Normative interpretation criteria were established: 7.6% of participants had "low", 19.3% had "below average", 22.3% had "moderate", 34.3% had "above average", and 16.5% had "very high" emotional intelligence. The scale demonstrated good measurement invariance across gender, identity, and age groups.
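Split-half reliability, as reported above, is often obtained by correlating scores on two halves of a scale and then applying the Spearman-Brown correction to estimate the reliability of the full-length scale. The sketch below assumes an odd/even item split and the Spearman-Brown formula; the abstract does not say how the halves were formed or which correction was applied, and the helper names and data are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)


def spearman_brown(half_correlation):
    """Step a half-test correlation up to full-test reliability."""
    return 2 * half_correlation / (1 + half_correlation)


def split_half_reliability(scores):
    """Odd/even split-half reliability with Spearman-Brown correction."""
    first_half = [sum(row[0::2]) for row in scores]   # items 1, 3, 5, ...
    second_half = [sum(row[1::2]) for row in scores]  # items 2, 4, 6, ...
    return spearman_brown(pearson_r(first_half, second_half))


# Hypothetical responses: 5 respondents x 4 items.
responses = [
    [4, 5, 4, 5],
    [2, 3, 2, 2],
    [3, 3, 4, 3],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
]
print(round(split_half_reliability(responses), 3))
```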
CONCLUSIONS:
The modified WLEIS-CR has good reliability, validity and measurement invariance, and is suitable for evaluating emotional intelligence of Chinese adults to assess their emotional health.
Humans ; Psychometrics ; Emotional Intelligence ; Young Adult ; Adult ; Male ; Female ; Surveys and Questionnaires ; Reproducibility of Results ; Adolescent
8.Scale development and validation of perimenopausal women disability index in the workplace.
Kyoko NOMURA ; Kisho SHIMIZU ; Fumiaki TAKA ; Melanie GRIFFITH-QUINTYNE ; Miho IIDA
Environmental Health and Preventive Medicine 2024;29():4-4
BACKGROUND:
Menopausal disorders include obscure symptomatology that greatly reduces work productivity among female workers. Quantifying the impact of menopause-related symptoms on work productivity is very difficult because no guidelines for doing so exist to date. We aimed to develop a scale of overall health status for working women in the perimenopausal period.
METHODS:
In September 2021, we conducted a web-based survey of 3,645 female workers aged 45-56 years in the perimenopausal period. Participants answered 76 items on menopausal symptomatology that were created for this study, and we performed exploratory and confirmatory factor analyses to develop the scale. Cronbach's alpha, receiver operating characteristic analysis, and logistic regression analysis were used to verify the developed scale.
RESULTS:
Approximately 85% of participants had no menstruation or had disrupted cycles. Exploratory factor analysis using the maximum likelihood method and Promax rotation identified 21 items with a four-factor structure: psychological symptoms (8 items, α = 0.96); physiological symptoms (6 items, α = 0.87); sleep difficulty (4 items, α = 0.92); and human relationship (3 items, α = 0.92). Confirmatory factor analyses found excellent model fit for the four-factor model (RMSR = 0.079; TLI = 0.929; CFI = 0.938). Criterion and concurrent validity were confirmed by high correlation coefficients between each of the four factors and, respectively, a previously validated menopausal symptom questionnaire and the Copenhagen Burnout Inventory scales (all ps < 0.0001). The developed scale was able to predict absenteeism with 78% sensitivity, 58% specificity, and an AUC of 0.727 (95%CI: 0.696-0.757). Higher scores on each factor, as well as a higher total score, were associated with work absence due to menopause-related symptoms even after adjusting for Copenhagen Burnout Inventory subscales (all ps < 0.0001).
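The sensitivity, specificity, and AUC figures above come from receiver operating characteristic analysis. A minimal sketch of how such quantities are computed from scale scores and a binary outcome follows; it assumes that a score at or above a cutoff counts as a positive prediction and uses the pairwise-comparison form of the AUC. The function names, cutoff, and data are hypothetical and are not the study's data.

```python
def sensitivity_specificity(scores, labels, cutoff):
    """Sensitivity and specificity when score >= cutoff predicts label 1."""
    pairs = list(zip(scores, labels))
    tp = sum(1 for s, y in pairs if y == 1 and s >= cutoff)
    fn = sum(1 for s, y in pairs if y == 1 and s < cutoff)
    tn = sum(1 for s, y in pairs if y == 0 and s < cutoff)
    fp = sum(1 for s, y in pairs if y == 0 and s >= cutoff)
    return tp / (tp + fn), tn / (tn + fp)


def auc(scores, labels):
    """Probability that a random positive case outscores a random negative.

    Ties between a positive and a negative case count as half a win.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Hypothetical scale scores and outcomes (1 = reported work absence).
scale_scores = [1, 3, 2, 4]
absent = [0, 0, 1, 1]
print(sensitivity_specificity(scale_scores, absent, cutoff=2))
print(auc(scale_scores, absent))
```

Sensitivity and specificity depend on the chosen cutoff, whereas the AUC summarizes discrimination across all possible cutoffs.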
CONCLUSION:
We found that the developed scale has high validity and reliability and could be a significant indicator of absenteeism for working women in the perimenopausal period.
Humans ; Female ; Perimenopause ; Reproducibility of Results ; Menopause/psychology* ; Workplace ; Surveys and Questionnaires ; Psychometrics
9.Adapting the media exposure survey to measure parental attitude and screen use of Filipino children: A psychometric study
Paulin Grace Morato-Espino ; Maria Patricia Josefina Berceno ; Elijah Miguel Guiao ; Elyssa Manuel ; Dana Marie Salo ; Catherine Anne Tan ; Julie Franz Tanchuling
Philippine Journal of Allied Health Sciences 2024;7(2):28-39
Background:
Parents hold various attitudes regarding their child's screen usage. However, there are no existing Filipino-translated and culturally appropriate questionnaires or assessment tools that can measure a child's media exposure, screen use, and parental attitude. The Media Exposure Survey is an assessment tool that measures a child’s media exposure, screen use, and parental attitudes regarding their child’s screen usage.
Objectives:
The study aims to contextualize and translate the questionnaire into Filipino, determine its content validity and internal consistency, and check the translated questionnaire's compatibility and applicability.
Methods:
The study involves four steps: 1) content validity testing, 2) forward and backward translation and equivalence, 3) pilot testing of the pre-final version, and 4) reliability testing. Data analysis was done to evaluate the content validity and internal consistency of the questionnaire. Thirty-six parents of children aged 0-5 in Metro Manila pilot tested the tool.
Results:
A cross-culturally adapted version of the Media Exposure Survey has been produced with good content validity. The S-CVI of the questionnaire is 95%, which is excellent. The parental attitude towards childhood media use subscale has an acceptable internal consistency with a Cronbach's alpha of 0.77.
Conclusion:
The translated and adapted Media Exposure Survey has good content validity and acceptable internal consistency and can be used to assess Filipino children’s media exposure, screen use, and parental attitudes toward media use.
Surveys and Questionnaires ; Screen Time ; Psychometrics
10.Revision of brief health literacy assessment scale among the older adults and its reliability and validity test.
Shaojie LI ; Guanghui CUI ; Huilan XU
Journal of Central South University(Medical Sciences) 2023;48(1):123-129
OBJECTIVES:
The development and validation of a health literacy assessment tool specific to older adults is the basis for research on health literacy in this population. The existing health literacy assessment scale for older adults in mainland China has some limitations, such as too many items and poor compliance during the survey. It is necessary to develop or introduce simplified assessment tools to support large-scale surveys in the future. This study aims to modify the brief health literacy assessment scale developed by scholars in Taiwan and to test its reliability, validity, and measurement equivalence across gender in the older population in mainland China.
METHODS:
From March to April 2021, 508 older adults from Jinan, Shandong Province, China were selected by cluster sampling method to conduct a questionnaire survey using the brief health literacy assessment scale and health-promoting lifestyle profile. After 4 weeks, 83 of them were selected for retesting. SPSS 25.0 statistical software was used for descriptive analysis, item analysis, exploratory factor analysis, correlation analysis, and reliability test, and Mplus 8.0 was used for confirmatory factor analysis and gender measurement equivalence test.
RESULTS:
Each item of the scale had good discrimination: there were significant differences in the scores of each item between the high-score and low-score groups (P<0.05), and the correlation coefficients between each item score and the total score ranged from 0.721 to 0.891. Exploratory factor analysis extracted one factor with an eigenvalue greater than 1, which explained 67.94% of the cumulative variance. The confirmatory factor analysis showed that the single-factor structure fit well [χ²/df was 2.260, the Tucker-Lewis index was 0.973, the comparative fit index (CFI) was 0.982, and the root mean square error of approximation (RMSEA) was 0.071]. The multi-group confirmatory factor analysis results showed that the brief health literacy assessment scale's configural equivalence, weak equivalence, and strong equivalence models were all accepted. The comparison of measurement equivalence models showed that the changes in RMSEA were less than 0.015 and the changes in CFI were less than 0.01, indicating that the brief health literacy assessment scale had measurement equivalence between gender groups. Cronbach's α coefficient was 0.945, and the test-retest reliability was 0.946. The correlation coefficient between health literacy and health-promoting lifestyle was 0.557 (P<0.05).
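The decision rule described above, in which successively constrained multi-group models are accepted when CFI and RMSEA change by less than fixed cutoffs, can be sketched as a simple comparison of adjacent models. The cutoffs (0.01 for CFI, 0.015 for RMSEA) are the ones stated in the abstract; the function name, the use of absolute changes, and the fit values are hypothetical assumptions, since the abstract reports only the overall model's indices and not those of each equivalence model.

```python
def invariance_supported(models, max_delta_cfi=0.01, max_delta_rmsea=0.015):
    """Check each successively constrained model against the previous one.

    models: list of (name, cfi, rmsea), ordered from least to most constrained.
    Returns True when every step changes CFI and RMSEA by less than the cutoffs.
    """
    for (_, cfi_a, rmsea_a), (_, cfi_b, rmsea_b) in zip(models, models[1:]):
        if abs(cfi_b - cfi_a) >= max_delta_cfi:
            return False
        if abs(rmsea_b - rmsea_a) >= max_delta_rmsea:
            return False
    return True


# Hypothetical fit indices for illustration only, not taken from the study.
fits = [
    ("configural", 0.982, 0.071),
    ("weak", 0.980, 0.068),
    ("strong", 0.976, 0.066),
]
print(invariance_supported(fits))
```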
CONCLUSIONS:
The brief health literacy assessment scale has good reliability, validity, and measurement equivalence across gender, and can be used as an effective measurement tool for the health literacy of older people in mainland China.
Humans ; Aged ; Reproducibility of Results ; Health Literacy/methods* ; Psychometrics ; Surveys and Questionnaires ; Asian People ; China ; Factor Analysis, Statistical

