1.Alternatives to P value: confidence interval and effect size.
Korean Journal of Anesthesiology 2016;69(6):555-562
The previous articles of the Statistical Round in the Korean Journal of Anesthesiology posed a strong enquiry on the issue of null hypothesis significance testing (NHST). P values lie at the core of NHST and are used to classify all treatments into two groups: "has a significant effect" or "does not have a significant effect." NHST is frequently criticized for its misinterpretation of relationships and limitations in assessing practical importance. It has now provoked criticism for its limited use in merely separating treatments that "have a significant effect" from others that do not. Effect sizes and CIs expand the approach to statistical thinking. These attractive estimates facilitate authors and readers to discriminate between a multitude of treatment effects. Through this article, I have illustrated the concept and estimating principles of effect sizes and CIs.
Anesthesiology
;
Confidence Intervals*
;
Thinking
3.Significant results: statistical or clinical?.
Korean Journal of Anesthesiology 2016;69(2):121-125
The null hypothesis significance test method is popular in biological and medical research. Many researchers have used this method for their research without exact knowledge, though it has both merits and shortcomings. Readers will know its shortcomings, as well as several complementary or alternative methods, as such the estimated effect size and the confidence interval.
Biostatistics
;
Confidence Intervals
;
Models, Statistical
4.Comparison of Parametric and Bootstrap Method in Bioequivalence Test.
The Korean Journal of Physiology and Pharmacology 2009;13(5):367-371
The estimation of 90% parametric confidence intervals (CIs) of mean AUC and Cmax ratios in bioequivalence (BE) tests are based upon the assumption that formulation effects in log-transformed data are normally distributed. To compare the parametric CIs with those obtained from nonparametric methods we performed repeated estimation of bootstrap-resampled datasets. The AUC and Cmax values from 3 archived datasets were used. BE tests on 1,000 resampled datasets from each archived dataset were performed using SAS (Enterprise Guide Ver.3). Bootstrap nonparametric 90% CIs of formulation effects were then compared with the parametric 90% CIs of the original datasets. The 90% CIs of formulation effects estimated from the 3 archived datasets were slightly different from nonparametric 90% CIs obtained from BE tests on resampled datasets. Histograms and density curves of formulation effects obtained from resampled datasets were similar to those of normal distribution. However, in 2 of 3 resampled log (AUC) datasets, the estimates of formulation effects did not follow the Gaussian distribution. Bias-corrected and accelerated (BCa) CIs, one of the nonparametric CIs of formulation effects, shifted outside the parametric 90% CIs of the archived datasets in these 2 non-normally distributed resampled log (AUC) datasets. Currently, the 80~125% rule based upon the parametric 90% CIs is widely accepted under the assumption of normally distributed formulation effects in log-transformed data. However, nonparametric CIs may be a better choice when data do not follow this assumption.
Area Under Curve
;
Confidence Intervals
;
Phenothiazines
;
Therapeutic Equivalency
5.Comparison of 7 methods for sample size determination based on confidence interval estimation for a single proportion.
Mi Lai YU ; Xiao Tong SHI ; Bi Qing ZOU ; Sheng Li AN
Journal of Southern Medical University 2023;43(1):105-110
OBJECTIVE:
To compare different methods for calculating sample size based on confidence interval estimation for a single proportion with different event incidences and precisions.
METHODS:
We compared 7 methods, namely Wald, AgrestiCoull add z2, Agresti-Coull add 4, Wilson Score, Clopper-Pearson, Mid-p, and Jefferys, for confidence interval estimation for a single proportion. The sample size was calculated using the search method with different parameter settings (proportion of specified events and half width of the confidence interval [ω=0.05, 0.1]). With Monte Carlo simulation, the estimated sample size was used to simulate and compare the width of the confidence interval, the coverage of the confidence interval and the ratio of the noncoverage probability.
RESULTS:
For a high accuracy requirement (ω =0.05), the Mid-p method and Clopper Pearson method performed better when the incidence of events was low (P < 0.15). In other settings, the performance of the 7 methods did not differ significantly except for a poor symmetry of the Wald method. In the setting of ω=0.1 with a very low p (0.01-0.05), failure of iteration occurred with nearly all the methods except for the Clopper-Pearson method.
CONCLUSION
Different sample size determination methods based on confidence interval estimation should be selected for single proportions with different parameter settings.
Confidence Intervals
;
Sample Size
;
Computer Simulation
;
Monte Carlo Method
;
Probability
6.Comparative pharmacokinetic analysis based on nonlinear mixed effect model.
Lu-jin LI ; Xian-xing LI ; Ling XU ; Ying-hua LÜ ; Jun-chao CHEN ; Qing-shan ZHENG
Acta Pharmaceutica Sinica 2011;46(4):447-453
Comparative pharmacokinetic (PK) analysis is often carried out throughout the entire period of drug development, the common approach for the assessment of pharmacokinetics between different treatments requires that the individual PK parameters, which employs estimation of 90% confidence intervals for the ratio of average parameters, such as AUC and Cmax, these 90% confidence intervals then need to be compared with the pre-specified equivalent interval, and last we determine whether the two treatments are equivalent. Unfortunately in many clinical circumstances, some or even all of the individuals can only be sparsely sampled, making the individual evaluation difficult by the conventional non-compartmental analysis. In such cases, nonlinear mixed effect model (NONMEM) could be applied to analyze the sparse data. In this article, we simulated a sparsely sampling design trial based on the dense sampling data from a truly comparative PK study. The sparse data were analyzed with NONMEM method, and the original dense data were analyzed with non-compartment analysis. Although the trial design and analysis methods are different, the 90% confidence intervals for the ratio of PK parameters based on 1000 Bootstrap are very similar, indicated that the analysis based on NONMEM is a reliable method to treat with the sparse data in the comparative pharmacokinetic study.
Area Under Curve
;
Confidence Intervals
;
Humans
;
Nonlinear Dynamics
;
Pharmacokinetics
;
Sampling Studies
7.Fine mapping of susceptibility genes by Lewontin's linkage disequilibrium measure with application to Alzheimer's disease.
Gordon GONG ; Gleb HAYNATZKI ; Hong-Wen DENG ; Robert R RECKER ; John MORDESON ; Shih-Chuan CHENG ; Nelson FONG
Chinese Medical Journal 2002;115(8):1233-1240
OBJECTIVESTo formulate an equation for fine mapping of disease loci under complex conditions and determine the marker-disease distance in a specific case using this equation.
METHODSLewontin's linkage disequilibrium (LD) measure D' was used to formulate an equation for mapping disease genes in the presence of phenocopies, locus heterogeneity, gene-gene and gene-environment interactions, incomplete penetrance, uncertain liability and threshold, incomplete initial LD, natural selection, recurrent mutation, high disease allele frequency and unknown mode of inheritance. This equation was then used to determine the distance between a marker ( epsilon 4 within the apolipoprotein E gene, APOE) and Alzheimer's disease (AD) loci using published data.
RESULTSAn equation was formulated for mapping disease genes under the above conditions.If these conditions are present but ignored, then recombination fraction theta between marker and disease loci will be either overestimated or estimated with little bias. Therefore, an upper limit of theta can be obtained. AD has been found to be associated with the marker allele epsilon 4 in Africans, Asians, and Caucasians. This suggests that the AD- epsilon 4 allelic LD predates the divergence of peoples occurring 100 000 years ago. With the age of AD- epsilon 4 allelic LD so estimated, the maximal distance was calculated to be 23.2 kb (mean 5.8 kb).
CONCLUSIONS(1) A method is developed for LD mapping of susceptibility genes. (2) A mutation within the APOE gene itself, among others, is responsible for the susceptibility to AD, which is supported by recent evidence from studies using transgenic mice.
Alzheimer Disease ; genetics ; Chromosome Mapping ; Confidence Intervals ; Genetic Predisposition to Disease ; genetics ; Humans ; Linkage Disequilibrium ; Mutation
8.Number needed to treat (NNT), an index for clinical therapeutic efficacy assessment--its significance and application.
Chinese Journal of Integrated Traditional and Western Medicine 2010;30(7):752-756
Number needed to treat (NNT) is a simple and effective index for clinical therapeutic effect assessment worldwide accepted in recent years. By calculation of absolute risk reduction (ARR) of classified variables, it made the effect estimate reflect objectively the therapeutic effect of an intervention. However, clinical application of this index was introduced rarely in Chinese literature. With the examples from some published clinical reports, the calculation of NNT and its 95% confidence interval were demonstrated in this paper, and its application was illustrated by some relevant terms explanation and Meta-analysis methods introduction.
Confidence Intervals
;
Humans
;
Meta-Analysis as Topic
;
Numbers Needed To Treat
;
Outcome Assessment (Health Care)
;
methods
;
Treatment Outcome
9.The Discriminatiuon between Congenital and Acquired Color Vision Defects by Computerized Color Vision Test.
Young Joo SHIN ; Sang Yul CHOI ; Kyu Hyoung PARK ; Min Seoup KIM ; Jeoung Min HWANG ; Won Ryang WEE ; Jin Hak LEE ; In Bum LEE ; Young Suk YU ; Jae Hee CHOI
Journal of the Korean Ophthalmological Society 2005;46(1):125-132
PURPOSE: To investigate the characteristics of congenital and acquired color vision defects with Seohan computerized hue test and SNU (Seoul National University) computerized color test and to help to discriminate between congenital and acquired color vision defect METHODS: from June 2003 to January 2004, patient with congenital and acquired color vision defect and visual acuities more than 20/30 underwent Seohan computerized hue and SNU computerized color tests. Their results were compared with each other. Quadrant analysis and RQ calculation were done. RESULTS: On Seohan computerized hue and SNU computerized color tests, congenital color vision defects showed mainly red-green color vision defects (p<0.01, paired t-test) while acquired color vision defects showed blue-yellow color vision defect(p<0.01, paired t-test). RQ had 95% sensitivity and 98% specificity with a standard of 1.5 by Seohan computerized hue test, and 96% sensitivity and 98% specificity with standard of 1.0 by SNU computerized color test, for the discrimination of congenital and acquired color vision defects (ROC curve, confidence interval 95%). CONCLUSIONS: Seohan computerized hue and SNU computerized color tests were effective to classify types of color vision defects and discriminate between the congenital and acquired color vision defects.
Color Vision Defects*
;
Color Vision*
;
Confidence Intervals
;
Discrimination (Psychology)
;
Humans
;
Sensitivity and Specificity
;
Visual Acuity
10.Statistical Data Editing in Scientific Articles.
Journal of Korean Medical Science 2017;32(7):1072-1076
Scientific journals are important scholarly forums for sharing research findings. Editors have important roles in safeguarding standards of scientific publication and should be familiar with correct presentation of results, among other core competencies. Editors do not have access to the raw data and should thus rely on clues in the submitted manuscripts. To identify probable errors, they should look for inconsistencies in presented results. Common statistical problems that can be picked up by a knowledgeable manuscript editor are discussed in this article. Manuscripts should contain a detailed section on statistical analyses of the data. Numbers should be reported with appropriate precisions. Standard error of the mean (SEM) should not be reported as an index of data dispersion. Mean (standard deviation [SD]) and median (interquartile range [IQR]) should be used for description of normally and non-normally distributed data, respectively. If possible, it is better to report 95% confidence interval (CI) for statistics, at least for main outcome variables. And, P values should be presented, and interpreted with caution, if there is a hypothesis. To advance knowledge and skills of their members, associations of journal editors are better to develop training courses on basic statistics and research methodology for non-experts. This would in turn improve research reporting and safeguard the body of scientific evidence.
Confidence Intervals
;
Editorial Policies
;
Journalism
;
Normal Distribution
;
Peer Review
;
Publications
;
Research Design
;
Research Report