Sample size considerations for stratified cluster randomization design with binary outcomes and varying cluster size

Stat Med. 2019 Aug 15;38(18):3395-3404. doi: 10.1002/sim.8175. Epub 2019 Apr 29.

Abstract

Stratified cluster randomization trials (CRTs) have been frequently employed in clinical and healthcare research. Comparing with simple randomized CRTs, stratified CRTs reduce the imbalance of baseline prognostic factors among different intervention groups. Due to the popularity, there has been a growing interest in methodological development on sample size estimation and power analysis for stratified CRTs; however, existing work mostly assumes equal cluster size within each stratum and uses multilevel models. Clusters are often naturally formed with random sizes in CRTs. With varying cluster size, commonly used ad hoc approaches ignore the variability in cluster size, which may underestimate (overestimate) the required number of clusters for each group per stratum and lead to underpowered (overpowered) clinical trials. We propose closed-form sample size formulas for estimating the required total number of subjects and for estimating the number of clusters for each group per stratum, based on Cochran-Mantel-Haenszel statistic for stratified cluster randomization design with binary outcomes, accounting for both clustering and varying cluster size. We investigate the impact of various design parameters on the relative change in the required number of clusters for each group per stratum due to varying cluster size. Simulation studies are conducted to evaluate the finite-sample performance of the proposed sample size method. A real application example of a pragmatic stratified CRT of a triad of chronic kidney disease, diabetes, and hypertension is presented for illustration.

Keywords: binary outcomes; cluster randomization design; sample size; stratification; varying cluster size.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Biostatistics
  • Clinical Protocols
  • Cluster Analysis
  • Computer Simulation
  • Diabetic Angiopathies / complications
  • Diabetic Angiopathies / therapy
  • Diabetic Nephropathies / complications
  • Diabetic Nephropathies / therapy
  • Humans
  • Hypertension / complications
  • Hypertension / therapy
  • Outcome Assessment, Health Care / statistics & numerical data
  • Pragmatic Clinical Trials as Topic / statistics & numerical data
  • Random Allocation
  • Randomized Controlled Trials as Topic / statistics & numerical data*
  • Renal Insufficiency, Chronic / complications
  • Renal Insufficiency, Chronic / therapy
  • Sample Size