Sampling Methods

Cluster Sampling: One-Stage vs. Two-Stage, Design Effect, and Examples

7 min read

Learn how cluster sampling works, the difference between one-stage and two-stage designs, how to calculate design effect, and when to choose cluster over stratified sampling.

What Is Cluster Sampling?

Cluster sampling is a probability sampling method where the population is divided into groups (clusters), typically geographic areas or organizational units, and a random subset of those clusters is selected. Researchers then collect data from individuals within the selected clusters, either surveying everyone in each cluster (one-stage) or drawing a random sample within each cluster (two-stage). It's the standard approach when a population is too geographically dispersed or logistically fragmented to sample individuals directly, which is why it's the backbone of national household surveys, public health studies, and multi-site organizational research.

Why Cluster Sampling Matters in Research

Reaching individuals spread across an entire country or across hundreds of organizational sites is expensive. If you need to survey hospital patients nationwide, visiting all 6,000 hospitals isn't feasible. Cluster sampling lets you randomly select 50 hospitals and survey patients there, dramatically cutting travel, coordination, and field costs while maintaining a probability-based design. The trade-off is lower precision per dollar compared to stratified or simple random sampling, but that trade-off is often worth it when the alternative is not doing the study at all.

How Cluster Sampling Works

Defining Clusters

Good clusters should be:

  • Internally heterogeneous: Each cluster should contain a mix of people that resembles the overall population. Schools with diverse student bodies, neighborhoods with mixed demographics, and hospitals serving varied patient populations all make good clusters.
  • Relatively similar to each other: If one cluster is dramatically different from another (a wealthy suburb vs. A low-income urban area), you'll get more between-cluster variability, which inflates sampling error.
  • Naturally occurring: Use existing administrative or geographic boundaries, schools, voting districts, retail store locations, hospital systems. Don't create clusters artificially.

One-Stage vs. Two-Stage Cluster Sampling

One-stage cluster sampling: Randomly select clusters, then survey every individual within each selected cluster. This is simpler and captures complete within-cluster information, but total sample sizes can balloon if clusters are large.

Two-stage cluster sampling: Randomly select clusters, then randomly sample individuals within each selected cluster. This is more common in practice because it controls total sample size and cost.

Multi-stage sampling extends the logic further. A national health survey might: (1) randomly select states, (2) randomly select counties within those states, (3) randomly select census tracts within those counties, (4) randomly select households within those tracts. Each stage introduces a randomization step and a design effect.

Worked Example

A retail chain wants to measure customer satisfaction across its 200 store locations. Budget allows surveying at 20 locations.

  1. List all clusters: 200 stores across the country
  2. Randomly select clusters: Use a random number generator to pick 20 stores
  3. Sample within clusters (two-stage): At each selected store, randomly sample 50 customers from the past month's transaction records
  4. Total sample: 20 stores x 50 customers = 1,000 respondents

If using one-stage instead: survey all customers who visited the 20 stores during the field period. This could yield thousands of respondents but with less control over total sample size.

The Design Effect

Cluster sampling is less precise than simple random sampling because people within the same cluster tend to be more alike than people across different clusters. Patients at the same hospital share the same care protocols. Students at the same school share the same teachers. This within-cluster similarity inflates your effective sampling error.

The design effect (DEFF) quantifies this inflation:

DEFF = 1 + (average cluster size - 1) x ICC

Where ICC is the intraclass correlation coefficient, a measure of how similar individuals are within clusters relative to the total population. ICC values in survey research typically range from 0.01 to 0.05, but can be higher for tightly clustered variables.

Example: If your average cluster size is 50 and the ICC is 0.03:

DEFF = 1 + (50 - 1) x 0.03 = 1 + 1.47 = 2.47

This means you need 2.47 times as many respondents as you would with simple random sampling to achieve the same precision. A study that would need 400 respondents under simple random sampling needs about 988 under this cluster design.

Cluster Sampling vs. Stratified Sampling

Feature Cluster Sampling Stratified Sampling
Groups should be Internally diverse (heterogeneous within) Internally similar (homogeneous within)
Groups sampled Random subset All groups
Individual selection Random within selected groups Random within all groups
Precision vs. SRS Lower Higher
Primary advantage Cost reduction Precision improvement
Best when Population is geographically dispersed Key subgroups need guaranteed representation

The key distinction: in stratified sampling, you sample from every stratum. In cluster sampling, you skip entire clusters. Stratification improves precision; clustering saves money.

When to Use Cluster Sampling

  • Large, geographically dispersed populations where reaching individuals directly would require unrealistic travel or logistics, national consumer studies, multi-site employee surveys, public health research
  • No individual-level sampling frame exists, but a list of clusters does: you may not have a list of all patients in a region, but you have a list of all hospitals
  • Budget constraints make individual-level probability sampling infeasible: cluster sampling trades precision for affordability
  • Multi-site organizational research: studying practices across schools, franchises, clinics, or warehouses where site-level variation is part of the research question
  • Paired with stratification for efficiency: stratify clusters first (e.g., by size or region), then randomly select clusters within each stratum to get the cost benefits of clustering with some of the precision benefits of stratification

Common Mistakes to Avoid

  • Ignoring the design effect in sample size calculations. Using simple random sampling formulas for cluster designs underestimates the sample you need and overstates the precision of your results. Always multiply your base sample size by the DEFF.
  • Creating clusters that are too homogeneous internally. If each cluster contains only similar people (e.g., all high-income or all rural), between-cluster differences will dominate your results and precision drops sharply.
  • Selecting too few clusters. Precision in cluster sampling depends more on the number of clusters than on the number of individuals per cluster. Twenty clusters of 25 people typically outperforms 5 clusters of 100 people.
  • Analyzing clustered data as if it were simple random. Standard error estimates that ignore clustering will be too small, making results look more statistically significant than they are. Use software that accounts for complex survey designs.
  • Confusing cluster sampling with convenience sampling. Randomly selecting 10 stores and surveying everyone there is cluster sampling. Choosing 10 stores near the head office because they're convenient is not.

How Quali-Fi Supports Cluster Sampling

Quali-Fi's Research platform supports multi-site study designs with per-location survey deployment, real-time monitoring of response rates by cluster, and data exports that preserve cluster identifiers for proper analysis. The platform's multi-channel distribution (web, email, SMS, QR code, kiosk) lets you deploy through different channels at different cluster sites based on what works best for each location's population.

Deploy multi-site research with Quali-Fi

Frequently Asked Questions

How many clusters should I select?

More clusters with fewer individuals each generally produces better estimates than fewer clusters with more individuals. A common guideline is a minimum of 30 clusters for stable variance estimation. If budget allows, 50+ clusters is preferable. The exact number depends on your ICC, desired precision, and budget constraints.

Can I use cluster sampling for online research?

Cluster sampling is most relevant when geography or organizational structure creates natural groupings that affect logistics or cost. For purely online studies where reaching any individual costs the same regardless of location, cluster sampling doesn't offer its usual cost advantage. You might still cluster by organizational unit (company, department) if that's how your population is structured.

What's a typical ICC value for market research?

Most survey variables in market research have ICC values between 0.01 and 0.05 when clusters are geographic areas or retail locations. Attitudinal variables tend to have lower ICCs than behavioral ones. Satisfaction scores within a single store location might have an ICC of 0.03-0.08, since the service experience is partly shared.

Frequently Asked Questions

Related Guides

Sampling Methods

Sampling Methods in Research: Complete Guide to Choosing the Right Approach

Learn how to choose the right sampling method for your research. Covers probability and non-probability techniques, sample size basics, and a decision framework for surveys and market research.

13 min readRead
Sampling Methods

Probability Sampling: Methods, Examples, and When to Use Each

Learn what probability sampling is, how the four main methods work, and when each is the right choice for surveys and market research studies.

10 min readRead
Sampling Methods

Stratified Sampling: How It Works, When to Use It, and Examples

Learn how stratified sampling works, when to use proportionate vs. disproportionate allocation, and how it compares to cluster sampling in survey research.

8 min readRead
Sampling Methods

Simple Random Sampling: What It Is, How to Do It, and When to Use It

Learn what simple random sampling is, how to implement it with random number generators, and when it's the right choice for surveys and research studies.

5 min readRead
Sampling Methods

Systematic Sampling: How It Works, Periodicity Risk, and Worked Example

Learn how systematic sampling works, how to calculate the sampling interval, the periodicity risk to watch for, and when to use it in survey research.

6 min readRead
Sampling Methods

Random Sampling: What It Is, How It Works, and Why It Matters

Learn what random sampling is, how random selection works in practice, the difference between random and non-random methods, and when random sampling is the right choice.

6 min readRead
Sampling Methods

Sample Size Determination: Formulas, Factors, and Practical Guidelines

Learn how to calculate the right sample size for surveys and research studies. Covers the core formula, factors that affect sample size, and practical guidelines for market research.

6 min readRead

Put it into practice

Ready to apply this in your research?

Quali-Fi makes it easy to run surveys, conjoint studies, and more, all in one platform.