Cluster Sampling: One-Stage vs. Two-Stage, Design Effect, and Examples

Learn how cluster sampling works, the difference between one-stage and two-stage designs, how to calculate design effect, and when to choose cluster over stratified sampling.

What Is Cluster Sampling?

Cluster sampling is a probability sampling method where the population is divided into groups (clusters), typically geographic areas or organizational units, and a random subset of those clusters is selected. Researchers then collect data from individuals within the selected clusters, either surveying everyone in each cluster (one-stage) or drawing a random sample within each cluster (two-stage). It's the standard approach when a population is too geographically dispersed or logistically fragmented to sample individuals directly, which is why it's the backbone of national household surveys, public health studies, and multi-site organizational research.

Why Cluster Sampling Matters in Research

Reaching individuals spread across an entire country or across hundreds of organizational sites is expensive. If you need to survey hospital patients nationwide, visiting all 6,000 hospitals isn't feasible. Cluster sampling lets you randomly select 50 hospitals and survey patients there, dramatically cutting travel, coordination, and field costs while maintaining a probability-based design. The trade-off is lower precision per dollar compared to stratified or simple random sampling, but that trade-off is often worth it when the alternative is not doing the study at all.

How Cluster Sampling Works

Defining Clusters

Good clusters should be:

Internally heterogeneous: Each cluster should contain a mix of people that resembles the overall population. Schools with diverse student bodies, neighborhoods with mixed demographics, and hospitals serving varied patient populations all make good clusters.
Relatively similar to each other: If one cluster is dramatically different from another (a wealthy suburb vs. A low-income urban area), you'll get more between-cluster variability, which inflates sampling error.
Naturally occurring: Use existing administrative or geographic boundaries, schools, voting districts, retail store locations, hospital systems. Don't create clusters artificially.

One-Stage vs. Two-Stage Cluster Sampling

One-stage cluster sampling: Randomly select clusters, then survey every individual within each selected cluster. This is simpler and captures complete within-cluster information, but total sample sizes can balloon if clusters are large.

Two-stage cluster sampling: Randomly select clusters, then randomly sample individuals within each selected cluster. This is more common in practice because it controls total sample size and cost.

Multi-stage sampling extends the logic further. A national health survey might: (1) randomly select states, (2) randomly select counties within those states, (3) randomly select census tracts within those counties, (4) randomly select households within those tracts. Each stage introduces a randomization step and a design effect.

Worked Example

A retail chain wants to measure customer satisfaction across its 200 store locations. Budget allows surveying at 20 locations.

List all clusters: 200 stores across the country
Randomly select clusters: Use a random number generator to pick 20 stores
Sample within clusters (two-stage): At each selected store, randomly sample 50 customers from the past month's transaction records
Total sample: 20 stores x 50 customers = 1,000 respondents

If using one-stage instead: survey all customers who visited the 20 stores during the field period. This could yield thousands of respondents but with less control over total sample size.

The Design Effect

Cluster sampling is less precise than simple random sampling because people within the same cluster tend to be more alike than people across different clusters. Patients at the same hospital share the same care protocols. Students at the same school share the same teachers. This within-cluster similarity inflates your effective sampling error.

The design effect (DEFF) quantifies this inflation:

DEFF = 1 + (average cluster size - 1) x ICC

Where ICC is the intraclass correlation coefficient, a measure of how similar individuals are within clusters relative to the total population. ICC values in survey research typically range from 0.01 to 0.05, but can be higher for tightly clustered variables.

Example: If your average cluster size is 50 and the ICC is 0.03:

DEFF = 1 + (50 - 1) x 0.03 = 1 + 1.47 = 2.47

This means you need 2.47 times as many respondents as you would with simple random sampling to achieve the same precision. A study that would need 400 respondents under simple random sampling needs about 988 under this cluster design.

Cluster Sampling vs. Stratified Sampling

Feature	Cluster Sampling	Stratified Sampling
Groups should be	Internally diverse (heterogeneous within)	Internally similar (homogeneous within)
Groups sampled	Random subset	All groups
Individual selection	Random within selected groups	Random within all groups
Precision vs. SRS	Lower	Higher
Primary advantage	Cost reduction	Precision improvement
Best when	Population is geographically dispersed	Key subgroups need guaranteed representation

The key distinction: in stratified sampling, you sample from every stratum. In cluster sampling, you skip entire clusters. Stratification improves precision; clustering saves money.

When to Use Cluster Sampling

Large, geographically dispersed populations where reaching individuals directly would require unrealistic travel or logistics, national consumer studies, multi-site employee surveys, public health research
No individual-level sampling frame exists, but a list of clusters does: you may not have a list of all patients in a region, but you have a list of all hospitals
Budget constraints make individual-level probability sampling infeasible: cluster sampling trades precision for affordability
Multi-site organizational research: studying practices across schools, franchises, clinics, or warehouses where site-level variation is part of the research question
Paired with stratification for efficiency: stratify clusters first (e.g., by size or region), then randomly select clusters within each stratum to get the cost benefits of clustering with some of the precision benefits of stratification

Common Mistakes to Avoid

Ignoring the design effect in sample size calculations. Using simple random sampling formulas for cluster designs underestimates the sample you need and overstates the precision of your results. Always multiply your base sample size by the DEFF.
Creating clusters that are too homogeneous internally. If each cluster contains only similar people (e.g., all high-income or all rural), between-cluster differences will dominate your results and precision drops sharply.
Selecting too few clusters. Precision in cluster sampling depends more on the number of clusters than on the number of individuals per cluster. Twenty clusters of 25 people typically outperforms 5 clusters of 100 people.
Analyzing clustered data as if it were simple random. Standard error estimates that ignore clustering will be too small, making results look more statistically significant than they are. Use software that accounts for complex survey designs.
Confusing cluster sampling with convenience sampling. Randomly selecting 10 stores and surveying everyone there is cluster sampling. Choosing 10 stores near the head office because they're convenient is not.

How Quali-Fi Supports Cluster Sampling

Quali-Fi's Research platform supports multi-site study designs with per-location survey deployment, real-time monitoring of response rates by cluster, and data exports that preserve cluster identifiers for proper analysis. The platform's multi-channel distribution (web, email, SMS, QR code, kiosk) lets you deploy through different channels at different cluster sites based on what works best for each location's population.

Deploy multi-site research with Quali-Fi

Frequently Asked Questions

How many clusters should I select?

More clusters with fewer individuals each generally produces better estimates than fewer clusters with more individuals. A common guideline is a minimum of 30 clusters for stable variance estimation. If budget allows, 50+ clusters is preferable. The exact number depends on your ICC, desired precision, and budget constraints.

Can I use cluster sampling for online research?

Cluster sampling is most relevant when geography or organizational structure creates natural groupings that affect logistics or cost. For purely online studies where reaching any individual costs the same regardless of location, cluster sampling doesn't offer its usual cost advantage. You might still cluster by organizational unit (company, department) if that's how your population is structured.

What's a typical ICC value for market research?

Most survey variables in market research have ICC values between 0.01 and 0.05 when clusters are geographic areas or retail locations. Attitudinal variables tend to have lower ICCs than behavioral ones. Satisfaction scores within a single store location might have an ICC of 0.03-0.08, since the service experience is partly shared.

What Is Cluster Sampling?

Why Cluster Sampling Matters in Research

How Cluster Sampling Works

Defining Clusters

One-Stage vs. Two-Stage Cluster Sampling

Worked Example

The Design Effect

Cluster Sampling vs. Stratified Sampling

When to Use Cluster Sampling

Common Mistakes to Avoid

How Quali-Fi Supports Cluster Sampling

Frequently Asked Questions

How many clusters should I select?

Can I use cluster sampling for online research?

What's a typical ICC value for market research?

Frequently Asked Questions

Related Guides

Sampling Methods in Research: Complete Guide to Choosing the Right Approach

Probability Sampling: Methods, Examples, and When to Use Each

Stratified Sampling: How It Works, When to Use It, and Examples

Simple Random Sampling: What It Is, How to Do It, and When to Use It

Systematic Sampling: How It Works, Periodicity Risk, and Worked Example

Random Sampling: What It Is, How It Works, and Why It Matters

Sample Size Determination: Formulas, Factors, and Practical Guidelines

Ready to apply this in your research?

Cluster Sampling: One-Stage vs. Two-Stage, Design Effect, and Examples

What Is Cluster Sampling?

Why Cluster Sampling Matters in Research

How Cluster Sampling Works

Defining Clusters

One-Stage vs. Two-Stage Cluster Sampling

Worked Example

The Design Effect

Cluster Sampling vs. Stratified Sampling

When to Use Cluster Sampling

Common Mistakes to Avoid

How Quali-Fi Supports Cluster Sampling

Frequently Asked Questions

How many clusters should I select?

Can I use cluster sampling for online research?

What's a typical ICC value for market research?

Related Topics

Frequently Asked Questions

Related Guides

Sampling Methods in Research: Complete Guide to Choosing the Right Approach

Probability Sampling: Methods, Examples, and When to Use Each

Stratified Sampling: How It Works, When to Use It, and Examples

Simple Random Sampling: What It Is, How to Do It, and When to Use It

Systematic Sampling: How It Works, Periodicity Risk, and Worked Example

Random Sampling: What It Is, How It Works, and Why It Matters

Sample Size Determination: Formulas, Factors, and Practical Guidelines

Ready to apply this in your research?