What does "Generalizability" mean?

Definition of Generalizability in the context of A/B testing (online controlled experiments).

What is Generalizability?

Aliases: external validity, representativeness

Generalizability of an online controlled experiment refers to the predictive value of its outcomes: how well the results generalize to time periods and populations other than the test duration and the population that experienced the treatment. "Representativeness" is a term with the same meaning, as is "external validity" in more scientific contexts.

It should not be confused with the statistical validity of the test (the adequacy of its statistical model), nor with the types of errors controlled by statistical methods, such as type I and type II errors. When viewed in the context of the primary KPI, these apply only to the internal validity of a test.

The generalizability of the outcome of an A/B test can be threatened by many factors external to the test itself. There are three main types of such factors: time-related, population-change-related, and novelty/learning-related. Some of these are examined in separate glossary entries on seasonality, learning effects, novelty effects, cookie churn, survivorship bias, and selection bias.

Ways to improve generalizability include managing the test duration so that the data is balanced across important known factors (acquiring a "representative sample"), checking for strong trends within the test period, and checking whether trends persist after the test has ended and been switched back to control vs. control (an A/A test), among others. None of these is flawless, and all of them rest on statistical premises which may themselves need testing.
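Two of these measures can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a prescribed procedure: it assumes that day of the week is the known factor to balance for, and that daily conversion rates per arm are available. The function names `round_up_to_full_weeks` and `daily_lift_slope` are hypothetical, and the rates in the usage example are made up purely for illustration.

```python
import math
from typing import Sequence


def round_up_to_full_weeks(planned_days: int) -> int:
    """Round a planned duration up to whole weeks.

    Assumption: day of week is the known factor to balance, so every
    weekday should appear the same number of times in the test data.
    """
    if planned_days <= 0:
        raise ValueError("planned_days must be positive")
    return math.ceil(planned_days / 7) * 7


def daily_lift_slope(control_rates: Sequence[float],
                     variant_rates: Sequence[float]) -> float:
    """Least-squares slope of the daily relative lift over the day index.

    A slope far from zero hints at a trend within the test period
    (for example a fading novelty effect). It is a descriptive check
    only, not a significance test. Control rates are assumed nonzero.
    """
    if len(control_rates) != len(variant_rates) or len(control_rates) < 2:
        raise ValueError("need two equally long series of at least 2 days")
    lifts = [(v - c) / c for c, v in zip(control_rates, variant_rates)]
    n = len(lifts)
    mean_x = (n - 1) / 2          # mean of day indices 0..n-1
    mean_y = sum(lifts) / n
    sxy = sum((i - mean_x) * (y - mean_y) for i, y in enumerate(lifts))
    sxx = sum((i - mean_x) ** 2 for i in range(n))
    return sxy / sxx


# Illustrative, made-up daily conversion rates for one week.
control = [0.10] * 7
variant = [0.130, 0.125, 0.120, 0.115, 0.110, 0.105, 0.100]

duration = round_up_to_full_weeks(10)        # 10 planned days become 14
slope = daily_lift_slope(control, variant)   # roughly -0.05 lift per day
```

In this made-up example the relative lift shrinks by about five percentage points per day, which is the kind of pattern that would call for a longer test or a closer look before trusting the aggregate result. Daily noise alone can also produce a nonzero slope, so in practice the check would need its own statistical treatment.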

Generalizability is an unsolvable problem in the long term due to the adaptive nature of human behavior as well as the ever-changing technological and competitive context. The measures described above can help alleviate concerns about short- and mid-term generalizability, as well as generalizability across populations.

Like this glossary entry? For an in-depth and comprehensive reading on A/B testing stats, check out the book "Statistical Methods in Online A/B Testing" by the author of this glossary, Georgi Georgiev.

Articles on Generalizability

Representative samples and generalizability of A/B testing results
blog.analytics-toolkit.com

Related A/B Testing terms

Learning Effect, Seasonality, Novelty Effect, Cookie Churn, Survivorship Bias, Selection Bias

About the author

Georgi Z. Georgiev

Georgi has over twenty years of experience in online marketing, web analytics, statistics, and design of business experiments.

Author of the book "Statistical Methods in Online A/B Testing", white papers on statistical analysis of A/B tests, and a speaker, he has been distinguished as a winner in the Data & Analytics category of the 2024 Experimentation Thought Leadership Awards.

