What does "Overall Evaluation Criterion" mean?

Definition of Overall Evaluation Criterion in the context of A/B testing (online controlled experiments).

What is an Overall Evaluation Criterion?

Alias: OEC

An Overall Evaluation Criterion (OEC) is a (usually composite) quantitative measure of the experiment’s objective. Other names include Response or Dependent Variable, Outcome Variable, Evaluation metric, Performance metric. Most often one speaks of an overall evaluation criterion when a single key performance indicator is deemed insufficient for the evaluation of the outcome of an online controlled experiment and separate evaluation of metrics is not desirable for some reason. In such a case a weighted combination KPIs as a primary KPI is deemed desirable and is recommended as long as there is agreement on the relative weights.

According to R.Kohavi et al. ^[1] "a single metric forces trade-offs to be made once for multiple experiments and aligns the organization behind a clear objective. A good OEC should not be short-term focused (e.g., clicks); to the contrary, it should include factors that predict long-term goals, such as predicted lifetime value and repeat visits." So, ideally, an OEC is based on metrics in a short-term experiment that are good predictors of long-term value.

An example provided by Kohavi et. al. is "units purchased, revenue, profit, expected lifetime value, or some weighted combination of these", but a conversion rate optimization practitioner should make the decision on a suitable overall evaluation criterion based on the particular business objectives, measurement capabilities and established knowledge from prior experimental and observational data.

References:
[1] Kohavi R. et al. (2009) "Controlled experiments on the web - survey and practical guide"

Like this glossary entry? For an in-depth and comprehensive reading on A/B testing stats, check out the book "Statistical Methods in Online A/B Testing" by the author of this glossary, Georgi Georgiev.

Related A/B Testing terms

Online Controlled Experiment Key Performance Indicator

About the author

Georgi Z. Georgiev

Georgi has over twenty years of experience in online marketing, web analytics, statistics, and design of business experiments.

Author of the book "Statistical Methods in Online A/B Testing", white papers on statistical analysis of A/B tests, and a speaker, he has been distinguished as a winner in the Data & Analytics category of the 2024 Experimentation Thought Leadership Awards.

Statistical Methods in Online A/B Testing

Take your A/B testing program to the next level with the most comprehensive book on user testing statistics in e-commerce.

Learn more

Glossary index by letter

A B C D E F G H I K L M N O P R S T U V Z

Select a letter to see all A/B testing terms starting with that letter or visit the Glossary homepage to see all.