What does "Null Hypothesis" mean?

Definition of Null Hypothesis in the context of A/B testing (online controlled experiments).

What is a Null Hypothesis?

The null hypothesis in the colloquial sense is a claim that we want to put to the test. It is usually the default, inherited state of matters, or a state of matters that would be preferred if no further information is available (we do not test). In other words, it should correspond to the position of a critic, skeptical to our proposed alternative or new treatment (the alternative hypothesis). In the strict statistical sense of the term it is defined more strictly as a statistical model (see hypothesis).

The null hypothesis is usually denoted by H₀.

There are different kinds of null hypotheses, depending on the inference we want to make in the case at hand but in all cases the null and alternative should exhaust the entire parameter space (Θ₀ ∪ Θ₁ = Θ, Θ₀ ∩ Θ₁ = ∅). Most often in A/B tests we set up a superiority alternative such that the null hypothesis becomes H₀: μ ≤ 0 or H₀: μ ≤ m where m is some positive discrepancy. However, sometimes the alternative is that of non-inferiority so the null becomes H₀ ≤ -m where m is the magnitude of the non-inferiority margin.

The null hypothesis should not be confused with the nil hypothesis (no difference hypothesis) although the two can coincide in the case of particular two-sided tests. The null hypothesis can be defined as any point or range which corresponds to a claim that needs to be tested, e.g. μ ≤ 2.

From a practical standpoint the null hypothesis is what gives the p-value a meaningful interpretation since the probability expressed by it is calculated under the assumption that the null is true. Upon observing a p-value below our chosen significance threshold we have sufficient evidence to reject the null hypothesis using the modus tollens logic or the argument from coincidence. We effectively shift the burden of proof to the one who would wand to make a claim corresponding to the null hypothesis.

The null can be redefined after a test is completed in order to check how warranted different conclusions are. For example, if you observed a given mean value x and the result is statistically significant for the original H₀: μ < ≤ 0 if you were to define H1₀: μ ≤ x then you know immediately that you have poor evidence to reject H1₀ since the probability of observing this value assuming H1₀ is true is exactly 0.5, thus very likely.

A confidence interval can also be used to test a null hypothesis: every null hypothesis defined over any set of values not covered by the confidence interval can be rejected at significance level equal to 1 minus the confidence level, e.g. if a 95% confidence interval covers the values from 0.01 to +∞ then a null that covers the values from -∞ to 0.005 can be rejected at the 1-0.95 = 0.05 level.

Like this glossary entry? For an in-depth and comprehensive reading on A/B testing stats, check out the book "Statistical Methods in Online A/B Testing" by the author of this glossary, Georgi Georgiev.

Articles on Null Hypothesis

A p-value is meaningless without a specified null hypothesis
www.onesided.org

Related A/B Testing terms

Alternative Hypothesis Hypothesis Statistical Model One-Sided Hypothesis Two-Sided Hypothesis

About the author

Georgi Z. Georgiev

Georgi has over twenty years of experience in online marketing, web analytics, statistics, and design of business experiments.

Author of the book "Statistical Methods in Online A/B Testing", white papers on statistical analysis of A/B tests, and a speaker, he has been distinguished as a winner in the Data & Analytics category of the 2024 Experimentation Thought Leadership Awards.

Statistical Methods in Online A/B Testing

Take your A/B testing program to the next level with the most comprehensive book on user testing statistics in e-commerce.

Learn more

Glossary index by letter

A B C D E F G H I K L M N O P R S T U V Z

Select a letter to see all A/B testing terms starting with that letter or visit the Glossary homepage to see all.