Computing Minimum Sample Size for A/B Tests in Statsmodels: How and Why

Author:Murphy | View: 26599 | Time: 2025-03-22 21:32:54

Introduction

There is currently no good resource on how Statsmodels computes the minimum sample size.

It is critical to calculate the minimum sample size required before conducting an A/B test. A popular way to do it is by calling the tt_ind_solve_power function in Python's Statsmodels package, but there are currently 2 gaps when it comes to understanding how it works:

There are many great articles (e.g. by Stan Nsky, TDS 2019) explaining what the parameters mean and provide examples of function calls. However, they do not explain how the function actually computes the sample size and why the procedure is correct.
There are also many great articles (e.g. by Mintao Wei, TDS 2023) that explain the statistical derivation based on a z-test for proportions such as conversion rates, which is also a popular choice for many online sample size calculators (e.g. Evan Miller's Calculator). However, this is not the method used by Statsmodels and results can differ.

This is important for data scientists because Statsmodels is commonly used to compute sample size in Python.

Data scientists frequently use Statsmodels to get the minimum sample size, but may not be aware that it employs a different method from what most articles describe and what most online calculators use. It is essential to understand how the function works so that we can trust its results.

This article bridges the gap by explaining how Statsmodels actually works.

This article aims to make the novel contribution of explaining how tt_ind_solve_power actually computes the sample size, why the procedure is correct and what benefits it brings over closed-form solutions. [1]

Part 1: It will first explain how sample size is computed and why the procedure is correct in two steps:

Show the statistical derivation for sample size calculations.
Write a stripped-down version of tt_ind_solve_power that is an exact implementation of the statistical derivation and produces the same output as the original function

Part 2: Following which, it will explain two benefits it brings over closed-form solutions:

Benefits to generalizability
Benefits to statistical intuition

Part 1: How Statsmodels computes minimum sample size and why it is correct

1.1. Showing the statistical derivation for sample size calculations

Core Idea

A general A/B test is an unpaired two-sample t-test. Rather than using a closed-form solution, Statsmodels obtains the minimum sample size in two steps:

For a given sample size, compute the associated power of the test.
Run a numerical optimization algorithm to find the sample size that returns the target power of the test.

Notation and Concepts

These are some terms we will use throughout the article:

n: minimum required sample size. n = n_1 + n_2
n_1, n_2: minimum required sample size for the treatment and control group, respectively
ratio: n_2 = n_1 * ratio, where for a 50:50 allocation, ratio = 1
p: p-value
Tags: A B Testing Data Science Hands On Tutorials Product Analytics Sample Size

Add Fav

Comment

Murphy

Add friends

View space

Message

Recommend

◦ The Power of Pandas Plots: Backends

◦ Deploying SageMaker Endpoints With Terraform

◦ Visualize Endangered Animal Populations with Python

◦ How I Became a Data Scientist at Meta Without A "Perfect" Degree

◦ Another (Conformal) Way to Predict Probability Distributions

◦ Understand SQL Injection and Learn to Avoid It in Python with SQLAlchemy

◦ Revisiting the Death of Data Science

◦ 9.11 or 9.9 – which one is higher?

◦ Optimizing Multi-task Learning Models in Practice

◦ Monitoring Sea Surface Temperature at the global level with GEE

◦ How to Use and Test WizardLM2: Microsoft's New LLM

◦ Breaking boundaries in protein design with a new AI model that understands interactions with anyR