Empty Model (Null Model)
An empty model is a statistical model that describes a quantitative outcome variable using only a single overall value, typically the mean of the response. It is called empty because it contains no explanatory (predictor) variables.
“If we ignore all predictors, what single value best represents the data?”
Even though they are very simple, empty models are important because they:
Provide a baseline for comparison with more complex models
Represent the model you would use if you had no information other than the outcome itself
Are often the starting point for model building and hypothesis testing
Many modeling procedures compare a more complex model to the empty model to ask whether adding predictors actually improves prediction.
For an outcome variable ( Y ), the empty model can be represented as follows:
OR
OR
Every observation is predicted to be the same value (the sample mean)
All variation in the data is treated as unexplained noise
Suppose you are modeling exam scores for a class.
The empty model predicts that every student will score the class average.
It does not use study time, attendance, or prior GPA.
Any difference between a student’s actual score and the mean is considered error.
Using a linear model with no predictors:
# Generic format
empty_model <- lm(Y ~ NULL, data = data_set)
# student score example
empty_model <- lm(score ~ NULL, data = exams)
~ NULL means “model the response using only an intercept”
The intercept estimate is the mean of score
mean(exams$score)
This value will match the intercept from the empty model.
model_with_predictor <- lm(score ~ study_hours, data = exams)
The empty model assumes no relationship between predictors and the response
The model with predictors tests whether including variables like study_hours improves predictions beyond the mean
The empty model is the simplest possible model for a quantitative variable.
It uses the mean as the prediction for all observations and serves as a critical reference point for understanding and evaluating more complex models.