Maximum likelihood estimation

Prof. Maria Tackett

Mar 18, 2025

Announcements

  • HW 03 due March 20 at 11:59pm

  • Project exploratory data analysis due March 20 at 11:59pm

    • Next milestone: Project presentations in lab March 28
  • Statistics experience due April 22

Topics

  • Likelihood

  • Maximum likelihood estimation (MLE)

  • MLE for linear regression

Motivation

  • We’ve discussed how to find the estimators of \(\boldsymbol{\beta}\) and \(\sigma^2_{\epsilon}\) for the model

\[ \mathbf{y} = \mathbf{X}\boldsymbol{\beta} + \boldsymbol{\epsilon}, \hspace{10mm} \boldsymbol{\epsilon} \sim N(\mathbf{0}, \sigma^2_\epsilon\mathbf{I}) \] using least-squares estimation

  • Today we will introduce another way to find these estimators: maximum likelihood estimation.

  • We will see the least-squares estimator is equal to the maximum likelihood estimator when certain assumptions hold

Maximum likelihood estimation

Example: Basketball shots

Suppose a basketball player shoots the ball such that the probability of making the basket (successfully making the shot) is \(p\)

  • What is the probability distribution for this random phenomenon? (One answer is sketched after this list.)

  • Suppose the probability is \(p = 0.5\). What is the probability the player makes a single basket, given this value of \(p\)?

  • Suppose the probability is \(p = 0.8\). What is the probability the player makes a single basket, given this value of \(p\)?
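One way to answer these questions: a single shot can be modeled as a Bernoulli trial with success probability \(p\), so

\[ P(X = x | p) = p^x(1 - p)^{1-x}, \quad x \in \{0, 1\}, \qquad P(X = 1 | p = 0.5) = 0.5, \qquad P(X = 1 | p = 0.8) = 0.8 \]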

Shooting the ball three times

Suppose the player shoots the ball three times. The shots are independent, and the player has the same probability \(p\) of making each basket.

Let \(B\) represent a made basket, and \(M\) represent a missed basket. The player shoots the ball three times with the outcome \(BBM\).

  • Suppose the probability is \(p = 0.5\). What is the probability of observing the data \(BBM\), given this value of \(p\)?

  • Suppose the probability is \(p = 0.3\). What is the probability of observing the data \(BBM\), given this value of \(p\)? (Both cases are worked briefly below.)
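Because the shots are independent, the probability of the sequence is the product of the per-shot probabilities:

\[ P(BBM | p = 0.5) = 0.5 \times 0.5 \times 0.5 = 0.125, \qquad P(BBM | p = 0.3) = 0.3 \times 0.3 \times 0.7 = 0.063 \]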

Shooting the ball three times

Suppose the player shoots the ball three times. The shots are independent, and the player has the same probability \(p\) of making each basket.

The player shoots the ball three times with the outcome \(BBM\).

  • New question: What parameter value of \(p\) do you think maximizes the probability of observing this data?

  • We will use a likelihood function to answer this question.

Likelihood

  • A likelihood function is a measure of how likely we are to observe our data under each possible value of the parameter(s)

  • Note that this is not the same as the probability function.

  • Probability function: Fixed parameter value(s) + input possible outcomes

    • Given \(p=0.8\) , what is the probability of observing \(BBM\) in three basketball shots?
  • Likelihood function: Fixed data + input possible parameter values

    • Given we’ve observed \(BBM\), what is the most plausible value of \(p\)?

Likelihood: Three basketball shots

The likelihood function for the probability of a basket \(p\), given we observed \(BBM\) when shooting the ball three independent times, is \[ L(p|BBM) = p \times p \times (1 - p) \]


Thus, the likelihood for \(p = 0.8\) is

\[ L(p = 0.8|BBM) = 0.8 \times 0.8 \times (1 - 0.8) = 0.128 \]
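A minimal sketch of this calculation in code; the function name and the values of \(p\) checked here are illustrative choices, not part of the slides:

```python
# Likelihood of observing BBM (basket, basket, miss) for a given p
def likelihood_bbm(p):
    return p * p * (1 - p)

for p in [0.5, 0.8]:
    print(f"L(p = {p} | BBM) = {likelihood_bbm(p):.3f}")
# L(p = 0.5 | BBM) = 0.125
# L(p = 0.8 | BBM) = 0.128
```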

Likelihood: Three basketball shots


  • What is the general formula for the likelihood function for \(p\) given the observed data \(BBM\)?

  • How does assuming independence simplify things?

  • How does having identically distributed data simplify things?

Likelihood: Three basketball shots

The likelihood function for \(p\) given the data \(BBM\) is

\[ L(p|BBM) = p \times p \times (1 - p) = p^2 \times (1 - p) \]

  • We want the value of \(p\) that maximizes this likelihood function, i.e., the value of \(p\) that is most likely given the observed data.

  • The process of finding this value is maximum likelihood estimation.

  • There are three primary ways to find the maximum likelihood estimator

    • Approximate using a graph

    • Using calculus

    • Numerical approximation

Finding the MLE using graphs

What do you think is the approximate value of the MLE of \(p\) given the data?
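One way to eyeball this is to evaluate \(L(p) = p^2(1 - p)\) on a fine grid of \(p\) values; the sketch below (package and grid size are my choices) suggests the maximum sits near \(p \approx 0.67\).

```python
import numpy as np

# Evaluate the likelihood L(p) = p^2 (1 - p) on a fine grid of p values
p_grid = np.linspace(0, 1, 1001)
lik = p_grid**2 * (1 - p_grid)

# The grid point with the largest likelihood approximates the MLE
print(p_grid[np.argmax(lik)])  # approximately 0.667
```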

Finding the MLE using calculus

  • Find the MLE using the first derivative of the likelihood function.
  • This can be tricky because of the product rule, so we often maximize the log(Likelihood) instead. Because the log function is monotone increasing, the same value of \(p\) maximizes both the likelihood and the log(Likelihood).

Use calculus to find the MLE of \(p\) given the data \(BBM\).
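A sketch of that derivation, working with the log-likelihood as suggested above:

\[ \begin{aligned} \log L(p|BBM) &= 2\log p + \log(1 - p) \\ \frac{d}{dp}\log L(p|BBM) &= \frac{2}{p} - \frac{1}{1 - p} = 0 \quad\Rightarrow\quad 2(1 - p) = p \quad\Rightarrow\quad \tilde{p} = \frac{2}{3} \end{aligned} \]

The second derivative, \(-2/p^2 - 1/(1-p)^2\), is negative, so this critical point is a maximum.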

Shooting the ball \(n\) times

Suppose the player shoots the ball \(n\) times. The shots are independent, and the player has the same probability \(p\) of making each one.

Suppose the player makes \(k\) baskets out of the \(n\) shots. This is the observed data.

  • What is the formula for the probability distribution to describe this random phenomenon?
  • What is the formula for the likelihood function for \(p\) given the observed data?
  • For what value of \(p\) do we maximize the likelihood given the observed data? Use calculus to find the answer. (A sketch follows this list.)
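A sketch of one set of answers, treating the number of made baskets as a Binomial(\(n, p\)) random variable:

\[ \begin{aligned} L(p | k \text{ baskets in } n \text{ shots}) &= \binom{n}{k}p^k(1 - p)^{n-k} \\ \log L &= \log\binom{n}{k} + k\log p + (n - k)\log(1 - p) \\ \frac{d \log L}{dp} = \frac{k}{p} - \frac{n - k}{1 - p} = 0 &\quad\Rightarrow\quad \tilde{p} = \frac{k}{n} \end{aligned} \]

The binomial coefficient does not involve \(p\), so it does not affect where the maximum occurs.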

MLE in linear regression

Why maximum likelihood estimation?

  • “Maximum likelihood estimation is, by far, the most popular technique for deriving estimators.” (Casella and Berger 2024, 315)

  • MLEs have nice statistical properties (more on this next class)

    • Consistent

    • Efficient

    • Asymptotically normal

Note

If the normality assumption holds, the least squares estimator is the maximum likelihood estimator for \(\boldsymbol{\beta}\). Therefore, it has all the properties of the MLE.

Linear regression

Recall the linear model

\[ \mathbf{y} = \mathbf{X}\boldsymbol{\beta} + \boldsymbol{\epsilon}, \hspace{10mm} \boldsymbol{\epsilon} \sim N(\mathbf{0}, \sigma^2_{\epsilon}\mathbf{I}) \]

  • We have discussed least-squares estimation to find \(\hat{\boldsymbol{\beta}}\) and \(\hat{\sigma}_\epsilon^2\)
  • We have used the fact that \(\hat{\boldsymbol{\beta}} \sim N(\boldsymbol{\beta}, \sigma^2_{\epsilon}(\mathbf{X}^\mathsf{T}\mathbf{X})^{-1})\) when conducting hypothesis tests and constructing confidence intervals.
  • Now we will discuss how we know \(\hat{\boldsymbol{\beta}}\) is normally distributed, as we introduce MLE for linear regression

Simple linear regression model

Suppose we have the simple linear regression (SLR) model

\[ y_i = \beta_0 + \beta_1x_i + \epsilon_i, \hspace{10mm} \epsilon_i \sim N(0, \sigma^2_{\epsilon}) \]

where the \(\epsilon_i\) are independent and identically distributed.


Since \(\beta_0 + \beta_1 x_i\) is a fixed quantity given \(x_i\) and \(\epsilon_i \sim N(0, \sigma^2_{\epsilon})\), we can write the model in the form below and use it to find the MLE

\[ y_i | x_i \sim N(\beta_0 + \beta_1 x_i, \sigma^2_{\epsilon}) \]

Side note: Normal distribution

Let \(Z\) be a random variable such that \(Z \sim N(\mu, \sigma^2)\). Then its probability density function is

\[ p(z | \mu, \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}}\exp\Big\{-\frac{1}{2\sigma^2}(z - \mu)^2\Big\} \]

SLR: Likelihood for \(\beta_0, \beta_1, \sigma^2_{\epsilon}\)

The likelihood function for \(\beta_0, \beta_1, \sigma^2_{\epsilon}\) is

\[ \begin{aligned} L(\beta_0, \beta_1, \sigma^2_{\epsilon} &| x_1, \ldots, x_n, y_1, \ldots, y_n) \\ &= p(y_1|x_1, \beta_0, \beta_1, \sigma^2_{\epsilon}) \dots p(y_n|x_n, \beta_0, \beta_1, \sigma^2_{\epsilon}) \\[5pt] &= \prod_{i=1}^n p(y_i | x_i, \beta_0, \beta_1, \sigma^2_{\epsilon}) \\[5pt] &= \prod_{i=1}^n \frac{1}{\sqrt{2\pi\sigma_\epsilon^2}}\exp\Big\{-\frac{1}{2\sigma_\epsilon^2}(y_i - [\beta_0 + \beta_1x_i])^2\Big\} \\[10pt] &= (2\pi\sigma^2_{\epsilon})^{-\frac{n}{2}}\exp\Big\{-\frac{1}{2\sigma^2_{\epsilon}}\sum_{i=1}^n(y_i - \beta_0 - \beta_1x_i)^2\Big\} \end{aligned} \]

Log-Likelihood for \(\beta_0, \beta_1, \sigma^2_{\epsilon}\)

The log-likelihood function for \(\beta_0, \beta_1, \sigma^2_{\epsilon}\) is

\[ \begin{aligned} \log &L(\beta_0, \beta_1, \sigma^2_{\epsilon} | x_1, \ldots, x_n, y_1, \ldots, y_n) \\[8pt] &= \log\Big((2\pi\sigma^2_{\epsilon})^{-\frac{n}{2}}\exp\Big\{-\frac{1}{2\sigma^2_{\epsilon}}\sum_{i=1}^n(y_i - \beta_0 - \beta_1x_i)^2\Big\}\Big) \\[8pt] &= -\frac{n}{2}\log(2\pi\sigma^2_{\epsilon}) -\frac{1}{2\sigma^2_{\epsilon}}\sum_{i=1}^n(y_i - \beta_0 - \beta_1x_i)^2 \end{aligned} \]
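The numerical approximation route mentioned earlier can be applied directly to this log-likelihood. The sketch below uses simulated data with illustrative true parameter values of my choosing; it maximizes the log-likelihood numerically and compares the result to the closed-form least-squares estimates.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(1)

# Simulate SLR data with illustrative true values beta0 = 2, beta1 = 3, sigma = 1.5
n = 200
x = rng.uniform(0, 10, n)
y = 2 + 3 * x + rng.normal(0, 1.5, n)

def neg_log_lik(theta):
    """Negative SLR log-likelihood; theta = (beta0, beta1, log_sigma)."""
    b0, b1, log_sigma = theta
    sigma2 = np.exp(log_sigma) ** 2
    resid = y - b0 - b1 * x
    return 0.5 * n * np.log(2 * np.pi * sigma2) + np.sum(resid**2) / (2 * sigma2)

# Rough starting values, then derivative-free maximization of the log-likelihood
start = np.array([y.mean(), 0.0, np.log(y.std())])
fit = minimize(neg_log_lik, x0=start, method="Nelder-Mead")
b0_mle, b1_mle, _ = fit.x

# Closed-form least-squares estimates for comparison
b1_ls = np.sum((y - y.mean()) * (x - x.mean())) / np.sum((x - x.mean())**2)
b0_ls = y.mean() - b1_ls * x.mean()
print(b0_mle, b1_mle, "vs", b0_ls, b1_ls)  # the two pairs should agree closely
```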

MLE for \(\beta_0\)

1️⃣ Take derivative of \(\log L\) with respect to \(\beta_0\) and set it equal to 0

\[ \frac{\partial \log L}{\partial \beta_0} = -\frac{2}{2\sigma^2_\epsilon}\sum_{i=1}^n (y_i - \beta_0 - \beta_1x_i)(-1) = 0 \]

MLE for \(\beta_0\)

2️⃣ Find the \(\tilde{\beta}_0\) that satisfies the equality on the previous slide

After a few steps…

\[ \begin{aligned} &\Rightarrow \sum_{i=1}^n y_i - n\tilde{\beta}_0 - \tilde{\beta}_1\sum_{i=1}^n x_i = 0 \\[8pt] &\Rightarrow \sum_{i=1}^n y_i - \tilde{\beta}_1\sum_{i=1}^n x_i = n\tilde{\beta}_0 \\[8pt] &\Rightarrow \frac{1}{n}\sum_{i=1}^n y_i - \frac{1}{n}\tilde{\beta}_1\sum_{i=1}^n x_i = \tilde{\beta}_0 \end{aligned} \]

MLE for \(\beta_0\)

3️⃣ We can use the second derivative to show we’ve found the maximum

\[ \frac{\partial^2 \log L}{\partial \beta_0^2} = -\frac{n}{\tilde{\sigma}^2_\epsilon} < 0 \]


Therefore, we have found the maximum. Thus, the MLE for \(\beta_0\) is

\[ \tilde{\beta}_0 = \bar{y} - \tilde{\beta}_1\bar{x} \]

Note that \(\tilde{\beta}_0\) is equal to \(\hat{\beta}_0\), the least-squares estimate

MLE for \(\beta_1\) and \(\sigma^2_{\epsilon}\)

We can use a similar process to find the MLEs for \(\beta_1\) and \(\sigma^2_{\epsilon}\)

\[ \tilde{\beta}_1 = \frac{\sum_{i=1}^n (y_i - \bar{y})(x_i - \bar{x})}{\sum_{i=1}^n(x_i - \bar{x})^2} \]

\[ \tilde{\sigma}^2_{\epsilon} = \frac{\sum_{i=1}^n(y_i - \tilde{\beta}_0 - \tilde{\beta}_1x_i)^2}{n} = \frac{\sum_{i=1}^ne_i^2}{n} \]

Note: \(\tilde{\beta}_1 = \hat{\beta}_1\) and \(\tilde{\sigma}^2_{\epsilon} \approx \hat{\sigma}^2_{\epsilon}\)
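To make "approximately equal" precise: assuming the usual least-squares estimate for SLR divides the residual sum of squares by \(n - 2\), the two estimators differ only by a factor that vanishes as \(n\) grows,

\[ \tilde{\sigma}^2_{\epsilon} = \frac{\sum_{i=1}^n e_i^2}{n} = \frac{n - 2}{n}\,\hat{\sigma}^2_{\epsilon}, \qquad \hat{\sigma}^2_{\epsilon} = \frac{\sum_{i=1}^n e_i^2}{n - 2} \]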

MLE in matrix form

MLE for linear regression in matrix form

\[ L(\boldsymbol{\beta}, \sigma^2_{\epsilon} | \mathbf{X}, \mathbf{y}) = \frac{1}{(2\pi)^{n/2}\sigma^n_{\epsilon}}\exp\Big\{-\frac{1}{2\sigma^2_{\epsilon}}(\mathbf{y} - \mathbf{X}\boldsymbol{\beta})^\mathsf{T}(\mathbf{y} - \mathbf{X}\boldsymbol{\beta})\Big\} \]

\[ \begin{aligned} \log L(\boldsymbol{\beta}, \sigma^2_\epsilon &| \mathbf{X}, \mathbf{y}) \\ & = -\frac{n}{2}\log(2\pi) - n \log(\sigma_{\epsilon}) - \frac{1}{2\sigma^2_{\epsilon}}(\mathbf{y} - \mathbf{X}\boldsymbol{\beta})^\mathsf{T}(\mathbf{y} - \mathbf{X}\boldsymbol{\beta}) \end{aligned} \]

  1. For a fixed value of \(\sigma_\epsilon\), we know that \(\log L\) is maximized when what is true about \((\mathbf{y} - \mathbf{X}\boldsymbol{\beta})^\mathsf{T}(\mathbf{y} - \mathbf{X}\boldsymbol{\beta})\)?
  2. What does this tell us about the relationship between the MLE and the least-squares estimator for \(\boldsymbol{\beta}\)? (A sketch follows this list.)
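A sketch of where these questions lead: for fixed \(\sigma_\epsilon\), \(\log L\) is maximized when \((\mathbf{y} - \mathbf{X}\boldsymbol{\beta})^\mathsf{T}(\mathbf{y} - \mathbf{X}\boldsymbol{\beta})\) is minimized, which is exactly the least-squares criterion. Assuming \(\mathbf{X}^\mathsf{T}\mathbf{X}\) is invertible,

\[ \frac{\partial}{\partial \boldsymbol{\beta}}(\mathbf{y} - \mathbf{X}\boldsymbol{\beta})^\mathsf{T}(\mathbf{y} - \mathbf{X}\boldsymbol{\beta}) = -2\mathbf{X}^\mathsf{T}(\mathbf{y} - \mathbf{X}\boldsymbol{\beta}) = \mathbf{0} \quad\Rightarrow\quad \tilde{\boldsymbol{\beta}} = (\mathbf{X}^\mathsf{T}\mathbf{X})^{-1}\mathbf{X}^\mathsf{T}\mathbf{y} = \hat{\boldsymbol{\beta}} \]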

Putting it all together

  • The MLE \(\tilde{\boldsymbol{\beta}}\) is equivalent to the least-squares estimator \(\hat{\boldsymbol{\beta}}\) when the errors are independent, identically distributed normal random variables

  • MLEs have nice properties, so this means the least-squares estimator \(\hat{\boldsymbol{\beta}}\) inherits all the nice properties of MLEs

  • The MLE \(\tilde{\sigma}^2_{\epsilon}\) is approximately equal to the least-squares estimator \(\hat{\sigma}^2_\epsilon\). When \(n \gg p\), the difference is trivial

References

Casella, George, and Roger Berger. 2024. Statistical Inference. CRC Press.