Homework 2

Problem 1

Let $X_1, \ldots, X_n$ be i.i.d. random variables with a Poisson distribution with parameter $\lambda$. Find the maximum likelihood estimator for $\lambda$.

Solution.

The log-likelihood function is

$$\begin{aligned} \ln L\p{\lambda} &= \ln \prod_{i=1}^n \frac{\lambda^{x_i}}{x_i!} e^{-\lambda} \\ &= \sum_{i=1}^n x_i \ln\lambda - \sum_{i=1}^n \ln\p{x_i!} - n\lambda \\ &= n\mean{x} \ln\lambda - \sum_{i=1}^n \ln\p{x_i!} - n\lambda. \end{aligned}$$

To optimize, we take the derivative with respect to $\lambda$ and set it equal to $0$ to get

$$\begin{gathered} \frac{\partial \ln L\p{\lambda}}{\partial \lambda} = \frac{n\mean{x}}{\lambda} - n = 0 \\ \implies \boxed{\widehat{\lambda} = \mean{X}}. \end{gathered}$$
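
As a quick sanity check, here is a minimal numpy sketch (the simulated sample, the true $\lambda = 3.5$, and the grid bounds are all arbitrary choices for illustration) that maximizes the log-likelihood numerically and compares against the sample mean:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.poisson(lam=3.5, size=10_000)  # simulated data; true lambda = 3.5 is arbitrary

# Log-likelihood over a grid, dropping the constant sum(ln(x_i!)) term
lams = np.linspace(0.1, 10, 100_000)
log_lik = np.sum(x) * np.log(lams) - len(x) * lams

print(lams[np.argmax(log_lik)], np.mean(x))  # the two should agree
```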

Problem 2

You throw a fair coin $30$ times. You observe that $5$ times it lands on heads, and $25$ times it lands on tails. What is the maximum likelihood estimate for the probability that a single coin toss gives you heads?

Solution.

The MLE for a Bernoulli trial is $\widehat{p} = \mean{X}$, where $X_i = 1$ if the coin lands on heads and $0$ otherwise. In our case, we have the observed value $\mean{X} = \frac{5}{30} = \frac{1}{6}$, so

$$\boxed{\widehat{p} = \frac{1}{6}}.$$
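
Again as a numerical check (a sketch in numpy; the grid resolution is an arbitrary choice), the binomial log-likelihood for $5$ heads in $30$ tosses peaks at $1/6$:

```python
import numpy as np

# Binomial log-likelihood for 5 heads out of 30 tosses (constant term omitted)
p = np.linspace(1e-6, 1 - 1e-6, 1_000_001)
log_lik = 5 * np.log(p) + 25 * np.log(1 - p)

print(p[np.argmax(log_lik)])  # ≈ 0.1667 = 1/6
```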

Problem 3

Let $y > 0$, $\theta > 1$. Define

$$f\p{x \given y, \theta} = \theta y^\theta x^{-\theta-1}, \quad x \geq y.$$
  1. Show that $f\p{x \given y, \theta}$ is a probability density function.
  2. Assume that $y > 0$ is given. Let $X_1, \ldots, X_n$ be i.i.d. random variables with probability density function $f\p{x \given y, \theta}$. Find the maximum likelihood estimator for $\theta$.
Solution.
  1. Since $\theta, y > 0$ and $x \geq y > 0$, we have $f \geq 0$, so it remains to check that $f$ integrates to $1$. We integrate with respect to $x$:

    $$\begin{aligned} \int_y^\infty \theta y^\theta x^{-\theta-1} \,\diff{x} &= \theta y^\theta \int_y^\infty x^{-\theta-1} \,\diff{x} \\ &= \theta y^\theta \left. \frac{x^{-\theta}}{-\theta} \right\rvert_y^\infty \\ &= \theta y^\theta \p{0 - \frac{y^{-\theta}}{-\theta}} \\ &= 1. \end{aligned}$$

    Note that $\lim_{x\to\infty} x^{-\theta} = 0$ since $\theta > 1 > 0$.

  2. The log-likelihood function is

    $$\begin{aligned} \ln L\p{\theta} &= \ln \prod_{i=1}^n \theta y^\theta x_i^{-\theta-1} \\ &= n\ln\theta + n\theta \ln y - \p{\theta + 1} \sum_{i=1}^n \ln x_i. \end{aligned}$$

    Setting its derivative equal to $0$, we get

    $$\begin{gathered} \frac{\partial \ln L\p{\theta}}{\partial \theta} = \frac{n}{\theta} + n\ln y - \sum_{i=1}^n \ln x_i = 0 \\ \implies \boxed{\widehat{\theta} = \frac{1}{\frac{1}{n} \sum_{i=1}^n \ln X_i - \ln y}}. \end{gathered}$$

    Both parts are sanity-checked numerically right after this solution.
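
Here is a minimal numpy sketch for both parts (the test values $y = 2$ and $\theta = 3$, the integration grid, and the inverse-CDF sampler are all assumptions made just for this check): part 1 integrates the density numerically, and part 2 compares the closed-form MLE against a grid maximization of the log-likelihood.

```python
import numpy as np

rng = np.random.default_rng(0)
y, theta = 2.0, 3.0  # arbitrary test values with y > 0 and theta > 1

# Part 1: the density should integrate to (approximately) 1.
x = np.linspace(y, 1_000, 2_000_000)
f = theta * y**theta * x ** (-theta - 1)
print(np.sum((f[:-1] + f[1:]) / 2 * np.diff(x)))  # trapezoidal rule, ≈ 1

# Part 2: sample via the inverse CDF, F^{-1}(u) = y * (1 - u)^(-1/theta),
# since F(x) = 1 - (y/x)^theta for x >= y.
u = rng.uniform(size=10_000)
samples = y * (1 - u) ** (-1 / theta)

theta_hat = 1 / (np.mean(np.log(samples)) - np.log(y))

thetas = np.linspace(0.5, 10, 100_000)
log_lik = (len(samples) * np.log(thetas) + len(samples) * thetas * np.log(y)
           - (thetas + 1) * np.sum(np.log(samples)))
print(theta_hat, thetas[np.argmax(log_lik)])  # the two should agree, both ≈ 3
```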

Problem 4

The midterm and final exam scores for $10$ students in a statistics course are tabulated as shown below.

$$\begin{array}{r|rrrrrrrrrr} \text{Midterm} & 62 & 74 & 66 & 30 & 84 & 45 & 14 & 70 & 82 & 42 \\ \text{Final} & 71 & 84 & 76 & 36 & 95 & 53 & 18 & 80 & 93 & 49 \end{array}$$
  1. Calculate the least squares regression line for these data.
  2. Plot the points and the least squares regression line on the same graph.
  3. Draw the residuals plot. What does this tell you?
  4. Find the maximum likelihood estimate for $\sigma^2$.
Solution.

The least squares regression line is given by $y = \widehat{\alpha} + \widehat{\beta} x$, and the maximum likelihood estimate for the variance $\sigma^2$ of the residuals is $\widehat{\sigma}^2$, where

$$\boxed{ \begin{aligned} \widehat{\alpha} &\approx 2.98 \\ \widehat{\beta} &\approx 1.10 \\ \widehat{\sigma}^2 &\approx 0.09 \end{aligned} }$$

The residuals are relatively small, so this is a good fit.

Python Code

I used numpy since it's faster to type (and runs faster), but you can do the computations without it if you don't want to. However, using numpy makes plotting with matplotlib a bit easier.

```python
from matplotlib import pyplot as plt
import numpy as np

x = np.array([62, 74, 66, 30, 84, 45, 14, 70, 82, 42])
y = np.array([71, 84, 76, 36, 95, 53, 18, 80, 93, 49])

x_bar = np.mean(x)
y_bar = np.mean(y)

beta_hat = np.sum((y - y_bar) * (x - x_bar)) / np.sum((x - x_bar) ** 2)
alpha_hat = y_bar - beta_hat * x_bar
sigma2_hat = np.mean((y - alpha_hat - beta_hat * x) ** 2)

# Or without numpy:
# Or without numpy (here n = len(x) = 10):
# beta_hat = sum([(y[i] - y_bar) * (x[i] - x_bar) for i in range(n)]) / sum(
#     [(x[i] - x_bar) ** 2 for i in range(n)]
# )
# alpha_hat = y_bar - beta_hat * x_bar
# sigma2_hat = sum([(y[i] - alpha_hat - beta_hat * x[i]) ** 2 for i in range(n)]) / n

print(f"alpha_hat:  {alpha_hat}")   # 2.9788895969084805
print(f"beta_hat:   {beta_hat}")    # 1.0987892865218194
print(f"sigma2_hat: {sigma2_hat}")  # 0.09218688976135175

fig = plt.figure(figsize=(8, 16))

# Plot sample and regression line
ax = fig.add_subplot(2, 1, 1)
ax.set_title(
    f"$\\alpha$ (intercept): {alpha_hat:>.2f}, $\\beta$ (slope): {beta_hat:>.2f}"
)
ax.set_xlim(0, np.max(x))
ax.set_ylim(0, np.max(y))
ax.scatter(x, y, c="black", label="sample")
ax.axline((0, alpha_hat), slope=beta_hat, label="regression line", zorder=-1)
ax.legend()

# Plot residuals
ax = fig.add_subplot(2, 1, 2)
ax.set_title("residuals")
ax.spines["top"].set_visible(False)
ax.spines["right"].set_visible(False)
ax.spines["bottom"].set_position("zero")
ax.set_ylim(-1, 1)
ax.set_yticks(np.linspace(-1, 1, 9))
ax.scatter(x, y - (alpha_hat + beta_hat * x), c="black", label="residual")
ax.legend()

plt.show()
```
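
As an optional cross-check (not part of the original computation), numpy's built-in least squares fit should reproduce the same coefficients:

```python
import numpy as np

x = np.array([62, 74, 66, 30, 84, 45, 14, 70, 82, 42])
y = np.array([71, 84, 76, 36, 95, 53, 18, 80, 93, 49])

# polyfit returns coefficients from highest degree down: [slope, intercept]
beta_check, alpha_check = np.polyfit(x, y, deg=1)
print(alpha_check, beta_check)  # should match alpha_hat and beta_hat above
```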

Problem 5

For $\alpha > 0$, $\beta > 0$, the Beta distribution has probability density function

$$f\p{x \given \alpha, \beta} = \frac{\Gamma\p{\alpha + \beta}}{\Gamma\p{\alpha} \Gamma\p{\beta}} x^{\alpha-1} \p{1 - x}^{\beta-1}, \quad x \in \p{0, 1}.$$

Find the method of moments estimator for $\alpha$ and $\beta$.

Solution.

This is a straightforward calculation. The mean and variance of a Beta distribution are

$$\E\br{X} = \frac{\alpha}{\alpha + \beta}, \quad \Var X = \frac{\alpha\beta}{\p{\alpha + \beta}^2\p{\alpha + \beta + 1}}.$$

Note that

$$\frac{1}{n} \sum_{i=1}^n \p{x_i - \mean{x}}^2 = \frac{1}{n} \sum_{i=1}^n x_i^2 - \mean{x}^2,$$

so when computing the method of moments estimator, we can just set

$$v = \frac{1}{n} \sum_{i=1}^n \p{x_i - \mean{x}}^2 = \Var X$$

instead of using the second moment. This is only slightly easier, and the computations will be similar.

From the first moment,

$$\mean{x} = \frac{\widehat{\alpha}}{\widehat{\alpha} + \widehat{\beta}} \implies \widehat{\beta} = \p{\frac{1 - \mean{x}}{\mean{x}}} \widehat{\alpha} \quad\text{and}\quad \widehat{\alpha} + \widehat{\beta} = \frac{\widehat{\alpha}}{\mean{x}},$$

which we can substitute into

$$\begin{aligned} v &= \frac{\widehat{\alpha}\widehat{\beta}}{\p{\widehat{\alpha} + \widehat{\beta}}^2\p{\widehat{\alpha} + \widehat{\beta} + 1}} \\ &= \frac{\p{\frac{1 - \mean{x}}{\mean{x}}} \widehat{\alpha}^2}{\p{\frac{\widehat{\alpha}}{\mean{x}}}^2 \p{\frac{\widehat{\alpha}}{\mean{x}} + 1}} \\ &= \frac{\p{1 - \mean{x}}\mean{x}}{\frac{\widehat{\alpha}}{\mean{x}} + 1} \\ \implies \widehat{\alpha} &= \p{\frac{\p{1 - \mean{x}}\mean{x}}{v} - 1} \mean{x}. \end{aligned}$$

For $\widehat{\beta}$,

$$\widehat{\beta} = \p{\frac{1 - \mean{x}}{\mean{x}}} \widehat{\alpha} = \p{\frac{\p{1 - \mean{x}}\mean{x}}{v} - 1} \p{1 - \mean{x}},$$

so in summary,

$$\boxed{ \begin{aligned} \widehat{\alpha} &= \p{\frac{\p{1 - \mean{x}}\mean{x}}{v} - 1} \mean{x} \\ \widehat{\beta} &= \p{\frac{\p{1 - \mean{x}}\mean{x}}{v} - 1} \p{1 - \mean{x}} \end{aligned} }$$
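
As a final sanity check, here is a minimal numpy sketch (the "true" parameters $\alpha = 2$, $\beta = 5$ and the sample size are arbitrary choices for the simulation): simulating from a known Beta distribution and plugging the sample moments into the boxed formulas should approximately recover the parameters.

```python
import numpy as np

rng = np.random.default_rng(0)
alpha, beta = 2.0, 5.0  # arbitrary "true" parameters for the simulation
samples = rng.beta(alpha, beta, size=100_000)

x_bar = np.mean(samples)
v = np.mean((samples - x_bar) ** 2)  # biased sample variance, as in the derivation

common = (1 - x_bar) * x_bar / v - 1  # shared factor in both estimators
alpha_hat = common * x_bar
beta_hat = common * (1 - x_bar)
print(alpha_hat, beta_hat)  # should be close to (2, 5)
```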