Continuous models

Caution

Lecture room change! Effective now (January 20), all lectures will be in MCML 360. It is a 5 minutes walk from the previous room.

Outline

Topics

  • Review of key concepts for continuous random variables.
    • Density.
    • Computing their expectation.
  • Bayes rule with continuous random variables.

Rationale

Since the parameters of distributions families are continuous, it natural to use continuous random variables for them in Bayesian statistics. For example it lets us get rid of the artificial discretization \(K\) we used in last week’s exercise.

Today we will approach the calculations with both manual mathematic derivation and PPLs. Later we will increasingly rely only on PPLs. But it is still important to understand both methods.

Densities

Definition: \(X\) has density \(f(x)\) if \[\mathbb{P}(X \in A) = \int_A f(x) \mathrm{d}x.\]

LOTUS: works the same as with discrete models: \[\mathbb{E}[g(X)] = \int f(x) g(x) \mathrm{d}x.\]

Joint densities

Definition: \((X, Y)\) has joint density \(f(x, y)\) if \[\mathbb{P}((X, Y) \in A) = \int \int_A f(x, y) \mathrm{d}x \mathrm{d}y.\]

Key properties

Note that in much of what follows, everything works the same as with with discrete models except we replace sums by integrals and PMFs \(p\) by densities \(f\).

LOTUS: \[\mathbb{E}[g(X, Y)] = \int \int f(x, y) g(x, y) \mathrm{d}x \mathrm{d}y.\]

Marginalization: going from a joint density on \((X, Y)\) to the density of \(X\) only (the latter is called the marginal): \[f_X(x) = \int f(x, y) \mathrm{d}y.\]

Chain rule: \[f(x, y) = f_X(x) f_{Y|X}(y|x).\]

Bayes theorem: \[f_{X|Y}(x|y) = \frac{f(x, y)}{f_Y(y)}.\]

Example: Doomsday model

Mathematical description

Consider the following joint density described using chain rule: \[\begin{align*} X &\sim {\mathrm{Unif}}(0, 5) \\ Y | X &\sim {\mathrm{Unif}}(0, X). \end{align*}\]

Interpretation 1

  • I have a measuring tape, but you do not know how long is it.
    • Length of tape: \(X\).
    • Let’s say we think it’s less than 5m.
  • I go in a separate room, unroll it fully, and pick a number at random from the tape.
    • Random point on tape: \(Y\)

Goal: from the \(Y\), trying to guess the full length of the tape.

Interpretation 2

  • \(X\) is the total number of humans to ever live, future and past (in trillion).
  • \(Y\) is the number of humans that were born before present (from archeological records, ~0.06 trillion).

Goal: Can we guess (probabilistically) how many more human there will ever be in total?

This is known as the Doomsday argument

Computations on the Doomsday model

Joint density: which of these is the joint density?

  1. \(\frac{\mathbb{1}[y < x < 5]}{y - 5} \frac{\mathbb{1}[0 < y < x]}{x}\)
  2. \(\frac{\mathbb{1}[0 < y < 5]}{5} \frac{\mathbb{1}[0 < x < y]}{y}\)
  3. \(\frac{\mathbb{1}[0 < x < 5]}{5} \frac{\mathbb{1}[0 < y < x]}{x}\)
  4. \(\frac{\mathbb{1}[0 < x < 5]}{5} \frac{\mathbb{1}[0 < y < 5]}{5}\)
  5. None of the above

We have \[f_X(x) = \frac{\mathbb{1}[0 < x < 5]}{5}\] and \[f_{Y|X}(y|x) = \frac{\mathbb{1}[0 < y < x]}{x},\] therefore \[f(x, y) = \frac{\mathbb{1}[0 < x < 5]}{5} \frac{\mathbb{1}[0 < y < x]}{x}.\]

Marginal density: compute the marginal \(f_Y(y)\)

  1. \(\frac{1}{5} (\log 5 - \log y)\)
  2. \(\log 5 - \log y\)
  3. \(\frac{2}{y^2} - \frac{2}{25}\)
  4. \(\frac{1}{5} \left( \frac{2}{y^2} - \frac{2}{25} \right)\)
  5. None of the above

\[\begin{align*} f_Y(y) &= \frac{1}{5} \int \frac{ \mathbb{1}[0 < x < 5] \mathbb{1}[0 < y < x]}{x} \mathrm{d}x \\ &= \frac{1}{5} \int_y^5 \frac{\mathrm{d}x}{x} \\ &= \frac{1}{5} (\log 5 - \log y). \end{align*}\]

Posterior density: from our marginal and Bayes’ theorem, we obtain \[f_{X|Y}(x|y) = \frac{\mathbb{1}[0<x<5] \mathbb{1}[0 < y < x]}{x (\log 5 - \log y)}.\]

Posterior expectation: compute \(\mathbb{E}[X|Y = 0.06]\)