Censoring is an example of a data collection process. We will see a case study where ignoring the data collection process literally cost many lives…
Example: Chernobyl, 1986
In 1986, the Chernobyl Nuclear Power Plant exploded.
Initially, the reactor crew chief, A. Akimov, assumed the core reactor was intact.
Data generation
The numbers used in this page are synthetic but inspired by a true story.
Let us not look at the data generating code just yet, we will come back to it.
Code
set.seed(1)# detection limit: value higher than that stay at the limitlimit =1.1n_measurements =10true_mean =5.6# data, if we were able to observe perfectlyy =rexp(n_measurements, 1.0/true_mean)# number of measurements higher than the detection limitn_above_limit =sum(y >= limit)n_below_limit =sum(y < limit)# subset of the measurements that are below the limit:data_below_limit = y[y < limit]# measurements: those higher than the limit stay at the limitmeasurements =ifelse(y < limit, y, limit)
Measuring radiation levels
A dosimeter is a device measuring ionizing radiation.
Unit: roentgens per second (R/s).
Suppose that:
If ionizing radiation is greater than 1.5 R/s \(\Rightarrow\) we think the reactor is breached (radiation coming out of reactor core).
If ionizing radiation is smaller than 1.5 R/s \(\Rightarrow\) we think the reactor is not breached.