Test for goodness of fit of the normal-probability model

Taken from page 426 of Funamentals of Biostatistics, 8th Edition, by Bernard Rosner.

For this example, we will utilize the test data from the following hypothetical experiment where diastolic blood-pressure measurements were collected at home in a community-wide screening program of 14,736 adults ages 30-69 in East Boston, MA, as part of a nationwide study to detect and treat hypertensive people. The people int he study were each screened in the home, with two measurements taken during one visit. A frequency distribution of the mean distolic blood pressure is given in the table (see df below) in 10-mm Hg intervals.

In general, we would like to test the assumption that the data come from an underlying normal distribution because "standard methods of statistical inference can then be applied on these data"

How can the validity of this assumption be tested?

The null hypothesis (H0) is that the data come from a normal distribution.

The value chi-squared test statistic is calculated across all rows in the table as:

\begin{equation*} \left( \sum_{i=1}^n (O_i-E_i)^2/E_i \right) \end{equation*}

This means we need to calculate the Expected Frequency. The expected frequency is calculated from the normal distribution based on the parameters derived from the sample data (mean=u, standard deviation=s). We use this distribution's CDF to calculate the probability of getting a value within each "Group" (determined by the group's end points), and then multiply that probability density by the total sample size, n, to get the Expected Frequency for each blood-pressure measurement group.

The requirements for using this test are:

Both of these criteria are met in this example.

p < 0.001, thus we can reject the null hypothesis (H0: the sample comes from a normal distribution) as the results are highly significant.