DOC: improve standard_t example in numpy.random.

This rewrite better reflects the cited reference R example by offering a 2-tailed test and more accurate language. Closes #18370.
author: François Le Lay <fly@spotify.com> 2021-02-08 17:28:30 -0500
committer: François Le Lay <fly@spotify.com> 2021-02-08 20:45:34 -0500
commit: e29d0955c7c4e482e4128dde68ba5ff2246b8dbf (patch)
tree: 5d87cd99be25c272cf58c09020813d1f65c64781 /numpy
parent: 7b085eac10d08c49453541a896297f42f09ddfe7 (diff)
download: numpy-e29d0955c7c4e482e4128dde68ba5ff2246b8dbf.tar.gz
2 files changed, 60 insertions, 28 deletions
diff --git a/numpy/random/_generator.pyx b/numpy/random/_generator.pyx
index 1c4689a70..003c16113 100644
--- a/numpy/random/_generator.pyx
+++ b/numpy/random/_generator.pyx
@@ -1722,6 +1722,9 @@ cdef class Generator:
                Springer, 2002.
         .. [2] Wikipedia, "Student's t-distribution"
                https://en.wikipedia.org/wiki/Student's_t-distribution
+        .. [3] Walker, Helen M., "Degrees of freedom" in Journal of 
+               Educational Psychology
+               http://www.nohsteachers.info/pcaso/ap_statistics/PDFs/DegreesOfFreedom.pdf
 
         Examples
         --------
@@ -1732,33 +1735,46 @@ cdef class Generator:
         ...                    7515, 8230, 8770])
 
         Does their energy intake deviate systematically from the recommended
-        value of 7725 kJ?
-
-        We have 10 degrees of freedom, so is the sample mean within 95% of the
-        recommended value?
+        value of 7725 kJ? Our null hypothesis will be the absence of deviation,
+        and the alternate hypothesis will be the presence of an effect that could be
+        either positive or negative, hence making our test 2-tailed. 
+
+        Because we are estimating the mean and we have N=11 values in our sample,
+        we have N-1=10 degrees of freedom [3]_. We set our signifance level to 95% and 
+        compute the t statistic using the empirical mean and standard deviation
+        of our intake, setting the ddof parameter to the unbiased
+        value so the divisor in the standard deviation will be degrees of
+        freedom.
 
-        >>> s = np.random.default_rng().standard_t(10, size=100000)
         >>> np.mean(intake)
         6753.636363636364
         >>> intake.std(ddof=1)
         1142.1232221373727
+        >>> t = (np.mean(intake)-7725)/(intake.std(ddof=1)/np.sqrt(len(intake)))
+        >>> t
+        -2.8207540608310198
 
-        Calculate the t statistic, setting the ddof parameter to the unbiased
-        value so the divisor in the standard deviation will be degrees of
-        freedom, N-1.
+        We draw 1000000 samples from Student's t distribution with the adequate
+        degrees of freedom.
 
-        >>> t = (np.mean(intake)-7725)/(intake.std(ddof=1)/np.sqrt(len(intake)))
         >>> import matplotlib.pyplot as plt
+        >>> s = np.random.default_rng().standard_t(10, size=1000000)
         >>> h = plt.hist(s, bins=100, density=True)
 
-        For a one-sided t-test, how far out in the distribution does the t
-        statistic appear?
+        Does our t statistic land in one of the two critical regions found at 
+        both tails of the distribution?
 
         >>> np.sum(s<t) / float(len(s))
-        0.0090699999999999999  #random
+        0.009159  #random <0.025, statistic is in critical region 
+        >>> 2*0.009159
+        0.018318  #random
+
+        The probability value for this 2-tailed test is about 1.83%, which is 
+        lower than the 5% pre-determined significance threshold. 
 
-        So the p-value is about 0.009, which says the null hypothesis has a
-        probability of about 99% of being true.
+        Therefore, the probability of observing values as extreme as our intake
+        conditionally on the null hypothesis being true is too low, and we reject 
+        the null hypothesis of no deviation. 
 
         """
         return cont(&random_standard_t, &self._bitgen, size, self.lock, 1,
diff --git a/numpy/random/mtrand.pyx b/numpy/random/mtrand.pyx
index df8d7e380..8429cf17d 100644
--- a/numpy/random/mtrand.pyx
+++ b/numpy/random/mtrand.pyx
@@ -2137,6 +2137,9 @@ cdef class RandomState:
                Springer, 2002.
         .. [2] Wikipedia, "Student's t-distribution"
                https://en.wikipedia.org/wiki/Student's_t-distribution
+        .. [3] Walker, Helen M., "Degrees of freedom" in Journal of 
+               Educational Psychology
+               http://www.nohsteachers.info/pcaso/ap_statistics/PDFs/DegreesOfFreedom.pdf
 
         Examples
         --------
@@ -2147,33 +2150,46 @@ cdef class RandomState:
         ...                    7515, 8230, 8770])
 
         Does their energy intake deviate systematically from the recommended
-        value of 7725 kJ?
-
-        We have 10 degrees of freedom, so is the sample mean within 95% of the
-        recommended value?
+        value of 7725 kJ? Our null hypothesis will be the absence of deviation,
+        and the alternate hypothesis will be the presence of an effect that could be
+        either positive or negative, hence making our test 2-tailed. 
+
+        Because we are estimating the mean and we have N=11 values in our sample,
+        we have N-1=10 degrees of freedom [3]_. We set our signifance level to 95% and 
+        compute the t statistic using the empirical mean and standard deviation
+        of our intake, setting the ddof parameter to the unbiased
+        value so the divisor in the standard deviation will be degrees of
+        freedom.
 
-        >>> s = np.random.standard_t(10, size=100000)
         >>> np.mean(intake)
         6753.636363636364
         >>> intake.std(ddof=1)
         1142.1232221373727
+        >>> t = (np.mean(intake)-7725)/(intake.std(ddof=1)/np.sqrt(len(intake)))
+        >>> t
+        -2.8207540608310198
 
-        Calculate the t statistic, setting the ddof parameter to the unbiased
-        value so the divisor in the standard deviation will be degrees of
-        freedom, N-1.
+        We draw 1000000 samples from Student's t distribution with the adequate
+        degrees of freedom.
 
-        >>> t = (np.mean(intake)-7725)/(intake.std(ddof=1)/np.sqrt(len(intake)))
         >>> import matplotlib.pyplot as plt
+        >>> s = np.random.standard_t(10, size=1000000)
         >>> h = plt.hist(s, bins=100, density=True)
 
-        For a one-sided t-test, how far out in the distribution does the t
-        statistic appear?
+        Does our t statistic land in one of the two critical regions found at 
+        both tails of the distribution?
 
         >>> np.sum(s<t) / float(len(s))
-        0.0090699999999999999  #random
+        0.009159  #random < 0.025, statistic is in critical region
+        >>> 2*0.009159
+        0.018318  #random
+
+        The probability value for this 2-tailed test is about 1.83%, which is 
+        lower than the 5% pre-determined significance threshold. 
 
-        So the p-value is about 0.009, which says the null hypothesis has a
-        probability of about 99% of being true.
+        Therefore, the probability of observing values as extreme as our intake
+        conditionally on the null hypothesis being true is too low, and we reject 
+        the null hypothesis of no deviation. 
 
         """
         return cont(&legacy_standard_t, &self._aug_state, size, self.lock, 1,
author	François Le Lay <fly@spotify.com>	2021-02-08 17:28:30 -0500
committer	François Le Lay <fly@spotify.com>	2021-02-08 20:45:34 -0500
commit	e29d0955c7c4e482e4128dde68ba5ff2246b8dbf (patch)
tree	5d87cd99be25c272cf58c09020813d1f65c64781 /numpy
parent	7b085eac10d08c49453541a896297f42f09ddfe7 (diff)
download	numpy-e29d0955c7c4e482e4128dde68ba5ff2246b8dbf.tar.gz