NEP: c-masked-array: More tweaks

author: Mark Wiebe <mwiebe@enthought.com> 2011-06-25 16:13:17 -0500
committer: Mark Wiebe <mwiebe@enthought.com> 2011-06-27 10:32:42 -0500
commit: f61537d099d34ba67cfdc134347fd633987cf00f (patch)
tree: ad4d52b6662a760cc80bd3a760f8c6556c566469 /doc
parent: 08caab161a82f9076cc79f97bcf29957030c9635 (diff)
download: numpy-f61537d099d34ba67cfdc134347fd633987cf00f.tar.gz
1 files changed, 14 insertions, 5 deletions
diff --git a/doc/neps/c-masked-array.rst b/doc/neps/c-masked-array.rst
index 3c9fad9dd..d678fa395 100644
--- a/doc/neps/c-masked-array.rst
+++ b/doc/neps/c-masked-array.rst
@@ -102,8 +102,15 @@ numpy.NA to the array masks that element. The storage behind a masked
 value may never be accessed in any way, other than to unmask it by
 assigning a value.
 
-Because numpy.NA is a global singleton, it will be possible to test
-whether a value is masked by saying "arr[0] is np.NA".
+While numpy.NA works to mask values, it does not itself have a dtype.
+This means that returning the numpy.NA singleton from an operation
+like 'arr[0]' would be throwing away the dtype, which is still
+valuable to retain, so 'arr[0]' will return a zero-dimensional
+array with its value masked instead of numpy.NA. To test if the value
+is missing, a function like "np.ismissing(arr[0])" will be provided.
+One of the key reasons for the NumPy scalars is to allow their values
+into dictionaries. Having a missing value as the key in a dictionary
+is a bad idea, so not returning a scalar is fine for this case.
 
 All operations which write to masked arrays will not affect the value
 unless they also unmask that value. This allows the storage behind
@@ -151,9 +158,11 @@ A manual loop through a masked array like::
     for i in xrange(len(a)):
         a[i] = np.log(a[i])
 
-should work, something that is a little bit tricky because the global
-singleton np.NA has no type, and doesn't follow the type promotion rules.
-A good approach to deal with this needs to be found.
+works even with masked values, because 'a[i]' returns a zero-dimensional
+array with a masked value instead of np.NA for the missing elements.
+If np.NA was returned, np.log would have to raise an exception because
+it doesn't know the log of which dtype it's meant to call, whether it's a
+missing float or a missing string, for example.
 
 The 'validitymask' Property
 ===========================
author	Mark Wiebe <mwiebe@enthought.com>	2011-06-25 16:13:17 -0500
committer	Mark Wiebe <mwiebe@enthought.com>	2011-06-27 10:32:42 -0500
commit	f61537d099d34ba67cfdc134347fd633987cf00f (patch)
tree	ad4d52b6662a760cc80bd3a760f8c6556c566469 /doc
parent	08caab161a82f9076cc79f97bcf29957030c9635 (diff)
download	numpy-f61537d099d34ba67cfdc134347fd633987cf00f.tar.gz