diff options
author | Stephan Hoyer <shoyer@gmail.com> | 2019-06-12 00:08:06 -0400 |
---|---|---|
committer | GitHub <noreply@github.com> | 2019-06-12 00:08:06 -0400 |
commit | 8f9ec01afa2b2dee209010f6155da27af02de96d (patch) | |
tree | 2a2b9133b95176dab7e7311d650781814f2f6844 /numpy/lib/tests/test_function_base.py | |
parent | bc3c7176987a5017126abbc861458fe53cd099fc (diff) | |
download | numpy-8f9ec01afa2b2dee209010f6155da27af02de96d.tar.gz |
MAINT: avoid nested dispatch in numpy.core.shape_base (#13634)
* MAINT: avoid nested dispatch in numpy.core.shape_base
This is a partial reprise of the optimizations from GH-13585.
The trade-offs here are about readability, performance and whether these
functions automatically work on ndarray subclasses.
You'll have to judge the readability costs for yourself, but I think this is
pretty reasonable.
Here are the performance numbers for three relevant functions with the
following IPython script:
import numpy as np
x = np.array([1])
xs = [x, x, x]
for func in [np.stack, np.vstack, np.block]:
%timeit func(xs)
| Function | Master | This PR |
| --- | --- | --- |
| `stack` | 6.36 µs ± 175 ns | 6 µs ± 174 ns |
| `vstack` | 7.18 µs ± 186 ns | 5.43 µs ± 125 ns |
| `block` | 15.1 µs ± 141 ns | 11.3 µs ± 104 ns |
The performance benefit for `stack` is somewhat marginal (perhaps it should be
dropped), but it's much more meaningful inside `vstack`/`hstack` and `block`,
because these functions call other dispatched functions within a loop.
For automatically working on ndarray subclasses, the main concern would be that
by skipping dispatch with `concatenate`, subclasses that define `concatenate`
won't automatically get implementations for `*stack` functions. (But I don't
think we should consider ourselves obligated to guarantee these implementation
details, as I write in GH-13633.)
`block` also will not get an automatic implementation, but given that `block`
uses two different code paths depending on argument values, this is probably a
good thing, because there's no way the code path not involving `concatenate`
could automatically work (it uses `empty()`).
* MAINT: only remove internal use in np.block
* MAINT: fixup comment on np.block optimization
Diffstat (limited to 'numpy/lib/tests/test_function_base.py')
0 files changed, 0 insertions, 0 deletions