summaryrefslogtreecommitdiff
path: root/numpy/core/src/common/simd/avx512
Commit message (Expand)AuthorAgeFilesLines
* ENH, SIMD: Implment intrinsic for mask divisionSayed Adel2023-01-291-0/+13
* ENH, SIMD: Implment intrinsic for FMA multiply add(odd) and subtract(even)Sayed Adel2023-01-291-0/+4
* ENH, SIMD: Add special intrinsics for better non-contiguous/partial memory ac...Sayed Adel2023-01-291-10/+307
* ENH: Implement intrinsics for shuffle over 128-bit lane and unzipSayed Adel2023-01-291-0/+152
* ENH, SIMD: Discard non-signaling comparison intrinsicsSayed Adel2022-12-152-10/+1
* ENH, SIMD: Add ordered comparison intrinsics guarantees non-signalingSayed Adel2022-12-141-0/+10
* SIMD: Add new intrinsics to check true cross all vector lanesSayed Adel2022-09-192-40/+101
* ENH, SIMD: Implement new universal intrinsic for max/min operationsSayed Adel2022-09-173-73/+172
* BUG, SIMD: Fix C++ AVX512/qsort on old gcc compilersSayed Adel2022-08-191-12/+12
* Merge pull request #20913 from seiko2plus/zsystem_supMatti Picus2022-06-121-0/+2
|\
| * ENH, SIMD: Extend universal intrinsics to support IBMZSayed Adel2022-06-041-0/+2
* | BUG: switch _CMP_NEQ_OQ to _CMP_NEQ_UQ for npyv_cmpneq_f[32,64]Rafael CF Sousa2022-06-071-2/+2
|/
* SIMD, ENH: Add universal intrinsic andc8 and use it to remove ifneqRafael CF Sousa2022-05-292-8/+11
* SIMD, ENH: Use logical bitwise to implement comparison functions (bool_)Rafael CF Sousa2022-05-231-0/+12
* SIMD: Use universal intrinsics to implement comparison functionsRafael Cardoso Fernandes Sousa2022-05-232-0/+54
* Make common/simd/avx512/memory.h compatible with C++serge-sans-paille2022-02-211-2/+4
* road-to-cxx: x86-qsort.dispatch.c.src -> x86-qsort.dispatch.cppserge-sans-paille2022-02-211-2/+2
* Merge pull request #20363 from seiko2plus/svml2npyv/tanh_f32Matti Picus2022-02-101-0/+29
|\
| * SIMD: Add new universal intrinsics for lookup tableSayed Adel2022-01-191-0/+29
* | Merge pull request #20133 from r-devulap/avx512-sortMatti Picus2022-02-071-0/+72
|\ \ | |/ |/|
| * MAINT: rename reduce_min/max functions to not clash with local def in loops_m...Raghuveer Devulapalli2022-02-021-12/+12
| * MAINT: Use cvt instruction instead of extractRaghuveer Devulapalli2022-02-021-4/+4
| * SIMD: Add intrinsics emulation for reduce_min and reduce_max instructionsRaghuveer Devulapalli2022-02-021-0/+72
* | SIMD: add universal intrinsic for round to nearestSayed Adel2021-12-191-2/+6
* | ENH, SIMD: add new universal intrinsics for floorHowJMay2021-12-121-0/+4
|/
* ENH, SIMD: add new universal intrinsics for truncHowJMay2021-11-161-0/+4
* ENH: Implement SIMD for ceilHowJMay2021-11-011-0/+4
* ENH: Add conversion for 512bit intrin to 256bit intrinHowJMay2021-11-011-1/+21
* fix upSayed Adel2021-10-282-7/+7
* BUG, SIMD: Workaround broadcasting SIMD 64-bit integers on MSVC 32-bit modeSayed Adel2021-10-281-2/+30
* BUG, SIMD: Fix unexpected result of uint8 division on X86Sayed Adel2021-05-191-9/+10
* DOC: Add comment for ifdef macro guardHowJmay2021-05-161-1/+1
* ENH: Add max/min value comparison for integersHowJmay2021-05-101-0/+32
* ENH: Add min values comparison for floating pointHowJmay2021-05-021-0/+17
* Merge pull request #18863 from HowJMay/simd-maxSayed Adel2021-04-301-0/+17
|\
| * ENH: Add max values comparison for floating pointHowJmay2021-05-011-0/+17
* | fix: Fix the typo "implment"HowJmay2021-04-281-3/+3
|/
* SIMD: add NPYV fast integer division intrinsics for AVX512Sayed Adel2021-03-081-2/+222
* use more accurate words.Qiyu82021-01-271-1/+1
* improve sumup intriniscs.Qiyu82021-01-251-18/+18
* treat u8/u16 as overflow protection.Qiyu82021-01-221-17/+24
* Optimize the avx512 sum_u8/sum_u16.Qiyu82021-01-211-24/+28
* Add new intrinsics sum_u8/u16/u64.Qiyu82021-01-201-0/+25
* BUG, MAINT: improve avx512 mask logical operationsSayed Adel2021-01-052-47/+55
* Merge branch 'master' of github.com:numpy/numpy into countnzQiyu82020-12-234-0/+127
|\
| * ENH, SIMD: Add new NPYV intrinsics pack(1)Sayed Adel2020-12-224-0/+127
* | use max value to detect deviationQiyu82020-12-141-1/+1
* | remove extra inputQiyu82020-12-101-2/+2
* | use current intrinsics instead of create new one.Qiyu82020-12-102-17/+9
* | Add non-AVX512DQ intrinsicsQiyu82020-12-101-0/+8