CLBlast/src/kernels
Cedric Nugteren 3d0c227fa5
AMAX/AMIN integer testing and bug fixes (#457)
* Fixed a bug in XAMAX/XMIN routines that caused the increment and offset to be included in the result

* Perform proper integer-output testing in XAMAX tests

* A few changes towards getting it ready for a PR

* Also fix compilation for clBLAS and cuBLAS references

* Fix a bug that would only use the real part of complex numbers in the amax/amin routines

* A few small fixes related to the AMAX tests
2023-05-07 20:02:52 +02:00
..
level1 AMAX/AMIN integer testing and bug fixes (#457) 2023-05-07 20:02:52 +02:00
level2 Updated according to feedback from CNugteren 2023-01-17 08:35:29 -08:00
level3 Updated according to feedback from CNugteren 2023-01-17 08:35:29 -08:00
levelx Updated according to feedback from CNugteren 2023-01-17 08:35:29 -08:00
common.opencl implemented changes to boost Adreno performance according to https://jira-dc.qualcomm.com/jira/browse/OSR-8731 2023-01-03 10:56:04 -08:00
opencl_to_cuda.h CUDA kernel compilation fixes 2017-10-17 19:53:09 +02:00