Cedric Nugteren
3d0c227fa5
AMAX/AMIN integer testing and bug fixes ( #457 )
...
* Fixed a bug in XAMAX/XMIN routines that caused the increment and offset to be included in the result
* Perform proper integer-output testing in XAMAX tests
* A few changes towards getting it ready for a PR
* Also fix compilation for clBLAS and cuBLAS references
* Fix a bug that would only use the real part of complex numbers in the amax/amin routines
* A few small fixes related to the AMAX tests
2023-05-07 20:02:52 +02:00
Koichi Akabe
d9db543d75
Fix half-float+kernel_mode test cases of im2col, col2im, and convgemm
2018-12-17 21:57:35 +09:00
Koichi Akabe
032e3b0cc0
Add kernel_mode option to im2col, col2im, and convgemm functions
2018-11-12 10:12:07 +09:00
Cedric Nugteren
d45911b61d
Added groundwork for col2im algorithm plus first non-working version of kernel and test
2018-10-23 20:52:25 +02:00
Cedric Nugteren
ab0178c56b
Fixed MSVC's compilation error C1061 due to too many for-loops
2018-10-17 21:35:09 +02:00
Cedric Nugteren
83ba3d4b7b
Merge branch 'master' into convgemm_multi_kernel
2018-09-16 20:01:18 +02:00
Cedric Nugteren
4917b77e13
Added pre-processor test for GEMMK=1 kernel
2018-09-15 16:49:51 +02:00
Cedric Nugteren
b7d8339012
Reduced size of the xCONVGEMM correctness tests
2018-09-07 22:04:24 +02:00
Cedric Nugteren
713d0f96b3
Fixed an error reporting issue related to the canary region
2018-07-31 21:24:21 +02:00
Cedric Nugteren
2dd539f911
Removed complex numbers support for CONVGEMM
2018-07-29 10:37:14 +02:00
Cedric Nugteren
838422fbb1
Further implemented single-kernel approach of convgemm; extended test to capture other parts of the kernel code
2018-05-21 11:47:16 +02:00
Cedric Nugteren
cbcd4ff7e8
Merge branch 'master' into CLBlast-267-convgemm
2018-05-19 17:54:27 +02:00
Cedric Nugteren
8290ad78b9
Fixed a few issues with canary region testing
2018-05-17 12:16:32 +02:00
Cedric Nugteren
85341836dd
Added a canary region for overflow detection to the correctness tests
2018-05-17 10:45:50 +01:00
Cedric Nugteren
52e6195628
Split channels/strides testing values off from kernel sizes for more flexibility
2018-05-09 17:23:55 +02:00
Cedric Nugteren
2d1f6ba7fe
Added convgemm skeleton, test infrastructure, and first reference implementation
2018-05-06 11:35:34 +02:00
Cedric Nugteren
93610a9cba
Fixed some failing tests for GEMM and batched GEMM routines
2018-04-15 12:53:32 +02:00
Cedric Nugteren
f4d96e80c3
Fixed breaking preprocessor test on certain platforms due to empty kernel string
2018-03-15 20:45:41 +01:00
Cedric Nugteren
ef5008f5e4
Created the API and stubs for the HAD (hadamard-product) routines
2018-01-31 20:41:02 +01:00
Cedric Nugteren
90e8e55acb
Added test for the RetrieveParameters function
2018-01-11 20:34:09 +01:00
Cedric Nugteren
389919faec
Fixed bug in override parameters test
2018-01-11 20:30:45 +01:00
Cedric Nugteren
9fb2c61b25
Added API and tests for new GemmStridedBatched routine
2018-01-07 14:27:15 +01:00
Cedric Nugteren
eb89371d2b
Added a queue argument to the get-size function when running the tests/clients
2018-01-03 20:19:45 +01:00
Cedric Nugteren
b4d3a50f19
Split GEMM kernel in 4 files instead of 3 due to MSVC 2013 string length limit
2017-12-10 16:09:09 +01:00
Cedric Nugteren
9f02fb542c
Completed kernel modifications for pre-processor of all other kernels
2017-12-09 20:44:21 +01:00
Cedric Nugteren
ca5dbcd2bd
Made the pre-processor run by default for ARM and Qualcomm GPUs
2017-12-09 15:16:53 +01:00
Cedric Nugteren
d9df62b794
Fixed defines parsing and substituting in pre-processor; fixed some variable names in kernels
2017-12-09 10:49:55 +01:00
Cedric Nugteren
0f9637bbac
Improved array-to-register promotion, now handling function calls as well
2017-12-05 20:39:49 +01:00
Cedric Nugteren
cf4555d1f4
Added GEMM (direct and in-direct) to the pre-processor testing; modified the loops in kernel accordingly
2017-12-03 16:40:36 +01:00
Cedric Nugteren
60312e5878
Reformated transpose kernels for the pre-processor; extended the amount of tests
2017-12-03 12:00:37 +01:00
Cedric Nugteren
bf7aeb8d5b
Improved the pre-processor's handling of defines; added a special nested defines test
2017-11-30 21:43:16 +01:00
Cedric Nugteren
13eb772343
Integrated pre-processor in compilation flow, default is still disabled
2017-11-30 21:32:47 +01:00
Cedric Nugteren
0dde6af703
Extended the preprocessor tests to include CopyFast and CopyPad
2017-11-29 20:18:36 +01:00
Cedric Nugteren
426406668e
Improved the pre-processor tester, added GEMV and GER kernels
2017-11-28 20:52:47 +01:00
Cedric Nugteren
c0c6d00b12
Added stub for a preprocessor and a corresponding compilation test
2017-11-25 10:24:05 +01:00
Cedric Nugteren
4bac1287f2
Moved square-difference utility function for use in the tuners
2017-11-13 21:10:44 +01:00
Cedric Nugteren
12b08ae491
Merge branch 'master' into android_support
2017-10-28 17:32:37 +02:00
Cedric Nugteren
a3069a97c3
Prepared test and client infrastructure for use with the CUDA API
2017-10-15 13:56:19 +02:00
Cedric Nugteren
9224da19ef
Fixed the Python generator script w.r.t. the recent change of testing direct/in-direct GEMM kernels separately
2017-10-09 20:06:25 +02:00
Cedric Nugteren
74fd6767b9
GEMM tests now test both the in-direct and the direct kernels seperately
2017-10-01 20:36:56 +02:00
Cedric Nugteren
21af690472
Added missing headers
2017-09-26 21:17:55 +02:00
Cedric Nugteren
ed980a1df1
Updated database override function to work with the new database storage format
2017-09-24 15:44:14 +02:00
Cedric Nugteren
890281f3e8
Made database-caching no longer dependent on device name but on device/platform IDs
2017-09-23 17:50:44 +02:00
Cedric Nugteren
132e62892d
Implemented proper im2col reference function and completd tests
2017-08-19 16:55:09 +02:00
Cedric Nugteren
777681dcbd
Merge branch 'master' into im_to_col
2017-08-12 20:50:00 +02:00
Cedric Nugteren
844e68853e
Moved some utility functions to a test-specific utility compilation-unit
2017-08-12 15:38:17 +02:00
Cedric Nugteren
97bcf77d4b
First step towards supporting im2col in the test infrastructure
2017-07-16 22:33:49 +02:00
Cedric Nugteren
de9ed9d4ea
Fixed batched tests when testing for invalid sizes against clBLAS
2017-07-12 21:54:16 +02:00
Cedric Nugteren
d4c8a7c8b0
Changed printf-statements with %zu into std::cout to fix MSVC 2013 compatibility
2017-07-09 20:19:08 +02:00
Cedric Nugteren
4b415bdf3c
Disabled UNIX-style terminal color printing under Windows
2017-07-09 20:04:13 +02:00