CLBlast/src
2020-03-08 10:14:59 +01:00
..
database Silenced a new OpenCL warning message 2020-03-08 10:14:59 +01:00
kernels Fix out-of-bounds read/write in XhadFaster 2019-09-04 12:55:25 +01:00
pyclblast Added fp32 to fp16 conversion function in Python to make haxpy example work 2019-01-23 19:52:01 +01:00
routines Reduce TestMatrix calls for xgemmstridedbatched. 2019-12-09 22:17:24 +02:00
tuning Catches all exceptions of the tuners 2020-02-17 22:07:51 +01:00
utilities Reduce TestMatrix calls for xgemmstridedbatched. 2019-12-09 22:17:24 +02:00
api_common.cpp Made tuning API more flexible: disregards any extra parameter values 2018-10-13 17:47:29 +02:00
cache.cpp Now stores a shared_ptr to the Program class in the cache 2018-05-01 20:34:48 +02:00
cache.hpp Now stores a shared_ptr to the Program class in the cache 2018-05-01 20:34:48 +02:00
clblast.cpp Add kernel_mode option to im2col, col2im, and convgemm functions 2018-11-12 10:12:07 +09:00
clblast_c.cpp Add kernel_mode option to im2col, col2im, and convgemm functions 2018-11-12 10:12:07 +09:00
clblast_cuda.cpp Add kernel_mode option to im2col, col2im, and convgemm functions 2018-11-12 10:12:07 +09:00
clblast_netlib_c.cpp Add kernel_mode option to im2col, col2im, and convgemm functions 2018-11-12 10:12:07 +09:00
clpp11.hpp Silenced a new OpenCL warning message 2020-03-08 10:14:59 +01:00
cupp11.hpp Applied feedback from Cedric from first pull request 2018-07-14 19:50:47 -04:00
cxpp11_common.hpp Various fixes to make the host code and sample compile with the CUDA API 2017-10-14 11:43:57 +02:00
kernel_preprocessor.cpp Fixed pre-processor warnings related to the subgroup shuffling 2018-10-10 19:12:42 +02:00
kernel_preprocessor.hpp Implemented first simple pre-processor: defines parser and loop unrolling based on assumptions 2017-11-25 17:46:01 +01:00
routine.cpp Added a function to set the OpenCL kernel standard, either 1.1 or 1.2 2019-05-11 20:39:00 +02:00
routine.hpp Now stores a shared_ptr to the Program class in the cache 2018-05-01 20:34:48 +02:00