CLBlast/src
Cedric Nugteren 6f67525ea6 Changed col2im to append to the existing im-buffer 2018-11-07 19:45:07 +01:00
..
database Updated the tuning results for Intel IvyBridge M GT2 2018-07-31 20:49:41 +02:00
kernels Changed col2im to append to the existing im-buffer 2018-11-07 19:45:07 +01:00
pyclblast Updated pyclblast to 1.1.0 and uploaded to PyPi 2018-03-30 10:38:36 +02:00
routines Fix col2im implementation 2018-10-30 14:54:55 +09:00
tuning Fixed an MSVC compilation error due to large strings 2018-09-15 17:35:26 +02:00
utilities Fix col2im implementation 2018-10-30 14:54:55 +09:00
api_common.cpp Made tuning API more flexible: disregards any extra parameter values 2018-10-13 17:47:29 +02:00
cache.cpp Now stores a shared_ptr to the Program class in the cache 2018-05-01 20:34:48 +02:00
cache.hpp Now stores a shared_ptr to the Program class in the cache 2018-05-01 20:34:48 +02:00
clblast.cpp Added groundwork for col2im algorithm plus first non-working version of kernel and test 2018-10-23 20:52:25 +02:00
clblast_c.cpp Added groundwork for col2im algorithm plus first non-working version of kernel and test 2018-10-23 20:52:25 +02:00
clblast_cuda.cpp Added groundwork for col2im algorithm plus first non-working version of kernel and test 2018-10-23 20:52:25 +02:00
clblast_netlib_c.cpp Added groundwork for col2im algorithm plus first non-working version of kernel and test 2018-10-23 20:52:25 +02:00
clpp11.hpp Made last operation in TRSV and TRSM asynchronous, making the events not null 2018-08-13 22:58:44 +02:00
cupp11.hpp Applied feedback from Cedric from first pull request 2018-07-14 19:50:47 -04:00
cxpp11_common.hpp Various fixes to make the host code and sample compile with the CUDA API 2017-10-14 11:43:57 +02:00
kernel_preprocessor.cpp Fixed pre-processor warnings related to the subgroup shuffling 2018-10-10 19:12:42 +02:00
kernel_preprocessor.hpp Implemented first simple pre-processor: defines parser and loop unrolling based on assumptions 2017-11-25 17:46:01 +01:00
routine.cpp Eliminate a temporary Program object 2018-07-06 12:58:20 +01:00
routine.hpp Now stores a shared_ptr to the Program class in the cache 2018-05-01 20:34:48 +02:00