CLBlast/src
2017-11-15 22:44:44 +01:00
..
database Added tuning results for the GeForce GTX750Ti 2017-11-09 21:19:21 +01:00
kernels CUDA kernel compilation fixes 2017-10-17 19:53:09 +02:00
routines Factored out the creation of the OpenCL header and the program compilation 2017-11-11 16:14:43 +01:00
tuning Changed GEMM routine tuner's scoring to use L2 measure instead for better averaging 2017-11-06 20:50:36 +01:00
utilities Added exception handle with catch-all 2017-11-15 22:44:44 +01:00
api_common.cpp Added first (untested) version of a CUDA API 2017-10-11 23:16:57 +02:00
cache.cpp Made RemoveBySubset from the cache work with references to keys 2017-02-12 11:58:20 +01:00
cache.hpp Added platform ID to the binary program cache to prevent issues with multi-platform systems 2017-10-29 20:01:30 +01:00
clblast.cpp Moved non-routine-specific API functions and includes to separate files 2017-10-08 21:52:02 +02:00
clblast_c.cpp Added interface and stubs for the im2col routine 2017-07-02 12:10:22 +02:00
clblast_cuda.cpp Various fixes to make the host code and sample compile with the CUDA API 2017-10-14 11:43:57 +02:00
clblast_netlib_c.cpp Added interface and stubs for the im2col routine 2017-07-02 12:10:22 +02:00
clpp11.hpp Added Android support using the GNU C++ STL library and the GCC toolchain 2017-10-29 12:07:07 +01:00
cupp11.hpp Moved CUmodule code from Kernel to Program class to not require re-compilation every time 2017-10-18 18:17:30 +02:00
cxpp11_common.hpp Various fixes to make the host code and sample compile with the CUDA API 2017-10-14 11:43:57 +02:00
routine.cpp Factored out the creation of the OpenCL header and the program compilation 2017-11-11 16:14:43 +01:00
routine.hpp Moved the remaining OpenCL specific host code to the clpp11.h header where it belongs 2017-10-08 10:29:47 +02:00