CLBlast/src
2017-04-16 19:41:14 +02:00
..
database Fixed a compilation issue under MSVC and GCC 2017-04-10 08:38:24 +02:00
kernels Added a new Xaxpy kernel in between the regular and fast version in 2017-04-14 20:16:10 +02:00
routines Added a new Xaxpy kernel in between the regular and fast version in 2017-04-14 20:16:10 +02:00
tuning Added a new Xaxpy kernel in between the regular and fast version in 2017-04-14 20:16:10 +02:00
utilities Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now works 2017-04-13 21:31:27 +02:00
cache.cpp Made RemoveBySubset from the cache work with references to keys 2017-02-12 11:58:20 +01:00
cache.hpp Split the database into several smaller cached per-kernel databases (in preparation of per-kernel database overrides) 2017-02-12 12:02:39 +01:00
clblast.cpp Removed const-vector-of-const-objects from the database class to remain according to the C++11 standard 2017-04-10 07:40:27 +02:00
clblast_c.cpp Added API and test infrastructure for the batched GEMM routine 2017-03-10 21:24:35 +01:00
clblast_netlib_c.cpp Fixed a vector-size related bug in the CLBlast Netlib API 2016-11-23 22:00:20 +01:00
clpp11.hpp Added a special override database for the Apple CPU implementation on OS X: this makes the test work, it does not focus on good performance 2017-04-07 07:37:30 +02:00
cxpp11_common.hpp Added a fix for compilation under Visual Studio 2013 related to the new exception classes 2016-10-23 15:55:03 +02:00
routine.cpp Removed const-vector-of-const-objects from the database class to remain according to the C++11 standard 2017-04-10 07:40:27 +02:00
routine.hpp Removed const-vector-of-const-objects from the database class to remain according to the C++11 standard 2017-04-10 07:40:27 +02:00