CLBlast/src
2017-04-23 17:58:56 +02:00
database Re-added Titan X (Pascal) tuning results based on more averaging when tuning 2017-04-23 17:58:56 +02:00
kernels Added a new Xaxpy kernel in between the regular and fast version in 2017-04-14 20:16:10 +02:00
routines Added a new Xaxpy kernel in between the regular and fast version in 2017-04-14 20:16:10 +02:00
tuning Increased the default number of runs for the tuner from 2 up to 10 for fast kernels 2017-04-22 13:56:07 +02:00
utilities Fixed a namespace clash with CUDA FP16 for the half-datatype 2017-04-17 16:47:15 +02:00
cache.cpp Made RemoveBySubset from the cache work with references to keys 2017-02-12 11:58:20 +01:00
cache.hpp Split the database into several smaller cached per-kernel databases (in preparation of per-kernel database overrides) 2017-02-12 12:02:39 +01:00
clblast.cpp Removed const-vector-of-const-objects from the database class to remain according to the C++11 standard 2017-04-10 07:40:27 +02:00
clblast_c.cpp Fixed a namespace clash with CUDA FP16 for the half-datatype 2017-04-17 16:47:15 +02:00
clblast_netlib_c.cpp Fixed a vector-size related bug in the CLBlast Netlib API 2016-11-23 22:00:20 +01:00
clpp11.hpp Added a special override database for the Apple CPU implementation on OS X: this makes the test work, it does not focus on good performance 2017-04-07 07:37:30 +02:00
cxpp11_common.hpp Added a fix for compilation under Visual Studio 2013 related to the new exception classes 2016-10-23 15:55:03 +02:00
routine.cpp Removed const-vector-of-const-objects from the database class to remain according to the C++11 standard 2017-04-10 07:40:27 +02:00
routine.hpp Removed const-vector-of-const-objects from the database class to remain according to the C++11 standard 2017-04-10 07:40:27 +02:00