CLBlast/include/internal
2016-05-15 17:28:22 +02:00
..
database Added new tuning results for SGEMM and updated the performance graph for the Radeon M370X AMD GPU 2016-05-15 17:28:22 +02:00
routines Changed the index buffer of IxAMAX routines to unsigned int for proper buffersize checking 2016-05-01 14:03:37 +02:00
cache.h Added a program cache (per-context) next to the per-device binary cache 2016-05-01 12:56:08 +02:00
clpp11.h Added a program cache (per-context) next to the per-device binary cache 2016-05-01 12:56:08 +02:00
database.h Fixed a compilation issue under AppleClang 2016-02-28 14:14:50 +01:00
public_api.h Added missing newline to the end of the public API file 2016-03-30 16:13:22 -07:00
routine.h Changed the index buffer of IxAMAX routines to unsigned int for proper buffersize checking 2016-05-01 14:03:37 +02:00
tuning.h Added support for staggered/shuffled offsets for GEMM to improve performance for large power-of-2 kernels on AMD GPUs 2016-05-15 14:04:34 +02:00
utilities.h Added a '-verbose' option to the test binaries to report errors in more detail if needed 2016-04-27 14:38:30 +02:00