..
database | Added XgemvFastRot and Xgemm 16-bit tuning results: just defaults which are now automatically taken from 32-bit if there are no entries at all | 2016-09-12 20:13:38 +02:00
kernels | Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC can't handle long strings | 2016-09-12 22:13:16 +02:00
routines | Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC can't handle long strings | 2016-09-12 22:13:16 +02:00
tuning | Split the XGEMM kernel further up: now in 3 parts. This is done because MSVC can't handle long strings | 2016-09-12 22:13:16 +02:00
buffer_test.hpp | Moved all headers into the source tree, changed headers to .hpp extension | 2016-06-18 20:20:13 +02:00
cache.cpp | Fixed a bug related to the cache and retrieval of programs based on the OpenCL context | 2016-07-10 11:24:36 +02:00
cache.hpp | Fixed a bug related to the cache and retrieval of programs based on the OpenCL context | 2016-07-10 11:24:36 +02:00
clblast.cpp | Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dllimport) when not building the library | 2016-06-30 10:49:17 +02:00
clblast_c.cpp | Renamed all C++ source files to .cpp to match the .hpp extension better | 2016-06-19 13:55:49 +02:00
clpp11.hpp | clblast::RunKernel, cl::Kernel: unify variants with/without waitForEvents, support empty LWS | 2016-07-22 11:15:52 +03:00
routine.cpp | clblast::Database, clblast::Routine: implement "database overlays" provided by routine implementation | 2016-07-22 11:15:52 +03:00
routine.hpp | Minor improvements after merging in groundwork for custom tuning parameters and kernels | 2016-07-24 17:00:21 +02:00
utilities.cpp | test/correctness: read platform and device from environment | 2016-08-27 05:37:26 +03:00
utilities.hpp | test/correctness: read platform and device from environment | 2016-08-27 05:37:26 +03:00