database | Added tuning results for ARM Mali T760 GPU | 2017-11-24 21:16:54 +01:00
kernels | Implemented first simple pre-processor: defines parser and loop unrolling based on assumptions | 2017-11-25 17:46:01 +01:00
routines | Moved compilation function to separate file; removed dependency of tuners of the CLBlast library | 2017-11-17 20:57:46 +01:00
tuning | Fixes some displaying issues in the GEMM routine tuner | 2017-11-20 20:29:52 +01:00
utilities | Moved string splitting functions; added string character removal function | 2017-11-25 17:44:21 +01:00
api_common.cpp | Added first (untested) version of a CUDA API | 2017-10-11 23:16:57 +02:00
cache.cpp | Made RemoveBySubset from the cache work with references to keys | 2017-02-12 11:58:20 +01:00
cache.hpp | Added platform ID to the binary program cache to prevent issues with multi-platform systems | 2017-10-29 20:01:30 +01:00
clblast.cpp | Moved non-routine-specific API functions and includes to separate files | 2017-10-08 21:52:02 +02:00
clblast_c.cpp | Added interface and stubs for the im2col routine | 2017-07-02 12:10:22 +02:00
clblast_cuda.cpp | Various fixes to make the host code and sample compile with the CUDA API | 2017-10-14 11:43:57 +02:00
clblast_netlib_c.cpp | Added interface and stubs for the im2col routine | 2017-07-02 12:10:22 +02:00
clpp11.hpp | Potentially fixed an MSVC 2013 issue with a copy-constructor not being generated | 2017-11-20 20:54:18 +01:00
cupp11.hpp | Potentially fixed an MSVC 2013 issue with a copy-constructor not being generated | 2017-11-20 20:54:18 +01:00
cxpp11_common.hpp | Various fixes to make the host code and sample compile with the CUDA API | 2017-10-14 11:43:57 +02:00
kernel_preprocessor.cpp | Improved the kernel pre-processor in various ways | 2017-11-28 20:52:08 +01:00
kernel_preprocessor.hpp | Implemented first simple pre-processor: defines parser and loop unrolling based on assumptions | 2017-11-25 17:46:01 +01:00
routine.cpp | Factored out the creation of the OpenCL header and the program compilation | 2017-11-11 16:14:43 +01:00
routine.hpp | Moved the remaining OpenCL specific host code to the clpp11.h header where it belongs | 2017-10-08 10:29:47 +02:00