.. |
database | Updated the tuning results for Intel IvyBridge M GT2 | 2018-07-31 20:49:41 +02:00 |
kernels | Changed col2im to append to the existing im-buffer | 2018-11-07 19:45:07 +01:00 |
pyclblast | Updated pyclblast to 1.1.0 and uploaded to PyPi | 2018-03-30 10:38:36 +02:00 |
routines | Fix col2im implementation | 2018-10-30 14:54:55 +09:00 |
tuning | Fixed an MSVC compilation error due to large strings | 2018-09-15 17:35:26 +02:00 |
utilities | Fix col2im implementation | 2018-10-30 14:54:55 +09:00 |
api_common.cpp | Made tuning API more flexible: disregards any extra parameter values | 2018-10-13 17:47:29 +02:00 |
cache.cpp | Now stores a shared_ptr to the Program class in the cache | 2018-05-01 20:34:48 +02:00 |
cache.hpp | Now stores a shared_ptr to the Program class in the cache | 2018-05-01 20:34:48 +02:00 |
clblast.cpp | Added groundwork for col2im algorithm plus first non-working version of kernel and test | 2018-10-23 20:52:25 +02:00 |
clblast_c.cpp | Added groundwork for col2im algorithm plus first non-working version of kernel and test | 2018-10-23 20:52:25 +02:00 |
clblast_cuda.cpp | Added groundwork for col2im algorithm plus first non-working version of kernel and test | 2018-10-23 20:52:25 +02:00 |
clblast_netlib_c.cpp | Added groundwork for col2im algorithm plus first non-working version of kernel and test | 2018-10-23 20:52:25 +02:00 |
clpp11.hpp | Made last operation in TRSV and TRSM asynchronous, making the events not null | 2018-08-13 22:58:44 +02:00 |
cupp11.hpp | Applied feedback from Cedric from first pull request | 2018-07-14 19:50:47 -04:00 |
cxpp11_common.hpp | Various fixes to make the host code and sample compile with the CUDA API | 2017-10-14 11:43:57 +02:00 |
kernel_preprocessor.cpp | Fixed pre-processor warnings related to the subgroup shuffling | 2018-10-10 19:12:42 +02:00 |
kernel_preprocessor.hpp | Implemented first simple pre-processor: defines parser and loop unrolling based on assumptions | 2017-11-25 17:46:01 +01:00 |
routine.cpp | Eliminate a temporary Program object | 2018-07-06 12:58:20 +01:00 |
routine.hpp | Now stores a shared_ptr to the Program class in the cache | 2018-05-01 20:34:48 +02:00 |