CLBlast/test
2018-05-09 17:23:55 +02:00
..
correctness Split channels/strides testing values off from kernel sizes for more flexibility 2018-05-09 17:23:55 +02:00
performance Added convgemm skeleton, test infrastructure, and first reference implementation 2018-05-06 11:35:34 +02:00
routines Added convgemm skeleton, test infrastructure, and first reference implementation 2018-05-06 11:35:34 +02:00
diagnostics.cpp Moved timing function to a separate file 2017-10-28 14:12:05 +02:00
test_utilities.cpp Fixes for the CUDA backend of CLBlast 2017-12-24 12:10:55 +01:00
test_utilities.hpp Fixes for the CUDA backend of CLBlast 2017-12-24 12:10:55 +01:00
wrapper_cblas.hpp Fixed some Clang and MSVC warnings 2017-06-25 11:50:36 +02:00
wrapper_clblas.hpp Removed half-precision support from the TRSM routine; too unstable 2017-02-26 12:56:21 +01:00
wrapper_cublas.hpp Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now works 2017-04-13 21:31:27 +02:00
wrapper_cuda.hpp Fix an incompatibility with CUDA's FP16 definition 2017-10-17 20:29:23 +02:00