CLBlast/test
2018-05-21 11:47:16 +02:00
..
correctness Further implemented single-kernel approach of convgemm; extended test to capture other parts of the kernel code 2018-05-21 11:47:16 +02:00
performance Fixed the performance client for convgemm and added GFLOPS measurements 2018-05-09 19:59:31 +02:00
routines Merge branch 'master' into CLBlast-267-convgemm 2018-05-19 17:54:27 +02:00
diagnostics.cpp Moved timing function to a separate file 2017-10-28 14:12:05 +02:00
test_utilities.cpp Fixed a bug in loading xgemm-direct JSON data from disk 2018-05-19 12:48:04 +02:00
test_utilities.hpp Fixes for the CUDA backend of CLBlast 2017-12-24 12:10:55 +01:00
wrapper_cblas.hpp Fixed some Clang and MSVC warnings 2017-06-25 11:50:36 +02:00
wrapper_clblas.hpp Removed half-precision support from the TRSM routine; too unstable 2017-02-26 12:56:21 +01:00
wrapper_cublas.hpp Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now works 2017-04-13 21:31:27 +02:00
wrapper_cuda.hpp Fix an incompatibility with CUDA's FP16 definition 2017-10-17 20:29:23 +02:00