CLBlast/test
2018-07-29 10:37:14 +02:00
..
correctness Removed complex numbers support for CONVGEMM 2018-07-29 10:37:14 +02:00
performance Removed complex numbers support for CONVGEMM 2018-07-29 10:37:14 +02:00
routines Merge branch 'master' into CLBlast-267-convgemm 2018-05-19 17:54:27 +02:00
diagnostics.cpp Moved timing function to a separate file 2017-10-28 14:12:05 +02:00
test_utilities.cpp Fixed a bug in loading xgemm-direct JSON data from disk 2018-05-19 12:48:04 +02:00
test_utilities.hpp Fixes for the CUDA backend of CLBlast 2017-12-24 12:10:55 +01:00
wrapper_cblas.hpp Added MKL as an alternative for CBLAS for correctness and performance comparisons 2018-06-02 17:57:45 +02:00
wrapper_clblas.hpp Removed half-precision support from the TRSM routine; too unstable 2017-02-26 12:56:21 +01:00
wrapper_cublas.hpp Fixed CUDA malloc and cuBLAS handles: cuBLAS as a performance-reference now works 2017-04-13 21:31:27 +02:00
wrapper_cuda.hpp Fix an incompatibility with CUDA's FP16 definition 2017-10-17 20:29:23 +02:00