CLBlast

Tarmo Räntilä 21b66ca761 Reduce TestMatrix calls for xgemmstridedbatched. Replace the looped test by a single one with the offset of the last batch.		2019-12-09 22:17:24 +02:00
xaxpybatched.cpp	Made buffers of batched routines read/write (was: read-only)	2017-10-17 19:56:47 +02:00
xaxpybatched.hpp	Make batched routines based on offsets instead of a vector of cl_mem objects - undoing many earlier changes	2017-03-08 20:10:20 +01:00
xcol2im.cpp	Add kernel_mode option to im2col, col2im, and convgemm functions	2018-11-12 10:12:07 +09:00
xcol2im.hpp	Add kernel_mode option to im2col, col2im, and convgemm functions	2018-11-12 10:12:07 +09:00
xconvgemm.cpp	Added convgemm to the CLBlast database, added initial parameters for Skylake GPU	2018-12-31 19:05:34 +01:00
xconvgemm.hpp	Fix xconvgemm kernel and enable ConvGemmMethod::kSingleKernel	2018-12-18 13:56:00 +09:00
xgemmbatched.cpp	Reduce TestMatrix calls for xgemmbatched.	2019-12-09 22:13:52 +02:00
xgemmbatched.hpp	Reduced duplicate code in the batched GEMM implementation	2018-01-06 19:26:11 +01:00
xgemmstridedbatched.cpp	Reduce TestMatrix calls for xgemmstridedbatched.	2019-12-09 22:17:24 +02:00
xgemmstridedbatched.hpp	Added API and tests for new GemmStridedBatched routine	2018-01-07 14:27:15 +01:00
xhad.cpp	Implemented the XHAD Hadamard product routine	2018-02-02 21:18:37 +01:00
xhad.hpp	Created the API and stubs for the HAD (hadamard-product) routines	2018-01-31 20:41:02 +01:00
xim2col.cpp	Add kernel_mode option to im2col, col2im, and convgemm functions	2018-11-12 10:12:07 +09:00
xim2col.hpp	Add kernel_mode option to im2col, col2im, and convgemm functions	2018-11-12 10:12:07 +09:00
xinvert.cpp	Fixes for Apple OpenCL CPU implementation which requires a LWGS of 1 when barriers are present	2018-06-01 20:59:44 +02:00
xinvert.hpp	Added a first version of the diagonal block invert routine in preparation of TRSM	2017-01-15 17:30:00 +01:00
xomatcopy.cpp	Routine, Cache: generalize, reduce amount of copying in fast path	2017-01-24 11:56:15 +03:00
xomatcopy.hpp	treewide: use C++ exceptions properly	2016-10-22 08:45:25 +03:00