CLBlast/src/routines/level3
Cedric Nugteren c0e41b87cb Fixed an issue for unequal MWG and NWG and the new GEMMK == 1 kernel 2018-11-30 20:23:26 +01:00
xgemm.cpp Fixed an issue for unequal MWG and NWG and the new GEMMK == 1 kernel 2018-11-30 20:23:26 +01:00
xgemm.hpp Fixes for Apple OpenCL CPU implementation which requires a LWGS of 1 when barriers are present 2018-06-01 20:59:44 +02:00
xhemm.cpp Routine, Cache: generalize, reduce amount of copying in fast path 2017-01-24 11:56:15 +03:00
xhemm.hpp Routine, Cache: generalize, reduce amount of copying in fast path 2017-01-24 11:56:15 +03:00
xher2k.cpp Expressed HER2K as two HERK calls 2018-04-18 20:58:29 +02:00
xher2k.hpp Expressed HER2K as two HERK calls 2018-04-18 20:58:29 +02:00
xherk.cpp Fixed an MSVC compilation error due to large strings 2018-09-15 17:35:26 +02:00
xherk.hpp Expressed HER2K as two HERK calls 2018-04-18 20:58:29 +02:00
xsymm.cpp Routine, Cache: generalize, reduce amount of copying in fast path 2017-01-24 11:56:15 +03:00
xsymm.hpp Routine, Cache: generalize, reduce amount of copying in fast path 2017-01-24 11:56:15 +03:00
xsyr2k.cpp Expressed SYR2K as two SYRK calls 2018-04-18 20:29:28 +02:00
xsyr2k.hpp Expressed SYR2K as two SYRK calls 2018-04-18 20:29:28 +02:00
xsyrk.cpp Fixed an MSVC compilation error due to large strings 2018-09-15 17:35:26 +02:00
xsyrk.hpp Expressed SYR2K as two SYRK calls 2018-04-18 20:29:28 +02:00
xtrmm.cpp Fixed half-precision bugs in HTBMV/HTPMV/HTRMV/HSYR2K/HTRMM related to incorrect constants 2017-02-27 21:00:04 +01:00
xtrmm.hpp Routine, Cache: generalize, reduce amount of copying in fast path 2017-01-24 11:56:15 +03:00
xtrsm.cpp Made last operation in TRSV and TRSM asynchronous, making the events not null 2018-08-13 22:58:44 +02:00
xtrsm.hpp Made last operation in TRSV and TRSM asynchronous, making the events not null 2018-08-13 22:58:44 +02:00