CNugteren
|
41ce480c51
|
Updated changelog with host-code performance optimisation
|
2015-06-19 07:34:00 +02:00 |
|
CNugteren
|
ab55df703d
|
Added const-ref accessors to all CL++11 classes
|
2015-06-19 07:28:35 +02:00 |
|
CNugteren
|
682c01a80c
|
Now returns program from database by reference
|
2015-06-18 18:44:14 +02:00 |
|
Cedric Nugteren
|
af40b8e667
|
Merge pull request #3 from CNugteren/conjugate_transpose
Added complex conjugate transpose
|
2015-06-17 07:15:30 +02:00 |
|
CNugteren
|
8b2dbdba98
|
Updated with conjugate transpose and CGEMM/ZGEMM CSYMM/ZSYMM
|
2015-06-17 07:12:45 +02:00 |
|
CNugteren
|
7e176ccac9
|
Added support for conjugate transpose in GEMV
|
2015-06-16 08:42:52 +02:00 |
|
CNugteren
|
d7a0d970e0
|
Moved layout and transpose types to generic tester and AXY routines
|
2015-06-16 08:07:26 +02:00 |
|
CNugteren
|
af78a04eca
|
Updated the tuners to set the conjugate argument
|
2015-06-16 07:50:45 +02:00 |
|
CNugteren
|
e03582a112
|
Added support for CGEMM/ZGEMM and CSYMM/ZSYMM
|
2015-06-16 07:45:09 +02:00 |
|
CNugteren
|
6ebca73d72
|
Created data-type dependent transpose values to test
|
2015-06-16 07:44:32 +02:00 |
|
CNugteren
|
8f01c644b5
|
Added support for complex conjugate transpose
|
2015-06-16 07:43:19 +02:00 |
|
Cedric Nugteren
|
9e2fba9ab9
|
Merge pull request #2 from CNugteren/xgemv
Added GEMV routine
|
2015-06-15 18:35:27 +02:00 |
|
CNugteren
|
f925d47dad
|
Added GEMV to changelog and readme
|
2015-06-15 08:41:37 +02:00 |
|
CNugteren
|
ce703a2f5a
|
Added tuning for DGEMV on Iris and SGEMV on K40m
|
2015-06-15 08:41:13 +02:00 |
|
CNugteren
|
e63d20e0b6
|
Enable tests of DGEMV/CGEMV/ZGEMV
|
2015-06-15 08:40:41 +02:00 |
|
CNugteren
|
01726197ab
|
Fixed a bug in AXPBY defines for complex data-types
|
2015-06-15 08:38:24 +02:00 |
|
CNugteren
|
7776323cb4
|
Added GEMV performance-graph script
|
2015-06-14 12:05:30 +02:00 |
|
CNugteren
|
294a3e3d41
|
Split the three variations of the GEMV kernel for maximal tuning freedom
|
2015-06-14 11:15:53 +02:00 |
|
CNugteren
|
ab0064dab7
|
Fixed number of threads launched for GEMV
|
2015-06-14 10:08:56 +02:00 |
|
CNugteren
|
9aa2989447
|
Fixed number of threads launched for AXPY
|
2015-06-14 10:08:23 +02:00 |
|
CNugteren
|
4b3e3dcfe0
|
Added a fast GEMV kernel with vector loads, no tail, and fewer if-statements
|
2015-06-13 20:46:01 +02:00 |
|
CNugteren
|
584f80c666
|
Enlarge test matrices and vectors for AXY routines (GEMV)
|
2015-06-13 17:42:41 +02:00 |
|
CNugteren
|
6662f5d8e9
|
Refactored the GEMV kernel
|
2015-06-13 17:07:31 +02:00 |
|
CNugteren
|
9b66883e9c
|
Improved GEMV kernel with local memory and a tunable WPT
|
2015-06-13 14:10:07 +02:00 |
|
CNugteren
|
e522d1a74e
|
Added initial version of GEMV including tester and performance client
|
2015-06-13 11:01:20 +02:00 |
|
CNugteren
|
7610a8b182
|
Updated calculation of required matrix size
|
2015-06-12 22:04:12 +02:00 |
|
CNugteren
|
abec7cd8c8
|
Fixed a bug in testing where the offset would be set incorrectly
|
2015-06-12 22:03:48 +02:00 |
|
CNugteren
|
bdc3444d5c
|
Added new tester for matrix-vector-vector routines
|
2015-06-11 07:39:23 +02:00 |
|
CNugteren
|
16c3bb9140
|
Proper testing of invalid buffer sizes (squared) for TestABC
|
2015-06-11 07:38:31 +02:00 |
|
CNugteren
|
5858193a50
|
Removed obsolete testing structure for TestABC
|
2015-06-11 07:37:52 +02:00 |
|
CNugteren
|
85c1db9322
|
Added initial naive version of Xgemv kernel
|
2015-06-10 08:44:30 +02:00 |
|
CNugteren
|
3c17c1c133
|
Added clBLAS for reference
|
2015-05-30 12:40:23 +02:00 |
|
CNugteren
|
7078e72100
|
Added clBLAS for reference
|
2015-05-30 12:39:32 +02:00 |
|
CNugteren
|
a447b942f4
|
Added clBLAS for reference
|
2015-05-30 12:35:53 +02:00 |
|
CNugteren
|
bc5a341dfe
|
Initial commit of preview version
|
2015-05-30 12:30:43 +02:00 |
|
CNugteren
|
c7b054ea67
|
Initial commit of preview version
|
2015-05-30 11:32:02 +02:00 |
|
Cedric Nugteren
|
5c9f306ab5
|
Initial commit
|
2015-05-30 10:50:51 +02:00 |
|