cnugteren
|
e0497807e2
|
Added prototype for xASUM routines
|
2016-04-13 21:44:49 -06:00 |
|
cnugteren
|
1a82861a90
|
Added support for testing (performance and correctness) against a CPU BLAS library
|
2016-04-02 11:58:00 -07:00 |
|
cnugteren
|
8c3c6db7d0
|
Merge branch 'level1_routines' into development
|
2016-03-30 21:37:56 -07:00 |
|
Cedric Nugteren
|
c1df786764
|
Added prototypes for the xROTM and xROTMG routines
|
2016-03-30 16:13:37 -07:00 |
|
Cedric Nugteren
|
6ecc0d089c
|
Added prototypes for the xROT and xROTG functions
|
2016-03-30 16:13:32 -07:00 |
|
Cedric Nugteren
|
aaa687ca98
|
Added preliminary support for the xNRM2 routines
|
2016-03-28 23:00:44 +02:00 |
|
Cedric Nugteren
|
1d5a702d9d
|
Added prototypes for ScNRM2/DzNRM2 routines
|
2016-03-25 10:30:38 +01:00 |
|
Cedric Nugteren
|
3876096c30
|
Added prototypes for SNRM2/DNRM2 routines
|
2016-03-25 10:00:40 +01:00 |
|
Cedric Nugteren
|
fb58129afb
|
Made testing against clBLAS in the client binaries truely optional (was partly implemented before)
|
2016-03-06 16:34:26 +01:00 |
|
Cedric Nugteren
|
9f682aa66b
|
Set a proper default precision for the CLBlast clients
|
2016-02-20 14:41:53 +01:00 |
|
CNugteren
|
fadd76207f
|
Fixed warnings under MSVC
|
2016-02-08 20:44:05 +01:00 |
|
CNugteren
|
fbf071ba62
|
Fixed a linker error in the performance client under GCC
|
2016-02-06 10:53:44 +01:00 |
|
Cedric Nugteren
|
44fb40e5c4
|
Prepared for MSVC support
|
2016-01-30 11:54:29 +01:00 |
|
Cedric Nugteren
|
f573fe6bb3
|
Fixed a bug in the graph scripts (thanks to Victor Pakhomov)
|
2016-01-30 11:53:54 +01:00 |
|
CNugteren
|
c32c4a9739
|
Added infrastructure for packed matrices
|
2015-09-19 17:37:42 +02:00 |
|
CNugteren
|
4507ba4997
|
Added first version of banded matrix-vector multiplication
|
2015-09-18 15:25:20 +02:00 |
|
CNugteren
|
4796c9bcbd
|
Added generated main functions for correctness/performance tests for level 2 routines
|
2015-09-18 10:19:03 +02:00 |
|
CNugteren
|
a2e726d3bd
|
Added xDOT/xDOTU/xDOTC dot-product routines
|
2015-09-14 16:57:00 +02:00 |
|
CNugteren
|
e0c5312abb
|
Added support for the dot buffer and offset argument
|
2015-09-14 12:28:50 +02:00 |
|
CNugteren
|
b0b81deae1
|
Minor update of options-printing syntax
|
2015-08-24 07:38:20 +02:00 |
|
CNugteren
|
ff0c54c386
|
Added the XSWAP, XSCAL and XCOPY level-1 routines
|
2015-08-22 17:11:20 +02:00 |
|
CNugteren
|
938ca2707f
|
Added HEMV routine
|
2015-07-31 17:35:42 +02:00 |
|
CNugteren
|
b89517a2e7
|
Added SYMV routine
|
2015-07-31 17:13:41 +02:00 |
|
CNugteren
|
f7199b831f
|
Now using the new Claduc C++11 OpenCL header
|
2015-07-27 07:18:06 +02:00 |
|
CNugteren
|
0273b622d3
|
Made the graph script robust against diagnostic system messages
|
2015-07-22 21:30:02 +02:00 |
|
CNugteren
|
ba0026d2b9
|
Changed performance graphs to default to column-major
|
2015-07-15 22:21:24 +02:00 |
|
CNugteren
|
aa852bbe67
|
Added subfolders for the level1/2/3 routines
|
2015-07-12 16:57:09 +02:00 |
|
CNugteren
|
b5d39d9d0c
|
Added the HEMM routine, tester, and client
|
2015-07-12 15:11:50 +02:00 |
|
CNugteren
|
b02876d6e9
|
Added the HER2K routine, tester, and client
|
2015-07-10 20:59:20 +02:00 |
|
CNugteren
|
919bba3eaf
|
Added the HERK routine, tester, and client
|
2015-07-10 07:19:59 +02:00 |
|
CNugteren
|
2fe3fe1580
|
The clients now distinguish between the memory and alpha/beta data-type
|
2015-07-10 07:18:12 +02:00 |
|
CNugteren
|
d9ea0c47c6
|
Added the TRMM routine, tester, and client
|
2015-07-02 07:16:04 +02:00 |
|
CNugteren
|
500416aa38
|
Fixed the order of arguments
|
2015-07-02 07:12:49 +02:00 |
|
CNugteren
|
e3dd35f91b
|
Added the unit/non-unit diagonal enum
|
2015-07-01 09:39:41 +02:00 |
|
CNugteren
|
2914a285d4
|
Re-organized the performance-client infrastructure to avoid code duplication
|
2015-06-29 20:38:34 +02:00 |
|
CNugteren
|
7c8d16147a
|
Added the SYR2K routine, tester, and client
|
2015-06-26 08:12:56 +02:00 |
|
CNugteren
|
75f263ce3a
|
Added symmetric matrix support for the ABC performance tester
|
2015-06-26 08:10:23 +02:00 |
|
CNugteren
|
60a88aac86
|
Added the SYRK routine, tester, and client
|
2015-06-24 07:50:18 +02:00 |
|
CNugteren
|
a17297937d
|
Added performance-client for AC routines
|
2015-06-23 22:31:27 +02:00 |
|
CNugteren
|
0a3831e6d1
|
Updated bandwidth computation for GEMM and SYMM
|
2015-06-23 08:09:46 +02:00 |
|
CNugteren
|
ea7da6a497
|
Fixed support for complex data-types for GEMM and SYMM clients
|
2015-06-21 11:21:03 +02:00 |
|
CNugteren
|
7776323cb4
|
Added GEMV performance-graph script
|
2015-06-14 12:05:30 +02:00 |
|
CNugteren
|
e522d1a74e
|
Added initial version of GEMV including tester and performance client
|
2015-06-13 11:01:20 +02:00 |
|
CNugteren
|
bc5a341dfe
|
Initial commit of preview version
|
2015-05-30 12:30:43 +02:00 |
|