Cedric Nugteren
|
aaa687ca98
|
Added preliminary support for the xNRM2 routines
|
2016-03-28 23:00:44 +02:00 |
|
Cedric Nugteren
|
1d5a702d9d
|
Added prototypes for ScNRM2/DzNRM2 routines
|
2016-03-25 10:30:38 +01:00 |
|
Cedric Nugteren
|
3876096c30
|
Added prototypes for SNRM2/DNRM2 routines
|
2016-03-25 10:00:40 +01:00 |
|
Cedric Nugteren
|
fb58129afb
|
Made testing against clBLAS in the client binaries truely optional (was partly implemented before)
|
2016-03-06 16:34:26 +01:00 |
|
Cedric Nugteren
|
7468e2ba9d
|
Adjusted the correctness-test error margins
|
2016-03-06 16:32:38 +01:00 |
|
Cedric Nugteren
|
306bf67660
|
Added preliminary support for xHPR2 and xSPR2 routines
|
2016-03-06 15:48:11 +01:00 |
|
Cedric Nugteren
|
60da54da5d
|
Added preliminary support for xHER2 and xSYR2 routines
|
2016-03-02 21:18:01 +01:00 |
|
Cedric Nugteren
|
e3545215a5
|
Added support for xHER, xHPR, xSYR, and xSPR routines
|
2016-02-28 14:16:48 +01:00 |
|
Cedric Nugteren
|
9f682aa66b
|
Set a proper default precision for the CLBlast clients
|
2016-02-20 14:41:53 +01:00 |
|
Cedric Nugteren
|
6dc44da07b
|
Added support for xGERU and xGERC routines
|
2016-02-20 14:15:41 +01:00 |
|
Cedric Nugteren
|
8854a73127
|
Added XGER routine, kernel, and tuner
|
2016-02-20 12:40:01 +01:00 |
|
CNugteren
|
fadd76207f
|
Fixed warnings under MSVC
|
2016-02-08 20:44:05 +01:00 |
|
CNugteren
|
fbf071ba62
|
Fixed a linker error in the performance client under GCC
|
2016-02-06 10:53:44 +01:00 |
|
CNugteren
|
9622d3be22
|
Fixes for compilation under Visual Studio
|
2016-01-30 14:57:49 +01:00 |
|
Cedric Nugteren
|
44fb40e5c4
|
Prepared for MSVC support
|
2016-01-30 11:54:29 +01:00 |
|
Cedric Nugteren
|
f573fe6bb3
|
Fixed a bug in the graph scripts (thanks to Victor Pakhomov)
|
2016-01-30 11:53:54 +01:00 |
|
CNugteren
|
2b56c2c603
|
Added TRMV/TBMV/TPMV routines
|
2015-09-26 16:58:03 +02:00 |
|
CNugteren
|
de6547a92b
|
Added SBMV and SPMV routines
|
2015-09-19 18:01:19 +02:00 |
|
CNugteren
|
80da67d28b
|
Added the HPMV routine
|
2015-09-19 17:40:38 +02:00 |
|
CNugteren
|
c32c4a9739
|
Added infrastructure for packed matrices
|
2015-09-19 17:37:42 +02:00 |
|
CNugteren
|
aebd156869
|
Added the HBMV routine
|
2015-09-19 11:11:34 +02:00 |
|
CNugteren
|
4507ba4997
|
Added first version of banded matrix-vector multiplication
|
2015-09-18 15:25:20 +02:00 |
|
CNugteren
|
4796c9bcbd
|
Added generated main functions for correctness/performance tests for level 2 routines
|
2015-09-18 10:19:03 +02:00 |
|
CNugteren
|
6105ad6f5b
|
Added interface of all level 2 routines
|
2015-09-17 17:05:45 +02:00 |
|
CNugteren
|
6307d2e5db
|
Added script to generate API interface and implementation automatically
|
2015-09-17 10:14:33 +02:00 |
|
CNugteren
|
a2e726d3bd
|
Added xDOT/xDOTU/xDOTC dot-product routines
|
2015-09-14 16:57:00 +02:00 |
|
CNugteren
|
e0c5312abb
|
Added support for the dot buffer and offset argument
|
2015-09-14 12:28:50 +02:00 |
|
CNugteren
|
b0b81deae1
|
Minor update of options-printing syntax
|
2015-08-24 07:38:20 +02:00 |
|
CNugteren
|
ff0c54c386
|
Added the XSWAP, XSCAL and XCOPY level-1 routines
|
2015-08-22 17:11:20 +02:00 |
|
CNugteren
|
b46de22433
|
Moved precision tester to utilities
|
2015-08-19 19:34:29 +02:00 |
|
CNugteren
|
938ca2707f
|
Added HEMV routine
|
2015-07-31 17:35:42 +02:00 |
|
CNugteren
|
b89517a2e7
|
Added SYMV routine
|
2015-07-31 17:13:41 +02:00 |
|
CNugteren
|
c5d5adbddd
|
Refactored the correctness tests
|
2015-07-31 15:52:13 +02:00 |
|
CNugteren
|
f7199b831f
|
Now using the new Claduc C++11 OpenCL header
|
2015-07-27 07:18:06 +02:00 |
|
CNugteren
|
0273b622d3
|
Made the graph script robust against diagnostic system messages
|
2015-07-22 21:30:02 +02:00 |
|
CNugteren
|
ba0026d2b9
|
Changed performance graphs to default to column-major
|
2015-07-15 22:21:24 +02:00 |
|
CNugteren
|
aa852bbe67
|
Added subfolders for the level1/2/3 routines
|
2015-07-12 16:57:09 +02:00 |
|
CNugteren
|
b5d39d9d0c
|
Added the HEMM routine, tester, and client
|
2015-07-12 15:11:50 +02:00 |
|
CNugteren
|
b02876d6e9
|
Added the HER2K routine, tester, and client
|
2015-07-10 20:59:20 +02:00 |
|
CNugteren
|
919bba3eaf
|
Added the HERK routine, tester, and client
|
2015-07-10 07:19:59 +02:00 |
|
CNugteren
|
2fe3fe1580
|
The clients now distinguish between the memory and alpha/beta data-type
|
2015-07-10 07:18:12 +02:00 |
|
CNugteren
|
82469fc764
|
The testers now distinguish between the memory and alpha/beta data-type
|
2015-07-08 07:21:44 +02:00 |
|
CNugteren
|
d9ea0c47c6
|
Added the TRMM routine, tester, and client
|
2015-07-02 07:16:04 +02:00 |
|
CNugteren
|
500416aa38
|
Fixed the order of arguments
|
2015-07-02 07:12:49 +02:00 |
|
CNugteren
|
e3dd35f91b
|
Added the unit/non-unit diagonal enum
|
2015-07-01 09:39:41 +02:00 |
|
CNugteren
|
a591d5607d
|
Added constness to all cl_mem objects
|
2015-06-30 07:35:54 +02:00 |
|
CNugteren
|
14186af590
|
Added TRMM and TRSM clBLAS wrappers
|
2015-06-30 07:19:46 +02:00 |
|
CNugteren
|
ede78fe499
|
Fixed the license for the correctness testers
|
2015-06-29 20:39:51 +02:00 |
|
CNugteren
|
2914a285d4
|
Re-organized the performance-client infrastructure to avoid code duplication
|
2015-06-29 20:38:34 +02:00 |
|
CNugteren
|
e5c0edbfd7
|
Re-organized the test infrastructure to avoid code duplication
|
2015-06-28 15:52:57 +02:00 |
|