Cedric Nugteren
|
a5fd2323b6
|
Added prototype for the TRSV routine
|
2017-01-20 11:30:32 +01:00 |
|
Cedric Nugteren
|
681a465b35
|
Prepared for the addition of the TRSM triangular solver kernel
|
2016-12-18 12:30:16 +01:00 |
|
Cedric Nugteren
|
88ba1f4db9
|
Added performance results for the Skylake ULT GT2 GPU
|
2016-11-20 20:36:56 +01:00 |
|
Cedric Nugteren
|
a670c4c4bf
|
All enums in the C API are now prefixed with CLBlast to avoid potential name clashes with other projects
|
2016-10-22 16:14:56 +02:00 |
|
Cedric Nugteren
|
42ee4abbbc
|
Updated the performance graphs for Intel Iris Pro GPU and AMD Radeon M370X GPU
|
2016-10-10 18:07:05 +02:00 |
|
Cedric Nugteren
|
f563341e7b
|
Added fresh performance graphs for GeForce 750Ti; removed old GTX480 results
|
2016-10-10 16:59:28 +02:00 |
|
Cedric Nugteren
|
52ccaf5b25
|
Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and/or transposing
|
2016-06-16 18:07:46 +02:00 |
|
Cedric Nugteren
|
995a528cec
|
Improved API documentation and added documentation for level-2 and level-3 routines
|
2016-06-13 20:17:26 +02:00 |
|
Cedric Nugteren
|
4fb8f9517c
|
Added documentation for the matrix-update level-2 family of routines
|
2016-06-10 11:16:06 +02:00 |
|
Cedric Nugteren
|
9f87455070
|
Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM
|
2016-05-25 13:29:53 +02:00 |
|
Cedric Nugteren
|
3e9a07f00a
|
Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2
|
2016-05-22 16:59:14 +02:00 |
|
Cedric Nugteren
|
95b828da12
|
Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSBMV/HSPMV/HTRMV/HTBMV/HTPMV
|
2016-05-22 15:38:26 +02:00 |
|
Cedric Nugteren
|
803aaf3070
|
Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASUM/HSUM/iHAMAX/iHMAX/iHMIN
|
2016-05-22 14:47:14 +02:00 |
|
Cedric Nugteren
|
489c5d76cf
|
Merged in latest changes from 0.7.1 release
|
2016-05-18 21:32:56 +02:00 |
|
Cedric Nugteren
|
120c31a30f
|
Initial experimental version of the half-precision HAXPY routine
|
2016-05-13 20:49:34 +02:00 |
|
Cedric Nugteren
|
ed2904a344
|
Added preliminary generated API documentation
|
2016-05-08 09:49:00 +02:00 |
|
Cedric Nugteren
|
801218ba10
|
Added performance graphs for Intel Iris and Radeon M370X
|
2016-03-12 16:04:23 +01:00 |
|
CNugteren
|
547b7afffc
|
Updated the performance results, added HD7950
|
2015-07-23 18:25:39 +02:00 |
|
CNugteren
|
7c8d16147a
|
Added the SYR2K routine, tester, and client
|
2015-06-26 08:12:56 +02:00 |
|
CNugteren
|
96e4012349
|
Added SSYRK performance graphs
|
2015-06-25 19:19:31 +02:00 |
|
CNugteren
|
6aac23be86
|
Updated performance graphs for Intel Iris GPUs
|
2015-06-21 09:12:42 +02:00 |
|
CNugteren
|
bc5a341dfe
|
Initial commit of preview version
|
2015-05-30 12:30:43 +02:00 |
|