20 commits

Author SHA1 Message Date
Cedric Nugteren 88ba1f4db9 Added performance results for the Skylake ULT GT2 GPU 2016-11-20 20:36:56 +01:00
Cedric Nugteren a670c4c4bf All enums in the C API are now prefixed with CLBlast to avoid potential name clashes with other projects 2016-10-22 16:14:56 +02:00
Cedric Nugteren 42ee4abbbc Updated the performance graphs for Intel Iris Pro GPU and AMD Radeon M370X GPU 2016-10-10 18:07:05 +02:00
Cedric Nugteren f563341e7b Added fresh performance graphs for GeForce 750Ti; removed old GTX480 results 2016-10-10 16:59:28 +02:00
Cedric Nugteren 52ccaf5b25 Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and/or transposing 2016-06-16 18:07:46 +02:00
Cedric Nugteren 995a528cec Improved API documentation and added documentation for level-2 and level-3 routines 2016-06-13 20:17:26 +02:00
Cedric Nugteren 4fb8f9517c Added documentation for the matrix-update level-2 family of routines 2016-06-10 11:16:06 +02:00
Cedric Nugteren 9f87455070 Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM 2016-05-25 13:29:53 +02:00
Cedric Nugteren 3e9a07f00a Added level-2 half-precision routines HGER/HSYR/HSPR/HSYR2/HSPR2 2016-05-22 16:59:14 +02:00
Cedric Nugteren 95b828da12 Added level-2 half-precision routines HGEMV/HGBMV/HHEMV/HHBMV/HHPMV/HSYMV/HSBMV/HSPMV/HTRMV/HTBMV/HTPMV 2016-05-22 15:38:26 +02:00
Cedric Nugteren 803aaf3070 Added level-1 half-precision routines HSWAP/HSCAL/HCOPY/HAXPY/HDOT/HNRM2/HASUM/HSUM/iHAMAX/iHMAX/iHMIN 2016-05-22 14:47:14 +02:00
Cedric Nugteren 489c5d76cf Merged in latest changes from 0.7.1 release 2016-05-18 21:32:56 +02:00
Cedric Nugteren 120c31a30f Initial experimental version of the half-precision HAXPY routine 2016-05-13 20:49:34 +02:00
Cedric Nugteren ed2904a344 Added preliminary generated API documentation 2016-05-08 09:49:00 +02:00
Cedric Nugteren 801218ba10 Added performance graphs for Intel Iris and Radeon M370X 2016-03-12 16:04:23 +01:00
CNugteren 547b7afffc Updated the performance results, added HD7950 2015-07-23 18:25:39 +02:00
CNugteren 7c8d16147a Added the SYR2K routine, tester, and client 2015-06-26 08:12:56 +02:00
CNugteren 96e4012349 Added SSYRK performance graphs 2015-06-25 19:19:31 +02:00
CNugteren 6aac23be86 Updated performance graphs for Intel Iris GPUs 2015-06-21 09:12:42 +02:00
CNugteren bc5a341dfe Initial commit of preview version 2015-05-30 12:30:43 +02:00