Commit graph

223 commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Cedric Nugteren | e3545215a5 | Added support for xHER, xHPR, xSYR, and xSPR routines | 2016-02-28 14:16:48 +01:00 |
| Cedric Nugteren | 9f682aa66b | Set a proper default precision for the CLBlast clients | 2016-02-20 14:41:53 +01:00 |
| Cedric Nugteren | 6dc44da07b | Added support for xGERU and xGERC routines | 2016-02-20 14:15:41 +01:00 |
| Cedric Nugteren | 8854a73127 | Added XGER routine, kernel, and tuner | 2016-02-20 12:40:01 +01:00 |
| Cedric Nugteren | 165a94c200 | Various fixes to the database script | 2016-02-07 16:39:37 +01:00 |
| Cedric Nugteren | 00be6f7530 | Added dictionary with short and long OpenCL vendor names to fix issues with Intel having multiple names | 2016-02-07 11:59:30 +01:00 |
| Cedric Nugteren | c76f1d9dbb | Made the tuning database an optional external download | 2016-02-07 10:59:51 +01:00 |
| CNugteren | 704a729f5c | Made the database script compatible with Python 3 | 2016-02-06 13:11:36 +01:00 |
| Cedric Nugteren | 276e772a2c | Added first auto-generated database headers from the Python database; only K40 and Iris supported now | 2016-01-30 11:43:21 +01:00 |
| Cedric Nugteren | 76c9148030 | Minor improvements to the database script, including proper file paths | 2016-01-24 17:56:27 +01:00 |
| Cedric Nugteren | f0b3091cdb | Added Python function to compute defaults for a particular device/vendor combination | 2016-01-24 17:35:31 +01:00 |
| CNugteren | 09c94b17cf | Added tuning data for Tesla K40 | 2015-10-28 21:20:42 +01:00 |
| CNugteren | bb4e78f737 | Added initial tuning database with Intel Iris data | 2015-10-25 16:49:59 +01:00 |
| CNugteren | ccd1a5c7cc | Updated tuning database script according to the new JSON format | 2015-10-25 16:49:29 +01:00 |
| CNugteren | a2d5d7770e | Moved the tuner database script to a separate folder | 2015-10-25 16:27:14 +01:00 |
| CNugteren | 2b56c2c603 | Added TRMV/TBMV/TPMV routines | 2015-09-26 16:58:03 +02:00 |
| CNugteren | de6547a92b | Added SBMV and SPMV routines | 2015-09-19 18:01:19 +02:00 |
| CNugteren | 80da67d28b | Added the HPMV routine | 2015-09-19 17:40:38 +02:00 |
| CNugteren | aebd156869 | Added the HBMV routine | 2015-09-19 11:11:34 +02:00 |
| CNugteren | 4507ba4997 | Added first version of banded matrix-vector multiplication | 2015-09-18 15:25:20 +02:00 |
| CNugteren | 4796c9bcbd | Added generated main functions for correctness/performance tests for level 2 routines | 2015-09-18 10:19:03 +02:00 |
| CNugteren | 6105ad6f5b | Added interface of all level 2 routines | 2015-09-17 17:05:45 +02:00 |
| CNugteren | 6307d2e5db | Added script to generate API interface and implementation automatically | 2015-09-17 10:14:33 +02:00 |