Cedric Nugteren
|
e9dec268bc
|
Add tuning results for Intel Core i9-9980HK
|
2021-08-19 20:25:26 +02:00 |
Cedric Nugteren
|
e59ea46180
|
Add tuning results for NVIDIA A100
|
2021-08-19 20:23:25 +02:00 |
Cedric Nugteren
|
0ee39af5ed
|
Add tuning results for TITAN RTX
|
2020-10-10 13:01:12 +02:00 |
Cedric Nugteren
|
481d86665f
|
Add tuning results for Radeon RX Vega
|
2020-10-10 12:56:28 +02:00 |
Cedric Nugteren
|
7084311e45
|
Added tuning parameters for Tesla P100 16GB
|
2019-02-09 16:31:48 +01:00 |
Cedric Nugteren
|
1035e533cd
|
Added tuning parameters for Xeon E5-2630 v3 and v4
|
2019-02-09 16:29:30 +01:00 |
Cedric Nugteren
|
c42e48068b
|
Added a few more initial Intel tuning parameters for convgemm
|
2019-01-19 15:32:35 +01:00 |
Cedric Nugteren
|
560f7a40f6
|
Added convgemm to the CLBlast database, added initial parameters for Skylake GPU
|
2018-12-31 19:05:34 +01:00 |
Cedric Nugteren
|
bf24421a34
|
Updated the tuning results for Intel IvyBridge M GT2
|
2018-07-31 20:49:41 +02:00 |
Cedric Nugteren
|
f72620f474
|
Added tuning results for Intel i5-4970S
|
2018-07-13 21:25:21 +02:00 |
Cedric Nugteren
|
08b1417956
|
Added tuning results for GeForce GTX 1070 Ti
|
2018-07-13 21:07:32 +02:00 |
Cedric Nugteren
|
c459582c4f
|
Added tuning results for HD Graphics 6000 Broadwell GT3
|
2018-07-13 21:05:43 +02:00 |
Cedric Nugteren
|
f14e6f87d2
|
Updated tuning results for the Skylake ULT GT2 GPU with the new kernel
|
2018-04-15 11:45:45 +02:00 |
Cedric Nugteren
|
0f49dd24e5
|
Updated database with defaults of GEMMK=0 and KREG=1
|
2018-04-10 21:26:18 +02:00 |
Cedric Nugteren
|
77ba11f686
|
Extended the maximum number of tuning parameters from 14 to 16
|
2018-04-08 18:12:54 +02:00 |
Cedric Nugteren
|
16f7f49683
|
Added tuning results for NVIDIA GeForce 970
|
2018-04-07 17:48:25 +02:00 |
Cedric Nugteren
|
9596e46d01
|
Added tuning results for NVIDIA GeForce 920MX
|
2018-04-07 17:44:32 +02:00 |
Cedric Nugteren
|
048fe90e57
|
Added tuning results for Intel HD Graphics 620
|
2018-04-07 17:33:57 +02:00 |
Cedric Nugteren
|
c9b5d614e2
|
Fixed a vendor naming bug in the tuners and in the database
|
2018-01-06 17:02:58 +01:00 |
Cedric Nugteren
|
1e738db6dd
|
Split the database into multiple small compilation units
|
2017-12-27 12:04:22 +01:00 |
Cedric Nugteren
|
7aabeb44cc
|
Updated the tuning results for the IvyBridge M GT2 GPU
|
2017-12-23 15:46:41 +01:00 |
Cedric Nugteren
|
b1f52f130c
|
Updated the database to use the new TRSV and Invert tuners
|
2017-12-23 13:55:22 +01:00 |
Cedric Nugteren
|
0ee81e27b9
|
Added tuning results for Apple AMD Radeon Pro 580
|
2017-12-20 19:59:31 +01:00 |
Cedric Nugteren
|
69f6591564
|
Removed all ARM Mali tuning results; re-added Mali-T760 and Mali-T628 results based on kernel pre-processor
|
2017-12-17 16:59:08 +01:00 |
Cedric Nugteren
|
abb4d5ab32
|
Added tuning results for ARM Mali T760 GPU
|
2017-11-24 21:16:54 +01:00 |
Cedric Nugteren
|
c41d219ea4
|
Added tuning results for the GeForce GTX750Ti
|
2017-11-09 21:19:21 +01:00 |
Cedric Nugteren
|
3ec0be6fb8
|
Added various GEMM routine tuning results
|
2017-11-07 21:34:54 +01:00 |
Cedric Nugteren
|
33ac2b0175
|
Improved the way the database defaults are computed
|
2017-11-06 21:59:45 +01:00 |
Cedric Nugteren
|
9b0a435fb0
|
Integrated the GEMM routine tuner for kernel selection; added first tuning results
|
2017-11-02 21:47:14 +01:00 |
Cedric Nugteren
|
73272ab97d
|
Fixed a bug in database compression/decompression
|
2017-11-02 21:19:18 +01:00 |
Cedric Nugteren
|
472f90501c
|
Added tuning parameters for GeForce GTX 580, GeForce GTX 1080Ti, and Core i5-4570
|
2017-10-20 18:06:12 +02:00 |
Cedric Nugteren
|
0802e3d84c
|
Added tuning results for Intel Core i7 6770HQ
|
2017-09-16 21:19:06 +02:00 |
Cedric Nugteren
|
4e317f5e85
|
Improved compilation time of the tuner database
|
2017-09-16 18:02:37 +02:00 |
Cedric Nugteren
|
0d13d814c2
|
Added architecture layer in the tuning database for better performance on unseen devices
|
2017-09-14 21:27:33 +02:00 |
Cedric Nugteren
|
20da5e33a8
|
Split the database files over multiple directories and files; first step towards separate compilation
|
2017-09-06 21:50:42 +02:00 |
Cedric Nugteren
|
18d832e149
|
Added tuning results for the Qualcomm Adreno 330 GPU
|
2017-07-30 18:18:02 +02:00 |
Cedric Nugteren
|
1a8ed48a35
|
Fixed some Clang and MSVC warnings
|
2017-06-25 11:50:36 +02:00 |
Cedric Nugteren
|
615a7fdc81
|
Fixes some compilation issues related to the database structure change
|
2017-06-21 23:07:47 +02:00 |
Cedric Nugteren
|
e44feb8576
|
Changed the structure of the database to reduce compilation time and save memory
|
2017-06-20 21:19:26 +02:00 |
Cedric Nugteren
|
48f2682eb7
|
Added tuning results for the Core i7-920 CPU
|
2017-06-18 20:53:59 +02:00 |
Cedric Nugteren
|
33ed1e5a06
|
Added tuning results for GeForce GT 650M (thanks to bzcheeseman)
|
2017-06-01 22:52:08 +02:00 |
Cedric Nugteren
|
71933c3411
|
Added tuning results for the AMD Radeon Fiji GPU
|
2017-05-11 22:53:52 -07:00 |
Cedric Nugteren
|
1c33af6eab
|
Re-added Titan X (Pascal) tuning results based on more averaging when tuning
|
2017-04-23 17:58:56 +02:00 |
Cedric Nugteren
|
e41d204856
|
Increased the default number of runs for GEMV tuning; updated GEMV tuning results for Iris Pro
|
2017-04-21 22:12:20 +02:00 |
Cedric Nugteren
|
e9ef037549
|
Added tuning results for the Radeon HD6750M GPU (Apple OpenCL)
|
2017-03-04 15:24:55 +01:00 |
Cedric Nugteren
|
ea6790665d
|
Merge branch 'development' into triangular_solvers
|
2017-02-26 14:51:45 +01:00 |
Cedric Nugteren
|
0643a29af5
|
Added tuning parameters for the AMD RX480 GPU (Ellesmere)
|
2017-02-18 13:59:10 +01:00 |
Cedric Nugteren
|
dc93523204
|
Added tuning results for Titan X (Pascal version)
|
2017-02-08 21:14:38 +01:00 |
Cedric Nugteren
|
c248f900c0
|
Merge branch 'development' into triangular_solvers
|
2017-02-05 22:18:59 +01:00 |
Cedric Nugteren
|
fec8c1a806
|
Completed a first STRSV implementation
|
2017-02-04 16:04:19 +01:00 |