Cedric Nugteren
|
6e2ab6ee96
|
Add tuning results for 5 devices (#526)
|
2024-02-08 20:33:33 +01:00 |
Cedric Nugteren
|
9535155ad8
|
Add tuning results for 4 devices (#518)
|
2023-11-12 20:47:00 +01:00 |
Cedric Nugteren
|
29e13d5a33
|
Add tuning results for 5 devices (#503)
|
2023-09-14 21:14:26 +02:00 |
Cedric Nugteren
|
83bd474eda
|
Add tuning results for 7 devices (#494)
|
2023-07-04 21:09:12 +02:00 |
Cedric Nugteren
|
af667c45fe
|
Add 6 tuning results (#493)
* Add tuning results for 6 devices
* Add GPU generation names for NVIDIA and AMD GPUs in the documentation
|
2023-06-24 12:10:08 +02:00 |
Cedric Nugteren
|
2b98c6a28c
|
Add tuning results for more devices (#488)
Add tuning results for 13 devices
|
2023-06-06 21:31:35 +02:00 |
Cedric Nugteren
|
ec733402a8
|
Add tuning results for Radeon RX 6700 XT (#484)
|
2023-06-01 21:51:33 +02:00 |
Cedric Nugteren
|
05a26111f7
|
Add tuning results for 14 devices (#483)
|
2023-05-31 21:06:52 +02:00 |
Cedric Nugteren
|
63eb127bad
|
Intel HD Graphics 770 and AMD RX 6600 XT tuning results (#474)
* Add tuning results for AMD Radeon RX 6600 XT
* Add tuning results for Intel HD Graphics 770
* Update list of tuned devices
|
2023-05-21 14:25:15 +02:00 |
Cedric Nugteren
|
7a3ef92ff2
|
Add 3 sets of tuning results: RX 5700 XT, 2080 Ti, and 3090 (#468)
* Add tuning results for AMD Radeon RX 5700 XT
* Add tuning results for NVIDIA GeForce RTX 2080 Ti
* Add tuning results for NVIDIA GeForce RTX 3090
|
2023-05-17 10:31:47 +02:00 |
Cedric Nugteren
|
c9856758b3
|
Add tuning results for Intel FPGA emulation device
|
2023-01-21 21:13:49 +01:00 |
Cedric Nugteren
|
f4a14daf8d
|
Add tuning results for Radeon Pro 450
|
2023-01-21 21:11:38 +01:00 |
Cedric Nugteren
|
3ca1f5176e
|
Add tuning results for Adreno 740
|
2023-01-21 21:09:09 +01:00 |
Cedric Nugteren
|
d11b0c8b01
|
Add tuning results for Adreno 730
|
2023-01-21 20:33:49 +01:00 |
Angus, Alexander
|
4f394608a2
|
implemented changes to boost Adreno performance according to https://jira-dc.qualcomm.com/jira/browse/OSR-8731
|
2023-01-03 10:56:04 -08:00 |
Cedric Nugteren
|
f107162e64
|
Add tuning results for Adreno 540
|
2022-04-25 20:36:18 +02:00 |
Cedric Nugteren
|
c4163b4b1a
|
Add tuning results for Radeon RX 6500 XT
|
2022-04-25 20:33:47 +02:00 |
Cedric Nugteren
|
7ec8b2f29b
|
Add tuning results for Radeon RX 6800 XT
|
2022-04-25 20:31:55 +02:00 |
Cedric Nugteren
|
772dd307ab
|
Add Quadro T2000 tuning parameters for the Tesla T4
|
2021-08-27 20:39:59 +02:00 |
Cedric Nugteren
|
1f639b7264
|
Remove Tesla T4 tuning results
|
2021-08-27 20:32:59 +02:00 |
Cedric Nugteren
|
5a9bd270f8
|
Add tuning results for NVIDIA Tesla V100
|
2021-08-19 20:34:09 +02:00 |
Cedric Nugteren
|
adb4b02982
|
Add tuning results for NVIDIA Tesla T4
|
2021-08-19 20:31:52 +02:00 |
Cedric Nugteren
|
dea3b5fadb
|
Add tuning results for NVIDIA Quadro T2000
|
2021-08-19 20:29:47 +02:00 |
Cedric Nugteren
|
521ad117bc
|
Add tuning results for NVIDIA Quadro GV100
|
2021-08-19 20:27:39 +02:00 |
Cedric Nugteren
|
e9dec268bc
|
Add tuning results for Intel Core i9-9980HK
|
2021-08-19 20:25:26 +02:00 |
Cedric Nugteren
|
e59ea46180
|
Add tuning results for NVIDIA A100
|
2021-08-19 20:23:25 +02:00 |
Cedric Nugteren
|
0ee39af5ed
|
Add tuning results for TITAN RTX
|
2020-10-10 13:01:12 +02:00 |
Cedric Nugteren
|
481d86665f
|
Add tuning results for Radeon RX Vega
|
2020-10-10 12:56:28 +02:00 |
Cedric Nugteren
|
e3ce88154a
|
Silenced a new OpenCL warning message
|
2020-03-08 10:14:59 +01:00 |
Cedric Nugteren
|
7084311e45
|
Added tuning parameters for Tesla P100 16GB
|
2019-02-09 16:31:48 +01:00 |
Cedric Nugteren
|
1035e533cd
|
Added tuning parameters for Xeon E5-2630 v3 and v4
|
2019-02-09 16:29:30 +01:00 |
Cedric Nugteren
|
c42e48068b
|
Added a few more initial Intel tuning parameters for convgemm
|
2019-01-19 15:32:35 +01:00 |
Cedric Nugteren
|
560f7a40f6
|
Added convgemm to the CLBlast database, added initial parameters for Skylake GPU
|
2018-12-31 19:05:34 +01:00 |
Cedric Nugteren
|
bf24421a34
|
Updated the tuning results for Intel IvyBridge M GT2
|
2018-07-31 20:49:41 +02:00 |
Cedric Nugteren
|
f72620f474
|
Added tuning results for Intel i5-4970S
|
2018-07-13 21:25:21 +02:00 |
Cedric Nugteren
|
08b1417956
|
Added tuning results for GeForce GTX 1070 Ti
|
2018-07-13 21:07:32 +02:00 |
Cedric Nugteren
|
c459582c4f
|
Added tuning results for HD Graphics 6000 Broadwell GT3
|
2018-07-13 21:05:43 +02:00 |
Cedric Nugteren
|
7c3431a72a
|
Fixes for Apple OpenCL CPU implementation which requires a LWGS of 1 when barriers are present
|
2018-06-01 20:59:44 +02:00 |
Cedric Nugteren
|
ff4d5558a6
|
Widened Apple OpenCL check, added way to debug too-large-workgroups issue
|
2018-05-30 22:59:04 +02:00 |
Cedric Nugteren
|
a8bb0c9f3c
|
Added Apple OpenCL TRSV block size override; removed failing old Intel GPU test from README
|
2018-05-29 21:29:12 +02:00 |
Cedric Nugteren
|
f14e6f87d2
|
Updated tuning results for the Skylake ULT GT2 GPU with the new kernel
|
2018-04-15 11:45:45 +02:00 |
Cedric Nugteren
|
0f49dd24e5
|
Updated database with defaults of GEMMK=0 and KREG=1
|
2018-04-10 21:26:18 +02:00 |
Cedric Nugteren
|
77ba11f686
|
Extended the maximum number of tuning parameters from 14 to 16
|
2018-04-08 18:12:54 +02:00 |
Cedric Nugteren
|
16f7f49683
|
Added tuning results for NVIDIA GeForce 970
|
2018-04-07 17:48:25 +02:00 |
Cedric Nugteren
|
9596e46d01
|
Added tuning results for NVIDIA GeForce 920MX
|
2018-04-07 17:44:32 +02:00 |
Cedric Nugteren
|
048fe90e57
|
Added tuning results for Intel HD Graphics 620
|
2018-04-07 17:33:57 +02:00 |
Cedric Nugteren
|
7a756cbce7
|
Fixed a failing TRSV test using a CPU with Apple OpenCL
|
2018-03-15 20:58:42 +01:00 |
Cedric Nugteren
|
a500f537d8
|
Added a RetrieveParameters function to inspect tuning parameters
|
2018-01-11 20:32:06 +01:00 |
Cedric Nugteren
|
c9b5d614e2
|
Fixed a vendor naming bug in the tuners and in the database
|
2018-01-06 17:02:58 +01:00 |
Cedric Nugteren
|
e71c037304
|
Fixed a performance overhead in database creation: it is again a static variable now as it was before
|
2018-01-06 11:28:04 +01:00 |