143 Commits (6e2ab6ee967c4a9b3350c7ce4e7d7b736c9e45f6)

Author SHA1 Message Date
Cedric Nugteren 6e2ab6ee96 Add tuning results for 5 devices (#526) 2024-02-08 20:33:33 +01:00
Cedric Nugteren 9535155ad8 Add tuning results for 4 devices (#518) 2023-11-12 20:47:00 +01:00
Cedric Nugteren 29e13d5a33 Add tuning results for 5 devices (#503) 2023-09-14 21:14:26 +02:00
Cedric Nugteren 83bd474eda Add tuning results for 7 devices (#494) 2023-07-04 21:09:12 +02:00
Cedric Nugteren af667c45fe Add 6 tuning results (#493) 2023-06-24 12:10:08 +02:00
* Add tuning results for 6 devices
* Add GPU generation names for NVIDIA and AMD GPUs in the documentation
Cedric Nugteren 2b98c6a28c Add tuning results for more devices (#488) 2023-06-06 21:31:35 +02:00
Add tuning results for 13 devices
Cedric Nugteren ec733402a8 Add tuning results for Radeon RX 6700 XT (#484) 2023-06-01 21:51:33 +02:00
Cedric Nugteren 05a26111f7 Add tuning results for 14 devices (#483) 2023-05-31 21:06:52 +02:00
Cedric Nugteren 63eb127bad Intel HD Graphics 770 and AMD RX 6600 XT tuning results (#474) 2023-05-21 14:25:15 +02:00
* Add tuning results for AMD Radeon RX 6600 XT
* Add tuning results for Intel HD Graphics 770
* Update list of tuned devices
Cedric Nugteren 7a3ef92ff2 Add 3 sets of tuning results: RX 5700 XT, 2080 Ti, and 3090 (#468) 2023-05-17 10:31:47 +02:00
* Add tuning results for AMD Radeon RX 5700 XT
* Add tuning results for NVIDIA GeForce RTX 2080 Ti
* Add tuning results for NVIDIA GeForce RTX 3090
Cedric Nugteren c9856758b3 Add tuning results for Intel FPGA emulation device 2023-01-21 21:13:49 +01:00
Cedric Nugteren f4a14daf8d Add tuning results for Radeon Pro 450 2023-01-21 21:11:38 +01:00
Cedric Nugteren 3ca1f5176e Add tuning results for Adreno 740 2023-01-21 21:09:09 +01:00
Cedric Nugteren d11b0c8b01 Add tuning results for Adreno 730 2023-01-21 20:33:49 +01:00
Angus, Alexander 4f394608a2 implemented changes to boost Adreno performance according to https://jira-dc.qualcomm.com/jira/browse/OSR-8731 2023-01-03 10:56:04 -08:00
Cedric Nugteren f107162e64 Add tuning results for Adreno 540 2022-04-25 20:36:18 +02:00
Cedric Nugteren c4163b4b1a Add tuning results for Radeon RX 6500 XT 2022-04-25 20:33:47 +02:00
Cedric Nugteren 7ec8b2f29b Add tuning results for Radeon RX 6800 XT 2022-04-25 20:31:55 +02:00
Cedric Nugteren 772dd307ab Add Quadro T2000 tuning parameters for the Tesla T4 2021-08-27 20:39:59 +02:00
Cedric Nugteren 1f639b7264 Remove Tesla T4 tuning results 2021-08-27 20:32:59 +02:00
Cedric Nugteren 5a9bd270f8 Add tuning results for NVIDIA Tesla V100 2021-08-19 20:34:09 +02:00
Cedric Nugteren adb4b02982 Add tuning results for NVIDIA Tesla T4 2021-08-19 20:31:52 +02:00
Cedric Nugteren dea3b5fadb Add tuning results for NVIDIA Quadro T2000 2021-08-19 20:29:47 +02:00
Cedric Nugteren 521ad117bc Add tuning results for NVIDIA Quadro GV100 2021-08-19 20:27:39 +02:00
Cedric Nugteren e9dec268bc Add tuning results for Intel Core i9-9980HK 2021-08-19 20:25:26 +02:00
Cedric Nugteren e59ea46180 Add tuning results for NVIDIA A100 2021-08-19 20:23:25 +02:00
Cedric Nugteren 0ee39af5ed Add tuning results for TITAN RTX 2020-10-10 13:01:12 +02:00
Cedric Nugteren 481d86665f Add tuning results for Radeon RX Vega 2020-10-10 12:56:28 +02:00
Cedric Nugteren e3ce88154a Silenced a new OpenCL warning message 2020-03-08 10:14:59 +01:00
Cedric Nugteren 7084311e45 Added tuning parameters for Tesla P100 16GB 2019-02-09 16:31:48 +01:00
Cedric Nugteren 1035e533cd Added tuning parameters for Xeon E5-2630 v3 and v4 2019-02-09 16:29:30 +01:00
Cedric Nugteren c42e48068b Added a few more initial Intel tuning parameters for convgemm 2019-01-19 15:32:35 +01:00
Cedric Nugteren 560f7a40f6 Added convgemm to the CLBlast database, added initial parameters for Skylake GPU 2018-12-31 19:05:34 +01:00
Cedric Nugteren bf24421a34 Updated the tuning results for Intel IvyBridge M GT2 2018-07-31 20:49:41 +02:00
Cedric Nugteren f72620f474 Added tuning results for Intel i5-4970S 2018-07-13 21:25:21 +02:00
Cedric Nugteren 08b1417956 Added tuning results for GeForce GTX 1070 Ti 2018-07-13 21:07:32 +02:00
Cedric Nugteren c459582c4f Added tuning results for HD Graphics 6000 Broadwell GT3 2018-07-13 21:05:43 +02:00
Cedric Nugteren 7c3431a72a Fixes for Apple OpenCL CPU implementation which requires a LWGS of 1 when barriers are present 2018-06-01 20:59:44 +02:00
Cedric Nugteren ff4d5558a6 Widened Apple OpenCL check, added way to debug too-large-workgroups issue 2018-05-30 22:59:04 +02:00
Cedric Nugteren a8bb0c9f3c Added Apple OpenCL TRSV block size override; removed failing old Intel GPU test from README 2018-05-29 21:29:12 +02:00
Cedric Nugteren f14e6f87d2 Updated tuning results for the Skylake ULT GT2 GPU with the new kernel 2018-04-15 11:45:45 +02:00
Cedric Nugteren 0f49dd24e5 Updated database with defaults of GEMMK=0 and KREG=1 2018-04-10 21:26:18 +02:00
Cedric Nugteren 77ba11f686 Extended the maximum number of tuning parameters from 14 to 16 2018-04-08 18:12:54 +02:00
Cedric Nugteren 16f7f49683 Added tuning results for NVIDIA GeForce 970 2018-04-07 17:48:25 +02:00
Cedric Nugteren 9596e46d01 Added tuning results for NVIDIA GeForce 920MX 2018-04-07 17:44:32 +02:00
Cedric Nugteren 048fe90e57 Added tuning results for Intel HD Graphics 620 2018-04-07 17:33:57 +02:00
Cedric Nugteren 7a756cbce7 Fixed a failing TRSV test using a CPU with Apple OpenCL 2018-03-15 20:58:42 +01:00
Cedric Nugteren a500f537d8 Added a RetrieveParameters function to inspect tuning parameters 2018-01-11 20:32:06 +01:00
Cedric Nugteren c9b5d614e2 Fixed a vendor naming bug in the tuners and in the database 2018-01-06 17:02:58 +01:00
Cedric Nugteren e71c037304 Fixed a performance overhead in database creation: it is again a static variable now as it was before 2018-01-06 11:28:04 +01:00