Commit Graph

103 Commits (613ee24ab7f47fe075b6c88d92cdccc1eefea585)

Author SHA1 Message Date
Cedric Nugteren 613ee24ab7
Add tuning results for GTX4050, MX450, and i7-12700H (#541) 2024-05-15 17:39:38 +00:00
Cedric Nugteren 6e2ab6ee96
Add tuning results for 5 devices (#526) 2024-02-08 20:33:33 +01:00
Cedric Nugteren 9535155ad8
Add tuning results for 4 devices (#518) 2023-11-12 20:47:00 +01:00
Cedric Nugteren 29e13d5a33
Add tuning results for 5 devices (#503) 2023-09-14 21:14:26 +02:00
Cedric Nugteren 83bd474eda
Add tuning results for 7 devices (#494) 2023-07-04 21:09:12 +02:00
Cedric Nugteren af667c45fe
Add 6 tuning results (#493)
* Add tuning results for 6 devices

* Add GPU generation names for NVIDIA and AMD GPUs in the documentation
2023-06-24 12:10:08 +02:00
Cedric Nugteren 2b98c6a28c
Add tuning results for more devices (#488)
Add tuning results for 13 devices
2023-06-06 21:31:35 +02:00
Cedric Nugteren ec733402a8
Add tuning results for Radeon RX 6700 XT (#484) 2023-06-01 21:51:33 +02:00
Cedric Nugteren 05a26111f7
Add tuning results for 14 devices (#483) 2023-05-31 21:06:52 +02:00
Cedric Nugteren 63eb127bad
Intel HD Graphics 770 and AMD RX 6600 XT tuning results (#474)
* Add tuning results for AMD Radeon RX 6600 XT

* Add tuning results for Intel HD Graphics 770

* Update list of tuned devices
2023-05-21 14:25:15 +02:00
Mikko Vedru 0832ed6a16
Documentation: tuning.md: Add a useful link (#473) 2023-05-21 08:07:43 +00:00
Cedric Nugteren 9eca896b05 Fix documentation bug w.r.t. ld values and matrix layout 2023-03-25 20:24:40 +01:00
Cedric Nugteren f107162e64 Add tuning results for Adreno 540 2022-04-25 20:36:18 +02:00
Cedric Nugteren c4163b4b1a Add tuning results for Radeon RX 6500 XT 2022-04-25 20:33:47 +02:00
Cedric Nugteren 7ec8b2f29b Add tuning results for Radeon RX 6800 XT 2022-04-25 20:31:55 +02:00
Gard Spreemann df1eebc120 PPA for older Ubuntus 2021-08-24 12:36:35 +02:00
Gard Spreemann 3b1e14acd6 Let the installation documentation reflect the fact that CLBlast is now in Debian and Ubuntu 2021-08-24 11:27:42 +02:00
Cedric Nugteren 2eaabeed10 Added a note on clock frequencies for tuning 2021-08-19 22:38:18 +02:00
Cedric Nugteren c2951b8a2a Updated README and tuning list 2021-08-19 20:37:46 +02:00
Cedric Nugteren 0ee39af5ed Add tuning results for TITAN RTX 2020-10-10 13:01:12 +02:00
Cedric Nugteren 481d86665f Add tuning results for Radeon RX Vega 2020-10-10 12:56:28 +02:00
Cedric Nugteren 5f97d64505 Update API documentation 2020-03-08 11:29:47 +01:00
Cedric Nugteren 7084311e45 Added tuning parameters for Tesla P100 16GB 2019-02-09 16:31:48 +01:00
Cedric Nugteren 1035e533cd Added tuning parameters for Xeon E5-2630 v3 and v4 2019-02-09 16:29:30 +01:00
Cedric Nugteren 11f4c7dd93 Added documentation on the convgemm routine 2019-01-19 15:44:19 +01:00
Koichi Akabe c0883cf2fe Update the documentation 2018-12-18 14:08:16 +09:00
Cedric Nugteren 4676ec2921 Added a FAQ document 2018-12-01 17:19:28 +01:00
Koichi Akabe 032e3b0cc0 Add kernel_mode option to im2col, col2im, and convgemm functions 2018-11-12 10:12:07 +09:00
Cedric Nugteren 6f67525ea6 Changed col2im to append to the existing im-buffer 2018-11-07 19:45:07 +01:00
Cedric Nugteren 2d32a23293 Added new col2im routine to the documentation 2018-11-01 21:46:19 +01:00
Cedric Nugteren d45911b61d Added groundwork for col2im algorithm plus first non-working version of kernel and test 2018-10-23 20:52:25 +02:00
Cedric Nugteren 634b2bc75c
Merge pull request #319 from CNugteren/convgemm_multi_kernel
First im2col+GEMM implementation of convolution
2018-10-14 17:27:45 +02:00
Cedric Nugteren 8676b62178 Updated the documentation for GEMV tuning 2018-10-13 17:43:51 +02:00
Cedric Nugteren 83ba3d4b7b Merge branch 'master' into convgemm_multi_kernel 2018-09-16 20:01:18 +02:00
Cedric Nugteren 91dbd580ab Added a kernel-parameter pair table to document the tuning API 2018-09-15 18:47:31 +02:00
Cedric Nugteren c788e040f7 Added xCONVGEMM as im2col plus a batched GEMM kernel 2018-09-07 22:02:44 +02:00
Hendrik Ranocha faed209f30
Add Julia Wrapper
I've written a wrapper of CLBlast in Julia which can be found [here](https://github.com/JuliaGPU/CLBlast.jl). It is published and available using the Julia package manager.
2018-09-03 15:57:16 +02:00
Cedric Nugteren 2dd539f911 Removed complex numbers support for CONVGEMM 2018-07-29 10:37:14 +02:00
Cedric Nugteren 5903820ba2 Merge branch 'master' into CLBlast-267-convgemm 2018-07-29 10:26:34 +02:00
Cedric Nugteren db179a1e40 Updated to CLBlast version 1.4.1 2018-07-14 12:29:06 +02:00
Cedric Nugteren f72620f474 Added tuning results for Intel i5-4970S 2018-07-13 21:25:21 +02:00
Cedric Nugteren 08b1417956 Added tuning results for GeForce GTX 1070 Ti 2018-07-13 21:07:32 +02:00
Cedric Nugteren c459582c4f Added tuning results for HD Graphics 6000 Broadwell GT3 2018-07-13 21:05:43 +02:00
Cedric Nugteren 1c9a741470 Merge branch 'master' into CLBlast-267-convgemm 2018-06-03 15:53:27 +02:00
Cedric Nugteren fee8df153c Added list of tuners to be run by 'alltuners' target 2018-06-03 10:42:15 +02:00
Cedric Nugteren cbcd4ff7e8 Merge branch 'master' into CLBlast-267-convgemm 2018-05-19 17:54:27 +02:00
Cedric Nugteren 66583b3cda The GEMM routine tuner now loads kernel JSON tuning results from disk if available; now run part of alltuners target 2018-05-19 12:48:59 +02:00
Cedric Nugteren ad57a45039 Added documentation on some details of the GEMM implementation 2018-05-17 12:50:03 +02:00
Cedric Nugteren a4119531ee Updated the documentation for convgemm to include data layout (NCHW) 2018-05-09 17:46:27 +02:00
Cedric Nugteren 2d1f6ba7fe Added convgemm skeleton, test infrastructure, and first reference implementation 2018-05-06 11:35:34 +02:00