Cedric Nugteren
2b98c6a28c
Add tuning results for more devices ( #488 )
...
Add tuning results for 13 devices
2023-06-06 21:31:35 +02:00
Cedric Nugteren
ec733402a8
Add tuning results for Radeon RX 6700 XT ( #484 )
2023-06-01 21:51:33 +02:00
Cedric Nugteren
05a26111f7
Add tuning results for 14 devices ( #483 )
2023-05-31 21:06:52 +02:00
Cedric Nugteren
63eb127bad
Intel HD Graphics 770 and AMD RX 6600 XT tuning results ( #474 )
...
* Add tuning results for AMD Radeon RX 6600 XT
* Add tuning results for Intel HD Graphics 770
* Update list of tuned devices
2023-05-21 14:25:15 +02:00
Mikko Vedru
0832ed6a16
Documentation: tuning.md: Add a useful link ( #473 )
2023-05-21 08:07:43 +00:00
Cedric Nugteren
9eca896b05
Fix documentation bug w.r.t. ld values and matrix layout
2023-03-25 20:24:40 +01:00
Cedric Nugteren
f107162e64
Add tuning results for Adreno 540
2022-04-25 20:36:18 +02:00
Cedric Nugteren
c4163b4b1a
Add tuning results for Radeon RX 6500 XT
2022-04-25 20:33:47 +02:00
Cedric Nugteren
7ec8b2f29b
Add tuning results for Radeon RX 6800 XT
2022-04-25 20:31:55 +02:00
Gard Spreemann
df1eebc120
PPA for older Ubuntus
2021-08-24 12:36:35 +02:00
Gard Spreemann
3b1e14acd6
Let the installation documentation reflect the fact that CLBlast is now in Debian and Ubuntu
2021-08-24 11:27:42 +02:00
Cedric Nugteren
2eaabeed10
Added a note on clock frequencies for tuning
2021-08-19 22:38:18 +02:00
Cedric Nugteren
c2951b8a2a
Updated README and tuning list
2021-08-19 20:37:46 +02:00
Cedric Nugteren
0ee39af5ed
Add tuning results for TITAN RTX
2020-10-10 13:01:12 +02:00
Cedric Nugteren
481d86665f
Add tuning results for Radeon RX Vega
2020-10-10 12:56:28 +02:00
Cedric Nugteren
5f97d64505
Update API documentation
2020-03-08 11:29:47 +01:00
Cedric Nugteren
7084311e45
Added tuning parameters for Tesla P100 16GB
2019-02-09 16:31:48 +01:00
Cedric Nugteren
1035e533cd
Added tuning parameters for Xeon E5-2630 v3 and v4
2019-02-09 16:29:30 +01:00
Cedric Nugteren
11f4c7dd93
Added documentation on the convgemm routine
2019-01-19 15:44:19 +01:00
Koichi Akabe
c0883cf2fe
Update the documentation
2018-12-18 14:08:16 +09:00
Cedric Nugteren
4676ec2921
Added a FAQ document
2018-12-01 17:19:28 +01:00
Koichi Akabe
032e3b0cc0
Add kernel_mode option to im2col, col2im, and convgemm functions
2018-11-12 10:12:07 +09:00
Cedric Nugteren
6f67525ea6
Changed col2im to append to the existing im-buffer
2018-11-07 19:45:07 +01:00
Cedric Nugteren
2d32a23293
Added new col2im routine to the documentation
2018-11-01 21:46:19 +01:00
Cedric Nugteren
d45911b61d
Added groundwork for col2im algorithm plus first non-working version of kernel and test
2018-10-23 20:52:25 +02:00
Cedric Nugteren
634b2bc75c
Merge pull request #319 from CNugteren/convgemm_multi_kernel
...
First im2col+GEMM implementation of convolution
2018-10-14 17:27:45 +02:00
Cedric Nugteren
8676b62178
Updated the documentation for GEMV tuning
2018-10-13 17:43:51 +02:00
Cedric Nugteren
83ba3d4b7b
Merge branch 'master' into convgemm_multi_kernel
2018-09-16 20:01:18 +02:00
Cedric Nugteren
91dbd580ab
Added a kernel-parameter pair table to document the tuning API
2018-09-15 18:47:31 +02:00
Cedric Nugteren
c788e040f7
Added xCONVGEMM as im2col plus a batched GEMM kernel
2018-09-07 22:02:44 +02:00
Hendrik Ranocha
faed209f30
Add Julia Wrapper
...
I've written a wrapper of CLBlast in Julia which can be found [here](https://github.com/JuliaGPU/CLBlast.jl ). It is published and available using the Julia package manager.
2018-09-03 15:57:16 +02:00
Cedric Nugteren
2dd539f911
Removed complex numbers support for CONVGEMM
2018-07-29 10:37:14 +02:00
Cedric Nugteren
5903820ba2
Merge branch 'master' into CLBlast-267-convgemm
2018-07-29 10:26:34 +02:00
Cedric Nugteren
db179a1e40
Updated to CLBlast version 1.4.1
2018-07-14 12:29:06 +02:00
Cedric Nugteren
f72620f474
Added tuning results for Intel i5-4970S
2018-07-13 21:25:21 +02:00
Cedric Nugteren
08b1417956
Added tuning results for GeForce GTX 1070 Ti
2018-07-13 21:07:32 +02:00
Cedric Nugteren
c459582c4f
Added tuning results for HD Graphics 6000 Broadwell GT3
2018-07-13 21:05:43 +02:00
Cedric Nugteren
1c9a741470
Merge branch 'master' into CLBlast-267-convgemm
2018-06-03 15:53:27 +02:00
Cedric Nugteren
fee8df153c
Added list of tuners to be run by 'alltuners' target
2018-06-03 10:42:15 +02:00
Cedric Nugteren
cbcd4ff7e8
Merge branch 'master' into CLBlast-267-convgemm
2018-05-19 17:54:27 +02:00
Cedric Nugteren
66583b3cda
The GEMM routine tuner now loads kernel JSON tuning results from disk if available; now run part of alltuners target
2018-05-19 12:48:59 +02:00
Cedric Nugteren
ad57a45039
Added documentation on some details of the GEMM implementation
2018-05-17 12:50:03 +02:00
Cedric Nugteren
a4119531ee
Updated the documentation for convgemm to include data layout (NCHW)
2018-05-09 17:46:27 +02:00
Cedric Nugteren
2d1f6ba7fe
Added convgemm skeleton, test infrastructure, and first reference implementation
2018-05-06 11:35:34 +02:00
Cedric Nugteren
2776d76176
Added interface of batched convolution as GEMM
2018-05-05 14:06:33 +02:00
Cedric Nugteren
16f7f49683
Added tuning results for NVIDIA GeForce 970
2018-04-07 17:48:25 +02:00
Cedric Nugteren
9596e46d01
Added tuning results for NVIDIA GeForce 920MX
2018-04-07 17:44:32 +02:00
Cedric Nugteren
7e69c422af
Updated the roadmap
2018-03-30 10:05:16 +02:00
Cedric Nugteren
934893972e
Merge pull request #262 from CNugteren/CLBlast-237-tuning-api
...
CLBlast #237 : Tuning API
2018-03-11 15:38:33 +01:00
Cedric Nugteren
49b02ec194
Added initial glossary
2018-03-10 17:02:38 +01:00