mirror of
https://github.com/CNugteren/CLBlast.git
synced 2024-07-02 12:26:57 +02:00
Updated the changelog
This commit is contained in:
parent
0022107b2a
commit
9f22bc232b
|
@ -4,6 +4,7 @@ Development (next version)
|
||||||
- Added CLBlast to Ubuntu PPA and macOS Homebrew package managers
|
- Added CLBlast to Ubuntu PPA and macOS Homebrew package managers
|
||||||
- Added an API to run the tuners programmatically without any I/O
|
- Added an API to run the tuners programmatically without any I/O
|
||||||
- Improved the performance potential by adding a second tunable GEMM kernel with 2D register tiling
|
- Improved the performance potential by adding a second tunable GEMM kernel with 2D register tiling
|
||||||
|
- Added support for Intel specific subgroup shuffling extensions for faster GEMM on Intel GPUs
|
||||||
- Re-added a local memory size constraint to the tuners
|
- Re-added a local memory size constraint to the tuners
|
||||||
- Updated and reorganised the CLBlast documentation
|
- Updated and reorganised the CLBlast documentation
|
||||||
- Various minor fixes and enhancements
|
- Various minor fixes and enhancements
|
||||||
|
|
Loading…
Reference in a new issue