2015-05-30 12:30:43 +02:00
|
|
|
|
2015-07-24 20:50:00 +02:00
|
|
|
Development version (next release)
|
2015-07-31 11:15:48 +02:00
|
|
|
- Now using the Claduc C++11 interface to OpenCL
|
|
|
|
- Removed clBLAS sources, it should now be installed separately for testing
|
2015-07-24 20:50:00 +02:00
|
|
|
|
2015-07-24 08:25:32 +02:00
|
|
|
Version 0.3.0
|
2015-06-29 20:42:34 +02:00
|
|
|
- Re-organized test/client infrastructure to avoid code duplication
|
2015-07-24 08:16:41 +02:00
|
|
|
- Added an optional bypass for pre/post-processing kernels in level-3 routines
|
|
|
|
- Significantly improved performance of level-3 routines on AMD GPUs
|
2015-06-24 07:52:19 +02:00
|
|
|
- Added level-3 routines:
|
2015-07-12 15:14:35 +02:00
|
|
|
* CHEMM/ZHEMM
|
2015-06-24 07:52:19 +02:00
|
|
|
* SSYRK/DSYRK/CSYRK/ZSYRK
|
2015-07-12 15:14:35 +02:00
|
|
|
* CHERK/ZHERK
|
2015-06-26 08:12:56 +02:00
|
|
|
* SSYR2K/DSYR2K/CSYR2K/ZSYR2K
|
2015-07-12 15:14:35 +02:00
|
|
|
* CHER2K/ZHER2K
|
|
|
|
* STRMM/DTRMM/CTRMM/ZTRMM
|
2015-06-24 07:52:19 +02:00
|
|
|
|
2015-06-21 09:13:08 +02:00
|
|
|
Version 0.2.0
|
2015-06-17 07:12:45 +02:00
|
|
|
- Added support for complex conjugate transpose
|
2015-06-20 16:47:50 +02:00
|
|
|
- Several host-code performance improvements
|
|
|
|
- Improved testing infrastructure and coverage
|
2015-06-15 08:41:37 +02:00
|
|
|
- Added level-2 routines:
|
2015-06-20 16:47:50 +02:00
|
|
|
* SGEMV/DGEMV/CGEMV/ZGEMV
|
2015-06-17 07:12:45 +02:00
|
|
|
- Added level-3 routines:
|
2015-06-20 16:47:50 +02:00
|
|
|
* CGEMM/ZGEMM
|
|
|
|
* CSYMM/ZSYMM
|
2015-06-15 08:41:37 +02:00
|
|
|
|
2015-05-30 12:30:43 +02:00
|
|
|
Version 0.1.0
|
|
|
|
- Initial preview version release to GitHub
|
|
|
|
- Supported level-1 routines:
|
2015-06-20 16:47:50 +02:00
|
|
|
* SAXPY/DAXPY/CAXPY/ZAXPY
|
2015-05-30 12:30:43 +02:00
|
|
|
- Supported level-3 routines:
|
2015-06-20 16:47:50 +02:00
|
|
|
* SGEMM/DGEMM
|
|
|
|
* SSYMM/DSYMM
|