CLBlast/CHANGELOG

55 lines
1.5 KiB
Plaintext
Raw Normal View History

2015-05-30 12:30:43 +02:00
Development version (next release)
- Improved structure and performance of level-2 routines (xSYMV/xHEMV)
- Added level-1 routines:
* SSWAP/DSWAP/CSWAP/ZSWAP
* SSCAL/DSCAL/CSCAL/ZSCAL
* SCOPY/DCOPY/CCOPY/ZCOPY
* SDOT/DDOT
* CDOTU/ZDOTU
* CDOTC/ZDOTC
- Added level-2 routines:
* SGBMV/DGBMV/CGBMV/ZGBMV
2015-09-19 11:11:34 +02:00
* CHBMV/ZHBMV
2015-09-19 17:40:38 +02:00
* CHPMV/ZHPMV
2015-08-22 12:41:40 +02:00
Version 0.4.0
- Now using the Claduc C++11 interface to OpenCL
2015-08-13 18:00:09 +02:00
- Added plain C API for increased compatibility (clblast_c.h)
2015-08-22 12:40:18 +02:00
- Re-organized tuner infrastructure and added JSON output
- Removed clBLAS sources, it should now be installed separately for testing
- Added Travis continuous integration
2015-07-31 17:44:17 +02:00
- Added level-2 routines:
* CHEMV/ZHEMV
* SSYMV/DSYMV
2015-07-24 08:25:32 +02:00
Version 0.3.0
- Re-organized test/client infrastructure to avoid code duplication
- Added an optional bypass for pre/post-processing kernels in level-3 routines
- Significantly improved performance of level-3 routines on AMD GPUs
2015-06-24 07:52:19 +02:00
- Added level-3 routines:
2015-07-12 15:14:35 +02:00
* CHEMM/ZHEMM
2015-06-24 07:52:19 +02:00
* SSYRK/DSYRK/CSYRK/ZSYRK
2015-07-12 15:14:35 +02:00
* CHERK/ZHERK
* SSYR2K/DSYR2K/CSYR2K/ZSYR2K
2015-07-12 15:14:35 +02:00
* CHER2K/ZHER2K
* STRMM/DTRMM/CTRMM/ZTRMM
2015-06-24 07:52:19 +02:00
2015-06-21 09:13:08 +02:00
Version 0.2.0
- Added support for complex conjugate transpose
- Several host-code performance improvements
- Improved testing infrastructure and coverage
2015-06-15 08:41:37 +02:00
- Added level-2 routines:
* SGEMV/DGEMV/CGEMV/ZGEMV
- Added level-3 routines:
* CGEMM/ZGEMM
* CSYMM/ZSYMM
2015-06-15 08:41:37 +02:00
2015-05-30 12:30:43 +02:00
Version 0.1.0
- Initial preview version release to GitHub
- Supported level-1 routines:
* SAXPY/DAXPY/CAXPY/ZAXPY
2015-05-30 12:30:43 +02:00
- Supported level-3 routines:
* SGEMM/DGEMM
* SSYMM/DSYMM