Development version (next release) - Re-organized test/client infrastructure to avoid code duplication - Bypasses pre/post-processing kernels if possible (in level-3 routines) - Added level-3 routines: * CHEMM/ZHEMM * SSYRK/DSYRK/CSYRK/ZSYRK * CHERK/ZHERK * SSYR2K/DSYR2K/CSYR2K/ZSYR2K * CHER2K/ZHER2K * STRMM/DTRMM/CTRMM/ZTRMM Version 0.2.0 - Added support for complex conjugate transpose - Several host-code performance improvements - Improved testing infrastructure and coverage - Added level-2 routines: * SGEMV/DGEMV/CGEMV/ZGEMV - Added level-3 routines: * CGEMM/ZGEMM * CSYMM/ZSYMM Version 0.1.0 - Initial preview version release to GitHub - Supported level-1 routines: * SAXPY/DAXPY/CAXPY/ZAXPY - Supported level-3 routines: * SGEMM/DGEMM * SSYMM/DSYMM