CLBlast/src
2016-06-08 10:13:37 +02:00
..
kernels Added global memory synchronisation for better cache performance on ARM Mali GPUs 2016-06-08 10:13:37 +02:00
routines Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM 2016-05-25 13:29:53 +02:00
tuning Prepared the GER kernels and tuner for half-precision support 2016-05-22 16:18:08 +02:00
cache.cc Added a program cache (per-context) next to the per-device binary cache 2016-05-01 12:56:08 +02:00
clblast.cc Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM 2016-05-25 13:29:53 +02:00
clblast_c.cc Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM 2016-05-25 13:29:53 +02:00
database.cc Added level-3 half-precision routines HGEMM/HSYMM/HSYRK/HSYR2K/HTRMM 2016-05-25 13:29:53 +02:00
routine.cc Added global memory synchronisation for better cache performance on ARM Mali GPUs 2016-06-08 10:13:37 +02:00
utilities.cc Added half-precision tests for the clBLAS reference through conversion to single-precision 2016-05-26 23:36:19 +02:00