Author | Commit | Message | Date
CNugteren | 2b56c2c603 | Added TRMV/TBMV/TPMV routines | 2015-09-26 16:58:03 +02:00
CNugteren | 04d28b0420 | Made buffer copying a const-method for the source | 2015-09-26 16:48:11 +02:00
CNugteren | de6547a92b | Added SBMV and SPMV routines | 2015-09-19 18:01:19 +02:00
CNugteren | 80da67d28b | Added the HPMV routine | 2015-09-19 17:40:38 +02:00
CNugteren | c32c4a9739 | Added infrastructure for packed matrices | 2015-09-19 17:37:42 +02:00
CNugteren | aebd156869 | Added the HBMV routine | 2015-09-19 11:11:34 +02:00
CNugteren | 93dddda63e | Improved the organization and performance of level 2 routines | 2015-09-18 17:46:41 +02:00
CNugteren | 4507ba4997 | Added first version of banded matrix-vector multiplication | 2015-09-18 15:25:20 +02:00
Cedric Nugteren | 42db8ea968 | Merge pull request #26 from CNugteren/routine_definitions (Generated API interface and implementations) | 2015-09-18 10:23:16 +02:00
CNugteren | 4796c9bcbd | Added generated main functions for correctness/performance tests for level 2 routines | 2015-09-18 10:19:03 +02:00
CNugteren | 6105ad6f5b | Added interface of all level 2 routines | 2015-09-17 17:05:45 +02:00
CNugteren | 6307d2e5db | Added script to generate API interface and implementation automatically | 2015-09-17 10:14:33 +02:00
CNugteren | 1c24210026 | Made Travis always build pushes to the master branch | 2015-09-14 17:16:31 +02:00
Cedric Nugteren | a2b773573d | Merge pull request #25 from CNugteren/level1_routines (Added several level 1 routines) | 2015-09-14 17:12:23 +02:00
CNugteren | 224c967584 | Removed routines from the table which are not supported by clBLAS | 2015-09-14 17:02:33 +02:00
CNugteren | a2e726d3bd | Added xDOT/xDOTU/xDOTC dot-product routines | 2015-09-14 16:57:00 +02:00
CNugteren | 2a383f3450 | Added extra temporary buffer to tuners in preparation of Xdot routines | 2015-09-14 15:53:34 +02:00
CNugteren | e0c5312abb | Added support for the dot buffer and offset argument | 2015-09-14 12:28:50 +02:00
CNugteren | b0b81deae1 | Minor update of options-printing syntax | 2015-08-24 07:38:20 +02:00
CNugteren | ff0c54c386 | Added the XSWAP, XSCAL and XCOPY level-1 routines | 2015-08-22 17:11:20 +02:00
CNugteren | 75517353d5 | Re-organized level1 xaxpy kernel | 2015-08-22 14:33:48 +02:00
CNugteren | 70ba7c83d4 | Prepared the changelog for the next release | 2015-08-22 12:50:26 +02:00
CNugteren | 74f601794d | Updated to version 0.4.0 | 2015-08-22 12:41:40 +02:00
CNugteren | ff1a670e88 | Updated the documentation | 2015-08-22 12:40:18 +02:00
CNugteren | 5f5d31754a | Added clblast prefix to binaries and added the alltests target | 2015-08-21 07:36:19 +02:00
Cedric Nugteren | cf168fca70 | Merge pull request #23 from CNugteren/tuner_database (Added initial version of a tuner-database) | 2015-08-20 08:38:18 +02:00
CNugteren | 15db2bcc20 | Added initial version of tuner-database Python script | 2015-08-20 08:30:51 +02:00
CNugteren | 07e393cce4 | Added target to run all tuners | 2015-08-19 19:35:56 +02:00
CNugteren | 798a3b6101 | Add check for supported precision to the tuners | 2015-08-19 19:35:08 +02:00
CNugteren | b46de22433 | Moved precision tester to utilities | 2015-08-19 19:34:29 +02:00
CNugteren | 8a02db0746 | Added precision to the JSON output | 2015-08-19 11:12:42 +02:00
CNugteren | cbd25bffea | Added hotfix 8eeb7f721f | 2015-08-19 11:12:16 +02:00
Cedric Nugteren | 85bd783e0d | Merge pull request #22 from CNugteren/travis (Added Travis continuous integration) | 2015-08-19 09:34:01 +02:00
CNugteren | e806bc1ff0 | Added Travis build-status to the README | 2015-08-19 09:29:54 +02:00
CNugteren | e239c5f852 | Now using apt-get directly in Travis | 2015-08-19 09:20:10 +02:00
CNugteren | 4f79d13d1d | Updated fglrx package in Travis | 2015-08-19 09:10:45 +02:00
CNugteren | 8d8dcda5bf | Added OpenCL and Clang to travis | 2015-08-19 09:06:59 +02:00
CNugteren | 154b611546 | Added GCC 4.8 and updated CMake | 2015-08-18 08:25:32 +02:00
CNugteren | ad4aade5d5 | Added initial .travis.yml file | 2015-08-18 07:22:27 +02:00
Cedric Nugteren | 4f6e42d052 | Merge pull request #21 from CNugteren/c_api (Added a plain C API) | 2015-08-13 18:02:03 +02:00
CNugteren | 4242f90215 | Added the plain C API | 2015-08-13 18:00:09 +02:00
CNugteren | 603e389545 | Added all supported routines to the C API | 2015-08-13 17:58:46 +02:00
CNugteren | 8eeb7f721f | Fixed a complex data-type bug in the transpose kernel | 2015-08-13 14:33:42 +02:00
CNugteren | a6c104ef20 | Added SGEMM example using the C API | 2015-08-13 13:47:15 +02:00
CNugteren | 8617195ac5 | Added initial version of C API with just one routine | 2015-08-13 13:46:13 +02:00
CNugteren | f85d44f602 | Added argument m,n,k metadata to JSON files | 2015-08-13 08:33:04 +02:00
CNugteren | dbdb58c600 | Refactored the tuners, added JSON output | 2015-08-09 15:50:41 +02:00
Cedric Nugteren | e4aa4519c2 | Merge pull request #19 from CNugteren/basic_level2_routines (Level-2 routines: HEMV and SYMV) | 2015-08-04 08:19:42 +02:00
CNugteren | 75b4d92ac3 | Added distinguished names for GEMV inherited HEMV/SYMV | 2015-08-04 08:15:39 +02:00
CNugteren | d1a7cf18ec | Abstracted loading of matrix A for GEMV kernel | 2015-08-03 07:37:14 +02:00