Cedric Nugteren
280698d076
Merge pull request #117 from intelfx/exceptions
Convert to use C++ exceptions internally
2016-10-22 15:05:12 +02:00
Ivan Shapovalov
b98af44fcf
treewide: use C++ exceptions properly
Since the codebase is designed around proper C++ idioms such as RAII, it
makes sense to only use C++ exceptions internally instead of mixing
exceptions and error codes. The exceptions are now caught at top level
to preserve compatibility with the existing error code-based API.
Note that we deliberately do not catch C++ runtime errors (such as
`std::bad_alloc`) nor logic errors (aka failed assertions) because no
actual handling can ever happen for such errors.
However, in the C interface we do catch _all_ exceptions (...) and
convert them into a wild-card error code.
2016-10-22 08:45:25 +03:00
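The layering described in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: `StatusCode`, `LibraryError`, `ScaleInternal`, `Scale`, and `scale_c` are hypothetical names, and the numeric codes (including the wild-card value) are assumptions made for the example.

```cpp
#include <cstddef>
#include <exception>
#include <string>
#include <utility>

// Hypothetical status codes; the library's real values are not shown in this log.
enum class StatusCode { kSuccess = 0, kInvalidArgument = -1, kUnexpectedError = -999 };

// Hypothetical internal exception type that carries the status code to report.
class LibraryError : public std::exception {
 public:
  LibraryError(StatusCode status, std::string message)
      : status_(status), message_(std::move(message)) {}
  StatusCode status() const noexcept { return status_; }
  const char *what() const noexcept override { return message_.c_str(); }
 private:
  StatusCode status_;
  std::string message_;
};

// Internal code signals failure by throwing, never by returning an error code.
void ScaleInternal(std::size_t n, float *x, float alpha) {
  if (x == nullptr) {
    throw LibraryError(StatusCode::kInvalidArgument, "x must not be null");
  }
  for (std::size_t i = 0; i < n; ++i) { x[i] *= alpha; }
}

// Top-level C++ entry point: keeps the error-code-based API by converting
// the library's own exceptions. Anything else (e.g. std::bad_alloc) is
// deliberately left to propagate to the caller.
StatusCode Scale(std::size_t n, float *x, float alpha) {
  try {
    ScaleInternal(n, x, alpha);
    return StatusCode::kSuccess;
  } catch (const LibraryError &e) {
    return e.status();
  }
}

// C entry point: exceptions must not cross a C boundary, so every exception
// is caught and mapped onto a wild-card error code.
extern "C" int scale_c(std::size_t n, float *x, float alpha) {
  try {
    return static_cast<int>(Scale(n, x, alpha));
  } catch (...) {
    return static_cast<int>(StatusCode::kUnexpectedError);
  }
}
```

In this sketch a caller of `Scale` or `scale_c` only ever sees status codes for library-level failures, while the internal routine stays free of error-code plumbing and can rely on RAII for cleanup.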
Cedric Nugteren
597974b40d
Merge pull request #118 from matze/add-pkg-config
Generate and install pkg-config description
2016-10-21 21:00:07 +02:00
Matthias Vogelgesang
3797d144cc
Generate and install pkg-config description
2016-10-21 09:38:25 +02:00
Cedric Nugteren
c8d0e41e84
Added the possibility to supply the env-variable CLBLAST_TEST_ARGUMENTS to specify options for the make alltest or ctest targets
2016-10-20 23:05:16 +02:00
Cedric Nugteren
53deed298f
Added documentation and minor refactoring for the recent support of static library compilation
2016-10-15 17:11:08 +02:00
Shehzan Mohammed
0d958bf3b3
Fixes for static lib compilation on Windows
2016-10-14 18:45:34 -04:00
Cedric Nugteren
c0482ace6c
Fixed a bug where clblas.h couldn't be found for the performance tests (clients)
2016-10-14 22:11:35 +02:00
Cedric Nugteren
3386ad49c4
Set proper flags for the verbose mode (debug flags)
2016-10-14 20:54:05 +02:00
Cedric Nugteren
99a620f9a1
Merge pull request #112 from shehzan10/static
Add option to build shared or static library
2016-10-14 10:06:44 +02:00
Shehzan Mohammed
56f07e42b1
Add option to build shared or static library
2016-10-13 12:03:44 -04:00
Cedric Nugteren
a9d35cf04c
Merge branch 'development' into gemm_direct
2016-10-01 13:45:08 +02:00
Anton Lokhmotov
c484bb26b6
Use cross-platform thread lib idiom instead of *nix-specific pthread.
2016-09-26 21:04:28 +00:00
Anton Lokhmotov
c20a5bb7ca
Link clBLAS together with pthread.
2016-09-26 10:30:18 +00:00
Cedric Nugteren
73d135c2ce
Added a first version of a tuner for the GEMM direct kernel; collapsed MWGD, NWGD and KWGD into one WGD parameter
2016-09-25 14:48:34 +02:00
Anton Lokhmotov
750f185ba9
Add path to ref library header when building tests.
2016-09-24 11:46:34 +00:00
Cedric Nugteren
4b94afda94
Updated to version 0.9.0
2016-09-13 19:20:39 +02:00
Cedric Nugteren
48ab0428cb
Renamed the DEFAULT_DEVICE and DEFAULT_PLATFORM env variables to be in line with recent usages of CLBLAST_DEVICE and CLBLAST_PLATFORM
2016-09-13 19:08:49 +02:00
Ivan Shapovalov
9095537a6a
CMakeLists.txt: use -Wno-ignored-attributes to silence unfixable warnings
2016-09-13 16:12:30 +03:00
Cedric Nugteren
35623cd98d
Minor update regarding the previous CMake export/install target changes
2016-07-28 20:45:09 +02:00
Ivan Shapovalov
b5d7b58393
CMakeLists.txt: use target_include_directories()
2016-07-28 19:09:29 +03:00
Ivan Shapovalov
570cbcffa7
CMakeLists.txt: provide a find_package() config for dependent projects
2016-07-28 19:09:29 +03:00
Ivan Shapovalov
a1d80e7402
CMakeLists.txt: use ${clblast_SOURCE_DIR} instead of ${CMAKE_SOURCE_DIR}
2016-07-22 11:15:52 +03:00
Cedric Nugteren
27854070b4
Added a VERBOSE mode to debug performance: now prints details about compilation and kernel execution to screen
2016-07-06 21:50:12 +02:00
CNugteren
2d665099ef
Fixed a linking issue with the tuners on Visual Studio
2016-07-04 19:46:14 +02:00
Cedric Nugteren
b330ab0866
Added declspec(dllexport) to ClearCache and FillCache, and added declspec(dllimport) when not building the library
2016-06-30 10:49:17 +02:00
Cedric Nugteren
577f0ee117
Updated to version 0.8.0
2016-06-28 21:32:00 +02:00
CNugteren
871b576c06
Made it possible to build the clients and tests on Windows using Visual Studio
2016-06-28 16:38:45 +02:00
Cedric Nugteren
ca386f9883
Added fp16 to the alltuners target
2016-06-27 11:46:33 +02:00
Cedric Nugteren
61203453aa
Renamed all C++ source files to .cpp to match the .hpp extension better
2016-06-19 13:55:49 +02:00
Cedric Nugteren
f726fbdc9f
Moved all headers into the source tree, changed headers to .hpp extension
2016-06-18 20:20:13 +02:00
Cedric Nugteren
bacb5d2bb2
Clean-up of the routine class, moved RunKernel to the routine/common file
2016-06-18 18:16:14 +02:00
Cedric Nugteren
52ccaf5b25
Added XOMATCOPY routines to perform out-of-place matrix scaling, copying, and/or transposing
2016-06-16 18:07:46 +02:00
Cedric Nugteren
b894611ad1
Re-organised the level-3 supporting kernels (copy, pad, transpose, convert) and renamed files and functions appropriately
2016-06-14 18:17:58 +02:00
Cedric Nugteren
6d6b030053
Made the CPU BLAS library the default reference to test against in favor of clBLAS
2016-06-08 09:21:39 +02:00
Cedric Nugteren
7a7873d552
Fixed the RPATH settings for linking on OS X
2016-06-06 13:40:52 +02:00
Cedric Nugteren
983df6a8b4
Made use of CMake's built-in unit testing, allowing all tests to be run using 'make test'
2016-05-31 20:53:55 +02:00
Cedric Nugteren
305bf16c4c
Separated the performance tests (clients) from the correctness tests in CMake
2016-05-30 16:38:26 +02:00
Cedric Nugteren
489c5d76cf
Merged in latest changes from 0.7.1 release
2016-05-18 21:32:56 +02:00
Cedric Nugteren
591e343ec9
Added an example of using the half-precision HAXPY routine
2016-05-15 20:18:34 +02:00
Cedric Nugteren
4b6bdd83a2
Added header with conversions from and to half-precision floating-point
2016-05-15 20:13:57 +02:00
Cedric Nugteren
c5730c8b43
Updated to version 0.7.0
2016-05-08 20:29:41 +02:00
Cedric Nugteren
2952390f27
Added an example to demonstrate the use of the ClearCache and FillCache functions
2016-04-29 23:33:36 +02:00
Cedric Nugteren
4f528b1730
Added sample C programs for the SASUM and DGEMV routines
2016-04-29 20:33:19 +02:00
Cedric Nugteren
82be8f211c
Moved all cache-related functions to a separate file; added a ClearCompiledProgramCache function to clear the cache
2016-04-27 16:02:13 +02:00
cnugteren
16a048f1ac
Added support for the iSAMAX/iDAMAX/iCAMAX/iZAMAX routines
2016-04-20 22:12:51 -06:00
cnugteren
8be99de82d
Added support for the SASUM/DASUM/ScASUM/DzASUM routines
2016-04-14 19:58:26 -06:00
cnugteren
c2cfee76c4
Properly set warning flags for Clang
2016-04-04 08:39:13 -07:00
cnugteren
1a82861a90
Added support for testing (performance and correctness) against a CPU BLAS library
2016-04-02 11:58:00 -07:00
cnugteren
a2056f2216
Create a first version of CPU BLAS detection in CMake
2016-03-31 22:22:29 -07:00