Commit graph

1015 commits

Author SHA1 Message Date
Cedric Nugteren 6e95752054 Merge pull request #184 from CNugteren/im_to_col
im2col
2017-08-30 19:17:17 +02:00
Cedric Nugteren 161fd8514d Merge branch 'master' into im_to_col 2017-08-24 21:15:14 +02:00
Cedric Nugteren 4d9d03ba51 Completed im2col implementation 2017-08-24 21:11:12 +02:00
Cedric Nugteren a8c26594d9 Made the im2col client properly handle the arguments 2017-08-23 19:54:09 +02:00
Cedric Nugteren da28cc5e93 Minor updates after merging in the PSO addition to the tuners 2017-08-21 20:14:02 +02:00
Cedric Nugteren e5eb6b1d3a Merge pull request #173 from mcian/PSO_params
Add PSO parameters support and search strategy selection from command…
2017-08-21 20:06:29 +02:00
mcian dfd332524a Remove multistrategy and related functions 2017-08-21 14:09:11 +02:00
Cedric Nugteren 803ca781f9 First version of im2col kernel, unoptimized but working 2017-08-19 18:25:13 +02:00
Cedric Nugteren 132e62892d Implemented proper im2col reference function and completd tests 2017-08-19 16:55:09 +02:00
Cedric Nugteren 777681dcbd Merge branch 'master' into im_to_col 2017-08-12 20:50:00 +02:00
Cedric Nugteren d67fd6604b Merge pull request #182 from CNugteren/compilation_improvements
Compilation improvements
2017-08-12 17:17:10 +02:00
Cedric Nugteren d30c459c5f Fixed .hpp -> .h typo in CMakeLists 2017-08-12 16:11:23 +02:00
Cedric Nugteren f6b6d7ef4b Properly set the common test utilities in the CMake files 2017-08-12 16:07:28 +02:00
Cedric Nugteren 0a63621579 Moved functions from the header to the .cpp file to prevent compiling the same code multiple times 2017-08-12 15:59:14 +02:00
Cedric Nugteren 844e68853e Moved some utility functions to a test-specific utility compilation-unit 2017-08-12 15:38:17 +02:00
Cedric Nugteren d588f28dbe Updated CMakeLists to include header files such that IDEs can locate them 2017-08-11 21:20:40 +02:00
mcian 4adee60884 Revert the xgemm strategy to default. If user wants to use multistrategy can simple call the function TestHeuristic from the main 2017-08-09 16:58:46 +02:00
mcian 0b4aa109f8 Use cltune::SearchMethod enum instead of int values 2017-08-09 16:05:25 +02:00
Cedric Nugteren eb896838b1 Updated to version 1.0.1 (bugfix release) 2017-08-08 20:35:49 +02:00
Cedric Nugteren 27ab3deee7 Merge pull request #180 from mcian/restore_direct_GEMM
Restore direct gemm
2017-07-31 21:46:47 +02:00
mcian 99afdcd908 Restore direct GEMM to previous version 2017-07-31 14:06:23 +02:00
mcian dc49906542 Merge pull request #1 from CNugteren/master
fork up to date
2017-07-31 14:02:12 +02:00
Cedric Nugteren 1155c068e9 Updated to version 1.0.0 2017-07-30 20:54:21 +02:00
Cedric Nugteren ae068771da Fixes for Travis automatic deployment 2017-07-30 18:53:26 +02:00
Cedric Nugteren b494df1111 Fixes warnings for Clang & AppleClang 2017-07-30 18:52:20 +02:00
Cedric Nugteren 6ceb9b7152 Fixes to AppVeyor and Travis scripts 2017-07-30 18:34:39 +02:00
Cedric Nugteren e6f938e0e9 Improved deployment procedure of automatic builds 2017-07-30 18:19:46 +02:00
Cedric Nugteren 18d832e149 Added tuning results for the Qualcomm Adreno 330 GPU 2017-07-30 18:18:02 +02:00
Cedric Nugteren 0ea16a0e63 Minor optimization for the direct GEMM kernel: don't ceil m and n unnecessarily high 2017-07-25 20:53:12 +02:00
Cedric Nugteren b7473f50df Added status badges for correctness tests; updated list of contributors; fixed minor typos 2017-07-24 20:14:47 +02:00
Cedric Nugteren 55861c40ff Merge branch 'relax_gemmbatched_ld_requirements' 2017-07-23 21:04:17 +02:00
mcian 473e814718 Code refactoring 2017-07-23 14:48:13 +02:00
Cedric Nugteren 2d52f9b1d3 Merge pull request #176 from CNugteren/inline_keyword_optional
Made the inline keyword in kernels optional
2017-07-22 10:44:08 +02:00
Cedric Nugteren 1b4959a16a Merge pull request #175 from mcian/Arm_Threshold
Add new threshold for ARM
2017-07-19 19:27:32 +02:00
mcian a36283aaec Add new threshold for ARM 2017-07-17 12:20:46 +02:00
mcian 8131e68664 Add PSO parameters support and search strategy selection from command line 2017-07-17 12:00:25 +02:00
Cedric Nugteren 97bcf77d4b First step towards supporting im2col in the test infrastructure 2017-07-16 22:33:49 +02:00
Cedric Nugteren de9ed9d4ea Fixed batched tests when testing for invalid sizes against clBLAS 2017-07-12 21:54:16 +02:00
Cedric Nugteren f77b48692b Relaxed requirement on a_ld and b_ld for batched GEMM 2017-07-12 21:53:39 +02:00
Cedric Nugteren f2477f6636 Removed spurious warning for Clang < 3.9 2017-07-12 20:58:31 +02:00
Cedric Nugteren c71362b13d Merge pull request #172 from CNugteren/msvc_improvements
Windows & MSVC improvements
2017-07-09 21:14:52 +02:00
Cedric Nugteren d4c8a7c8b0 Changed printf-statements with %zu into std::cout to fix MSVC 2013 compatibility 2017-07-09 20:19:08 +02:00
Cedric Nugteren 4b415bdf3c Disabled UNIX-style terminal color printing under Windows 2017-07-09 20:04:13 +02:00
Cedric Nugteren 442c31dd50 Made the inline keyword in kernels optional currently only enabled for NVIDIA and ARM GPUs 2017-07-08 17:12:16 +02:00
Cedric Nugteren 84ec50e29d Added interface and stubs for the im2col routine 2017-07-02 12:10:22 +02:00
Cedric Nugteren 75c0e861b8 Merge branch 'gemm_direct_bug' 2017-07-01 14:44:29 +02:00
Cedric Nugteren 4cf516cfec Fixed an if-statement in the direct GEMM kernel causing a bug with specific sets of input parameters 2017-06-30 21:57:41 +02:00
Cedric Nugteren 52881f3864 Added batched GEMM example program 2017-06-29 21:15:25 +02:00
Cedric Nugteren 4e51b1e1f8 Moved and inlined some static member variables and disabled spurious clang warnings 2017-06-27 21:05:16 +02:00
Cedric Nugteren e60b10529a Undo of earlier move of TestBlas::kTransposes constant to fix MSVC 2013 compilation 2017-06-27 20:59:28 +02:00