2017-10-20 18:21:31 +02:00
CLBlast feature road-map
================
This file gives an overview of the main features planned for addition to CLBlast. A first-order indication time-frame for development time is provided:
2017-12-10 16:08:06 +01:00
| Issue# | When | Who | Status | What |
| ---------------------------------------------------------------|-------------|-----------|--------|---------------|
| - | Oct '17 | CNugteren | ✔ | CUDA API for CLBlast |
| [#169 ](https://github.com/CNugteren/CLBlast/issues/169 ) & #195 | Oct-Nov '17 | CNugteren | ✔ | Auto-tuning the kernel selection parameter |
| [#181 ](https://github.com/CNugteren/CLBlast/issues/181 ) & #201 | Nov '17 | CNugteren | ✔ | Compilation for Android and testing on a device |
| - | Nov '17 | CNugteren | ✔ | Integration of CLTune for easy testing on Android / fewer dependencies |
| [#128 ](https://github.com/CNugteren/CLBlast/issues/128 ) & #205 | Nov-Dec '17 | CNugteren | ✔ | Pre-processor for loop unrolling and array-to-register-promotion for e.g. ARM Mali |
2018-01-03 20:38:48 +01:00
| [#207 ](https://github.com/CNugteren/CLBlast/issues/207 ) | Dec '17 | CNugteren | ✔ | Tuning of the TRSM/TRSV routines |
| [#195 ](https://github.com/CNugteren/CLBlast/issues/195 ) | Jan '18 | CNugteren | ✔ | Extra GEMM API with pre-allocated temporary buffer |
2018-01-29 20:45:21 +01:00
| [#95 ](https://github.com/CNugteren/CLBlast/issues/95 ) & #237 | Jan '18 | CNugteren | ✔ | Implement strided batch GEMM |
2018-02-02 21:18:37 +01:00
| [#224 ](https://github.com/CNugteren/CLBlast/issues/224 ) | Jan-Feb '18 | CNugteren | ✔ | Implement Hadamard product (element-wise vector-vector product) |
2018-02-24 20:46:14 +01:00
| [#233 ](https://github.com/CNugteren/CLBlast/issues/233 ) | Feb '18 | CNugteren | ✔ | Add CLBlast to common package managers |
2018-02-18 18:01:26 +01:00
| [#223 ](https://github.com/CNugteren/CLBlast/issues/223 ) | Feb '18 | CNugteren | ✔ | Python OpenCL interface |
2018-03-10 14:52:40 +01:00
| [#237 ](https://github.com/CNugteren/CLBlast/issues/237 ) | Mar '18 | CNugteren | ✔ | Making tuning possible from the CLBlast API |
2018-04-29 15:06:33 +02:00
| [#228 ](https://github.com/CNugteren/CLBlast/issues/228 ) | Mar-Apr '18 | CNugteren | ✔ | Improving performance for Qualcomm Adreno GPUs |
2018-06-03 13:18:05 +02:00
| [#267 ](https://github.com/CNugteren/CLBlast/issues/267 ) | July '18 | CNugteren | | Merge im2col and GEMM into a direct kernel |
| [#270 ](https://github.com/CNugteren/CLBlast/issues/270 ) | Aug '18 | CNugteren | | Implement col2im |
| - | Aug '18 | CNugteren | | Add a SYCL interface to the library |
2018-04-29 15:06:33 +02:00
| [#136 ](https://github.com/CNugteren/CLBlast/issues/136 ) | ?? | CNugteren | | Implement xAXPBY and xSET |
2017-12-10 16:08:06 +01:00
| [#169 ](https://github.com/CNugteren/CLBlast/issues/169 ) | ?? | dividiti | | Problem-specific tuning parameter selection |