TY - JOUR AU - Izydorczyk, Jacek AB - The goal of this article is to overview the challenges and problems on the way from the state‐of‐the‐art CUDA accelerated neural network code to multi‐GPU code. For this purpose, the authors describe the journey of porting that existing in GitHub, a fully featured CUDA‐accelerated Darknet engine, to OpenCL. This article presents the lessons learned and the techniques that were put in place for this porting. There are few other implementations on GitHub that leverage the OpenCL standard, and a few have tried to port Darknet as well. Darknet is a well‐known convolutional neural network (CNN) framework. The authors of this article investigated all aspects of porting and achieved a fully featured Darknet engine on OpenCL. The effort was focused not only on classification using YOLO1, YOLO2, YOLO3, and YOLO4 CNN models. Other aspects were also covered, such as training neural networks and benchmarks to identify weak points in the implementation. Compared with the standard CPU version, the GPU computing code substantially improves the Darknet computing time by using underutilized hardware in existing systems. If the system is OpenCL‐based, it is practically hardware‐independent. The authors also improved the CUDA version as Darknet‐vNext. TI - Darknet on OpenCL: A multiplatform tool for object detection and classification JF - Concurrency and Computation: Practice and Experience DO - 10.1002/cpe.6936 DA - 2022-07-10 UR - https://www.deepdyve.com/lp/wiley/darknet-on-opencl-a-multiplatform-tool-for-object-detection-and-siyl2890Ix VL - 34 IS - 15 DP - DeepDyve ER -