Opencl tagged news

Khronos Releases OpenCL 3.0.7 alongside growing OpenCL 3.0 Adoption at IWOCL Conference

Today, at the 9th International Workshop for OpenCL (IWOCL and SYCLcon 2021) Khronos has released the OpenCL 3.0.7 specifications, with a number of new extensions for extended bit-level operations, new queries for suggested local work group size and PCI device information, and enhanced SPIR-V support for debugging and linking. OpenCL 3.0 was released in September 2020 and Imagination, Intel and NVIDIA are already shipping conformant implementations. A Khronos IWOCL slide deck has more details on numerous recent updates and improvements to the OpenCL developer ecosystem.

Find out more about OpenCL this week at IWOCL and SYCLcon 2021 with a Panel Session including OpenCL Working Group members together with Microsoft and Silhouette FX on Wednesday April 28th. On Tuesday April 27, Brice Videau from Argonne National Lab will present a tutorial on OpenCL SDK Layers. Free Registration is now open for this virtual event.

Khronos & VeriSilicon Technical Tutorial and Workshop - April 22-23, 2021

The Khronos Group and VeriSilicon are holding a joint Technical Tutorial and Workshop in Shanghai on April 22 & 23rd. The first day will be a virtual event and will include an overview of the Khronos Group and then deep dive into Vulkan and Vulkan Ray Tracing. On day 2, which will be onsite in Shanghai, the workshop will focus on OpenXR and parallel processing, vision acceleration and inferencing. Be sure to check out the event’s page for more information and register.

NVIDIA is Now OpenCL 3.0 Conformant

Today NVIDIA announced fully conformant OpenCL 3.0 for Windows and Linux on Maxwell and later GPUs. Existing OpenCL 1.x based applications will continue to work with NVIDIA’s OpenCL 3.0 drivers without any changes. In addition to full OpenCL 1.2 compatibility, NVIDIA’s OpenCL 3.0 drivers now deliver significant optional OpenCL 3.0 functionality. Developers can try out the R465 drivers with OpenCL 3.0 today.

9th International Workshop on OpenCL and SYCL

Join us online at the 9th International Workshop on OpenCL, including SYCLcon 2021, for four days of talks April 26-29,2021. There will be workshops and community networking aimed at furthering the collaboration and knowledge sharing amongst the international community of high-performance computing specialist working with OpenCL, SYCL, SPIR and Vulkan Compute. The event provides a rich mix of hands-on tutorials, technical presentations, research papers, posters, panel discussions, networking and vendor discussions. It also provides a formal channel for community feedback to the Khronos Group, the industry body responsible for the standards.

IWOCL & SYCLcon is the premier workshop of leading academic and industrial experts to present, discuss and learn about applying OpenCL and SYCL addressing issues faced in High Performance Computing across a wide range of application domains. This is an excellent opportunity to contribute and participate in this workshop through a paper, talk, special session / tutorial, or poster. This workshop will include invited presentations from academia and industry, and a panel discussion of leading experts in the field.

Deadline for submissions is January 15th, so don’t delay. Submit your proposed content today.

OpenCL Rolls Out Maintenance Release and C++ for OpenCL Documentation

OpenCL Rolls Out Maintenance Release and C++ for OpenCL Documentation

Today Khronos released v3.0.6 of the OpenCL Specifications. This is a regular maintenance release with bug fixes and clarifications, an updated address spaces section, new extensions for additional subgroup functions, and an extension for enhanced platform and device version queries. Also, documentation for the C++ for OpenCL V1.0 kernel language is now downloadable from an OpenCL-Docs GitHub repository tag, describing how the language combines C++17 functionality with familiar OpenCL kernel language paradigms. An extension for online compilation of C++ for OpenCL kernels was published earlier this year and offline compilation of C++ for OpenCL kernels has been supported by clang release 9.0 onwards.

PoCL is a portable open source (MIT-licensed) implementation of the OpenCL standard (1.2 with some 2.0 features supported). In addition to being an easily portable multi-device (truely heterogeneous) open-source OpenCL implementation, a major goal of this project is improving interoperability of diversity of OpenCL-capable devices by integrating them to a single centrally orchestrated platform. Also one of the key goals longer term is to enhance performance portability of OpenCL programs across device types utilizing runtime and compiler techniques.

Version 1.6 release Highlights: Support for Clang/LLVM 11.0, improved CUDA performance and features, improved PowerPC support and enhanced OpenCL debugging usage.

OpenMP 5.1 Released With Better Interoperability For CUDA / AMD HIP / OpenCL

It’s been two years already since the release of the OpenMP 5.0 specification and the update released on Friday is quite a worthy update:

OpenMP 5.1 introduces a new interop construct for improving interoperability with non-OpenMP device execution contexts. This aims to improve the portability of OpenMP 5.1+ to non-native interfaces/accelerators. This interop construct is designed with NVIDIA CUDA, AMD ROCm/HIP, and OpenCL in mind. The interop construct is used for dealing with interoperability properties for one or more “foreign runtime environments”.

NSITEXE, Kyoto Microcomputer and Codeplay Software are bringing open standards programming to RISC-V Vector processor for HPC and AI systems

Codeplay Software Ltd, pioneers in enabling acceleration technologies, announced today that software developers working on HPC and AI for embedded systems will be able to take advantage of industry defined open standards from The Khronos Group on RISC-V architectures, thanks to Japan’s New Energy and Industrial Technology Development Organisation (“NEDO”) project in which NSITEXE and Kyoto Microcomputer Co., Ltd. (“KMC”) are participating.

NSITEXE and KMC have ordered an implementation of LLVM for RISC-V Vector Extension Processor (“RVV”), and also Codeplay’s ComputeAorta™ and ComputeCpp™, efficient and high performance implementations of OpenCL and SYCL open standards. In the NEDO project, as a research, NSITEXE develops OpenCL and SYCL compilers from LLVM to utilize RVV, and KMC implements vector syntax to utilize RVV efficiently based on LLVM and Clang. These research developments will contribute to RISC-V community to support open-standard technologies.

Tutorial: Making the most of Arm NN for GPU inference: OpenCL Tuner

Arm NN is an open-source inference engine for CPUs, GPUs and NPUs. It bridges the gap between existing NN frameworks and the underlying IP. Arm NN is built on top of the Arm Compute Library (ACL). This contains a collection of highly optimized low-level functions to accelerate inference on the Arm Cortex-A family of CPU processors and the Arm Mali family of GPUs. For GPUs, ACL uses OpenCL as its compute API. The OpenCL memory model closely maps to the GPU architecture making it possible to implement optimizations that significantly reduce the accessing of global memory. Read on to learn how.

Intel Releases OpenCL Intercept Layer 3.0

The Intel OpenCL Intercept Layer is one of the company’s efforts around helping to improve debugging and analyzing of OpenCL application performance. This cross-platform layer intercepts the OpenCL API calls through the OpenCL ICD loader to analyze/debug CL applications. With the OpenCL Intercept Layer 3.0 release, it has full support for tracing all OpenCL 3.0 APIs. The update also allows for tracing more vendor-specific CL extensions, proper handling of extension APIs from multiple platforms, emulated support for unified shader memory via shared virtual memory, and a number of other enhancements including bug fixes and performance improvements.

On Github:

We would like to hear from you. If you have feedback, please leave a comment on the OpenCL 3.0 blog post: