OpenCL Tutorial - Shared Memory Kernel Optimization

MacResearch has posted part 6 in their series of OpenCL tutorials. In this episode, a real-world code that has been parallelized by porting to the GPU. The use of shared memory to improve performance is covered as well as a discussion of synchronization points for coordinated work within a work-group. Source code is provided.