I am working on a school project (an implementation of Maximum Intensity Projection on the GPU using OpenCL) and have run into an interesting problem.
With CL_DEVICE_TYPE_CPU the kernel takes about 2-3 seconds and returns correct results.
That is a good result compared to the plain C++ program (~30 s). But when the same kernel is executed on the GPU, it takes 60 seconds (the result is still correct) and the computer seems to be frozen.
I have done some research and suspected incorrect memory usage, but in that case I would expect the same problem on the CPU, and it does not occur there.
Has anybody encountered a similar problem? How can it be solved?
Thanks a lot for any replies.
CPU: Intel Core i5-430M
GPU: AMD Mobility Radeon HD 5650 (m5650)
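
For readers unfamiliar with the technique: Maximum Intensity Projection keeps the brightest voxel along each ray through the volume. Below is a minimal C++ sketch of that idea, not the actual kernel or the C++ program from this project. It assumes an axis-aligned projection along z and a flat, x-fastest volume layout; the function name mipAlongZ and its parameters are made up for illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: axis-aligned MIP along the z axis.
// Assumed layout: voxel (x, y, z) lives at index (z * height + y) * width + x.
std::vector<std::uint8_t> mipAlongZ(const std::vector<std::uint8_t>& volume,
                                    std::size_t width,
                                    std::size_t height,
                                    std::size_t depth) {
    // One output pixel per (x, y) column, starting at the minimum intensity.
    std::vector<std::uint8_t> image(width * height, 0);
    for (std::size_t z = 0; z < depth; ++z) {
        for (std::size_t y = 0; y < height; ++y) {
            for (std::size_t x = 0; x < width; ++x) {
                const std::size_t pixel = y * width + x;
                const std::size_t voxel = (z * height + y) * width + x;
                // Keep the brightest value seen so far along this column.
                image[pixel] = std::max(image[pixel], volume[voxel]);
            }
        }
    }
    return image;
}
```

In an OpenCL version of this sketch, each work-item would typically handle one output pixel and loop over z itself, so the order in which neighbouring work-items read the volume could matter much more on the GPU than on the CPU.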