How can I do this? Currently when my global work set gets above a certain size (10000 bodies in my Barnes-Hut nbody sim) my entire PC freezes up. Only the mouse moves, for the duration of the entire iteration (i.e. appears as if all 10000 work items are processed, with no other GPU activity allowed, apparently). Is there any way to allow other GPU operations to be interleaved with my work? Do I have to break up my NDExecuteKernel into a few separate calls? I am leaving work group organisation to OpenCL as it is too much of a pain to do it myself, not sure if maybe this has something to do with it?
OS = Win 7
GPU = AMD