I am confuse with a problem, I want to know if we can do the following:
I have an algorithm and it should run in parallel:
if we have and "array a" , and for each element in this array it will execute the kernel code, - its ok for now- , but if I have 3 different data for "array a", how we can run 3 set of a in parallel, so here we have two level of parallel:
1.element of a are executing code in parallel
2. 3 different a (run a on three different data set)
I hope its clear.