Search:

Type: Posts; User: steveStevens

Search: Search took 0.00 seconds.

  1. Releasing Memory, Kernels, Devices etc

    I thought I read somewhere, (though, for the life of me, I can't find the source), that, using the C++ API you don't have to release devices/kernels/memory like w/ the C API as the destructors for...
  2. Replies
    2
    Views
    1,010

    I guess some pictures would help. Here is exactly...

    I guess some pictures would help. Here is exactly what is happening in both programs.

    1) generate gaussian vector
    2) zero pad gaussian vector to next next highest power of 2 length
    3) forward...
  3. Replies
    2
    Views
    1,010

    As a follow up, I'm also wondering if maybe the...

    As a follow up, I'm also wondering if maybe the problem is the number of intra-kernel operations I've got going on is too many for just using global memory and registers, (is that even a thing? a...
  4. Replies
    2
    Views
    1,010

    Enqueue/Finish Scheme for FFT

    This is likely the first part of 2 posts related to some trouble I have involving an FFT signal cross correlation module I'm creating, (makes use of circular convolution theorem, etc etc). I'd like...
  5. Re: enequeueNDRangeKernel - parallel execution on OpenCL dev

    I can't seem to find this in the spec, but does cl::CommandQueue::finish() also perform the functionality of flush()?

    I'm guessing I can do away with the event vector altogether and just have 3,...
  6. Re: enequeueNDRangeKernel - parallel execution on OpenCL dev

    I just did some more investigating, and came up with the following:

    std::vector<cl::CommandQueue> deviceQueues;
    std::vector<cl::Event> eventVector;

    // Global Range:
    cl::NDRange...
  7. Re: enequeueNDRangeKernel - parallel execution on OpenCL dev

    So I've been trying to allocate this stuff dynamically like so:

    std::vector<cl::CommandQueue> deviceQueues;

    cl::NDRange globalRange(d2 /nDevices);

    cl::NDRange localRange(LOOP_UNROLL);
    ...
  8. Re: enequeueNDRangeKernel - parallel execution on OpenCL dev

    Gorgeous response, thank you.
  9. enequeueNDRangeKernel - parallel execution on OpenCL device?

    say I have n openCL devices, and that the data, of size d2 has been partitioned into sections such that it complements compute topology, memory buffers have been allocated, etc.

    Given something...
Results 1 to 9 of 11