Search:

Type: Posts; User: steveStevens

  1. Tesla C2050 - OpenCL - Kernel Concurrency Issue

    Hi All,

    This problem has some complex background so I will attempt to abstract as much as possible. I'm posting here as well as on the OpenCL forums because my problems are occurring with use of...
  2. Releasing Memory, Kernels, Devices etc

    I thought I read somewhere (though, for the life of me, I can't find the source) that with the C++ API you don't have to release devices/kernels/memory like you do with the C API, as the destructors for...
  3. Replies
    2
    Views
    1,173

    I guess some pictures would help. Here is exactly what is happening in both programs.

    1) generate gaussian vector
    2) zero pad Gaussian vector to the next highest power of 2 length
    3) forward...
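
    Steps 1 and 2 above could be sketched on the host side roughly as follows. This is only an illustration, not the code from the original post: the helper names `nextPowerOfTwo`, `makeGaussian`, and `zeroPad` are hypothetical, the Gaussian is assumed to be centered in the vector, and a length that is already a power of 2 is assumed to stay unchanged.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: smallest power of two that is >= n (returns 1 for n == 0).
// Assumes n is small enough that the result fits in std::size_t.
std::size_t nextPowerOfTwo(std::size_t n) {
    std::size_t p = 1;
    while (p < n) {
        p <<= 1;
    }
    return p;
}

// Hypothetical helper: sample exp(-x^2 / (2 * sigma^2)) with the peak
// at the middle of the vector (an assumption, not from the post).
std::vector<float> makeGaussian(std::size_t length, float sigma) {
    std::vector<float> v(length);
    const float center = (static_cast<float>(length) - 1.0f) / 2.0f;
    for (std::size_t i = 0; i < length; ++i) {
        const float x = static_cast<float>(i) - center;
        v[i] = std::exp(-(x * x) / (2.0f * sigma * sigma));
    }
    return v;
}

// Hypothetical helper: append zeros until the length is a power of two.
std::vector<float> zeroPad(std::vector<float> v) {
    v.resize(nextPowerOfTwo(v.size()), 0.0f);
    return v;
}
```

    For example, `zeroPad(makeGaussian(1000, 50.0f))` would give a 1024-element vector whose last 24 entries are zero, ready to hand to a radix-2 FFT.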
  4. Replies
    2
    Views
    1,173

    As a follow-up, I'm also wondering if maybe the problem is that I've got too many intra-kernel operations going on for just using global memory and registers, (is that even a thing? a...
  5. Replies
    2
    Views
    1,173

    Enqueue/Finish Scheme for FFT

    This is likely the first of 2 posts related to some trouble I'm having with an FFT signal cross-correlation module I'm creating (it makes use of the circular convolution theorem, etc.). I'd like...
  6. Re: enqueueNDRangeKernel - parallel execution on OpenCL dev

    I can't seem to find this in the spec, but does cl::CommandQueue::finish() also perform the functionality of flush()?

    I'm guessing I can do away with the event vector altogether and just have 3,...
  7. Re: enqueueNDRangeKernel - parallel execution on OpenCL dev

    I just did some more investigating, and came up with the following:

    std::vector<cl::CommandQueue> deviceQueues;
    std::vector<cl::Event> eventVector;

    // Global Range:
    cl::NDRange...
  8. Re: enqueueNDRangeKernel - parallel execution on OpenCL dev

    So I've been trying to allocate this stuff dynamically like so:

    std::vector<cl::CommandQueue> deviceQueues;

    cl::NDRange globalRange(d2 / nDevices);

    cl::NDRange localRange(LOOP_UNROLL);
    ...
  9. Re: enqueueNDRangeKernel - parallel execution on OpenCL dev

    Gorgeous response, thank you.
  10. enqueueNDRangeKernel - parallel execution on OpenCL device?

    Say I have n OpenCL devices, and that the data, of size d2, has been partitioned into sections that match the compute topology, memory buffers have been allocated, etc.

    Given something...
Results 1 to 10 of 12