Search:

Type: Posts; User: Dithermaster

Page 1 of 9 1 2 3 4

  1. Replies
    5
    Views
    158

    Of course. For example, any system header that...

    Of course. For example, any system header that includes file system access, system clock access, stdio access, etc. None of these can be accessed from the device.
  2. The benefit of shared local memory comes when many...

    The benefit of shared local memory comes when many work items in a work group need to access the same memory at different times (for example, a matrix multiply). If each of your work items accesses...
  3. Replies
    5
    Views
    158

    Something you are #include'ing is trying to...

    Something you are #include'ing is trying to #include stdarg.h, which is likely not compatible with OpenCL C99. Check your includes (and their includes) to find the culprit.
  4. Replies
    1
    Views
    169

    Intel has some tools which can measure CPU and...

    Intel has some tools which can measure CPU and GPU power usage. I'm not aware of anything off the top of my head for AMD or NVIDIA.
  5. Replies
    5
    Views
    237

    Is there even a use case for pipes on CPU or GPU...

    Is there even a use case for pipes on CPU or GPU devices (that is more efficient or less code than just using global memory or images between kernels), or do they exist just for FPGA devices?
  6. To clarify, never install OpenCL.dll in the...

    To clarify, never install OpenCL.dll in the system directory. If you install it in your application folder or bundle, that's less of a system-wrecking technical issue. Legally you might not have the...
  7. Do (2). It works great. Never do (4) or (5),...

    Do (2). It works great.

    Never do (4) or (5); only drivers should install the ICD, and you could hurt other applications if you do it wrong.
  8. Replies
    5
    Views
    257

    I've heard that pipes benefit OpenCL on FPGA...

    I've heard that pipes benefit OpenCL on FPGA since they fit better into the pipelined hardware nature of those devices. They also seem to have some benefit in dynamic parallelism for...
  9. Replies
    2
    Views
    249

    My understanding is that OpenCL 1.2 is available...

    My understanding is that OpenCL 1.2 is available for Tegra but you have to contact NVIDIA to get it.
  10. I've seen 1024,1,1 only for the Apple CPU device,...

    I've seen 1024,1,1 only for the Apple CPU device, so I agree with your guess that it was that device. Switch to the GPU device for better dimensions.
  11. Replies
    1
    Views
    382

    Because the runtime may choose to run some...

    Because the runtime may choose to run some workgroups to completion before starting others (when the number of workgroups far exceeds the hardware capabilities), there are therefore no global...
  12. On Windows, OpenCL.dll _is_ the ICD, but you...

    On Windows, OpenCL.dll _is_ the ICD, but you still don't want to ship it. It varies by version, for one thing (what if you ship a version 1.2 one, but the vendor driver updated the system ICD...
  13. You do NOT want to ship this DLL with your...

    You do NOT want to ship this DLL with your project. The one installed on the system is the one you want to use. What problem are you trying to solve?
  14. Replies
    2
    Views
    592

    clFlush can certainly block the CPU; it won't...

    clFlush can certainly block the CPU; it won't return until the command queue has completely been flushed to the hardware, and if the hardware queue is full, the CPU will block.

    Except for CL/GL...
  15. I answered on SO (before I saw this).
  16. That kernel looks like it was code-generated, not...

    That kernel looks like it was code-generated, not hand coded. In any case, one source of slowdown is that each work item reads 16 doubles from global memory. While they can be broadcast within each...
  17. The API is designed to be async -- all of the...

    The API is designed to be async -- all of the clEnqueue calls are designed to return quickly. The OpenCL driver uses a separate thread to push work to the GPU. So once you've queued up work to the...
  18. If you have an OpenCL driver for CPU installed...

    If you have an OpenCL driver for CPU installed then CL_DEVICE_TYPE_CPU devices appear, so yes, it is a useful flag to have.

    You might, for example, try for a GPU device, and only if one is not...
  19. Replies
    1
    Views
    717

    clBuildProgram is _required_ regardless of...

    clBuildProgram is _required_ regardless of whether you created the program using clCreateProgramWithSource or clCreateProgramWithBinary. The build will be faster when the program was created from a binary.
  20. Replies
    3
    Views
    1,948

    OpenCL C is based on C99, so if it is ill-defined...

    OpenCL C is based on C99, so if it is ill-defined in C99, it's ill-defined in OpenCL C.
  21. Replies
    3
    Views
    1,948

    No such limitation. You can do multiple reads and...

    No such limitation. You can do multiple reads and writes to global memory from within a kernel. You should go back and ask your past self what they meant in the comment.
  22. My cursory understanding is that it's up to the...

    My cursory understanding is that it's up to the vendor's driver and how it's implemented. From what I'm reading above, AMD's driver supports it. I think NVIDIA Tesla cards run in the non-graphics mode...
  23. > 256 is the work group size and 700 is the...

    > 256 is the work group size and 700 is the global size so it is evenly divisible.
    Um, no it's not. 256 goes into 768 but not 700.
    The common solution is to "round up" the global size to be an...
  24. You have old knowledge. Intel and AMD are both...

    You have old knowledge. Intel and AMD are both shipping OpenCL 2.0 drivers.

    Intel: https://software.intel.com/en-us/articles/opencl-drivers (2014 r2 is OpenCL 2.0)

    AMD:...
  25. Does the device report slightly less local memory...

    Does the device report slightly less local memory for CL_DEVICE_LOCAL_MEM_SIZE when you're running the r340.xx driver? It had better!

    I did notice a while back that some older NVIDIA OpenCL 1.0...
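
Regarding result 23 above (global size 700 with work group size 256): the post is cut off, so what follows is only a rough sketch of the usual "round up" arithmetic, not necessarily what the full post goes on to recommend. The helper names `round_up_global_size` and `padding_work_items` are made up for illustration and are not part of the OpenCL API.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not an OpenCL function): round global_size up to
   the next multiple of local_size so the work group size divides it evenly. */
size_t round_up_global_size(size_t global_size, size_t local_size)
{
    assert(local_size > 0);  /* a zero work group size is meaningless here */
    size_t remainder = global_size % local_size;
    if (remainder == 0) {
        return global_size;  /* already an exact multiple */
    }
    return global_size + (local_size - remainder);
}

/* Hypothetical helper: how many extra work items the rounded-up launch
   adds beyond the real problem size. The kernel has to skip these. */
size_t padding_work_items(size_t global_size, size_t local_size)
{
    return round_up_global_size(global_size, local_size) - global_size;
}
```

With the numbers from the post, `round_up_global_size(700, 256)` gives 768, matching the "256 goes into 768 but not 700" remark, and 68 padding work items are launched. Presumably the kernel would then take the real element count as an extra argument (say `n`, an assumed name) and start with a guard such as `if (get_global_id(0) >= n) return;` so the padding work items do nothing.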
Results 1 to 25 of 202