Search:

Type: Posts; User: Bilog

Page 1 of 2 1 2

Search: Search took 0.00 seconds.

  1. The preferred vector width is the vector width...

    The preferred vector width is the vector width that you should try to use on the device. If CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT is 4, for example, it means that on that device you should try to use...
  2. You are seeing the effect of the so-called...

    You are seeing the effect of the so-called delayed or lazy allocation, which is a very common technique used in many implementations.

    Basically, when an OpenCL buffer is created, it is not...
  3. Thank you very much for the reply, it's greatly...

    Thank you very much for the reply, it's greatly appreciated. Looking forward to the updated spec.
  4. Replies
    20
    Views
    6,466

    Sticky: C++14 doesn't define built-in vector types nor...

    C++14 doesn't define built-in vector types nor memory spaces either, yet OpenCL C++ still has them. Removing the restrict keyword will also cause performance regressions in all those kernels and...
  5. Your platform might have limitations on the local...

    Your platform might have limitations on the local work size in the second and third dimensions. You can check this by retrieving the CL_DEVICE_MAX_WORK_ITEM_SIZES property, which returns a list of...
  6. Replies
    20
    Views
    6,466

    Sticky: Another point that needs clarification (aside...

    Another point that needs clarification (aside from the meaning of CL_DEVICE_VENDOR_ID) is the behavior of sub-devices in terms of (pre-)existing contexts. I've opened a specific discussion about this...
  7. Ambiguity in the specification about sub-devices and contexts

    Hello all,

    what is the correct behavior in the cases of sub-devices created _after_ context creation?

    Let's say that I create a context C that only includes a single device, devA. I then...
  8. Replies
    20
    Views
    6,466

    Sticky: An additional point, concerning the available...

    An additional point, concerning the available device information:

    * as I mentioned, it would be better to have device info entry about the supported OpenCL C++ version; while currently there is...
  9. Replies
    20
    Views
    6,466

    Sticky: Absolutely agreed. An important case where the...

    Absolutely agreed. An important case where the high-leve feature exposed in OpenCL C (or C++) would be better replaced by lower-level functions is that of work-group and subgroup scans and...
  10. Replies
    20
    Views
    6,466

    Sticky: A few things I've noticed on the first read of...

    A few things I've noticed on the first read of the OpenCL C++ 1.0 draft:

    * a minor missing point is that there is no device property retrievable by `clGetDeviceInfo` about the supported OpenCL C++...
  11. According to the specification, the requirement...

    According to the specification, the requirement is that the kernel signature (number and type of arguments) should be the same for all devices for which the program was built. If you build different...
  12. The preferred wg size multiple is what the OpenCL...

    The preferred wg size multiple is what the OpenCL platforms thinks the local workgroup size should be a multiple of to achieve optimal performance. On NVIDIA GPUs, this is always returned as the warp...
  13. Replies
    32
    Views
    22,801

    work_group_prefixsum_{inclusive,exclusive}_{add,mi...

    work_group_prefixsum_{inclusive,exclusive}_{add,min,max} functions are not named correctly, since they are not necessarily additions. Is it too late to change them to...
  14. Replies
    3
    Views
    1,679

    -52 is CL_INVALID_KERNEL_ARGS, and indeed you are...

    -52 is CL_INVALID_KERNEL_ARGS, and indeed you are passing 4 args to a kernel that needs 5 of them.
  15. Replies
    8
    Views
    2,662

    You should probably report your problem to AMD...

    You should probably report your problem to AMD (they have a forum dedicated to OpenCL questions and issues over at their devgurus.amd.com site)
  16. Replies
    1
    Views
    1,193

    Nothing. Since OpenCL has separate sources for...

    Nothing. Since OpenCL has separate sources for the host and device parts, there is no need to qualify device functions.
  17. Replies
    1
    Views
    2,341

    In OpenCL all functions are automatically inlined.

    In OpenCL all functions are automatically inlined.
  18. Replies
    4
    Views
    2,108

    Re: get_global_id is undefined

    get_global_id() is a built-in of OpenCL C, so it is only defined inside of kernels. Are you trying to use it in host code? please post a minimal buildable example showing the problem.
  19. Replies
    3
    Views
    1,835

    Re: warp size vs # of SPs per SM

    On Fermi, each warp is physically executed as two half-warps; the 2.1 devices can effectively run 3 half-warps at once. (The thing is actually more complex, due to the device ability to issue more...
  20. Replies
    3
    Views
    2,164

    Re: running on GPU but not on CPU

    Are you using the Intel OpenCL SDK on an AMD CPU? In my experience, this combination doesn't work, while the reverse (AMD APP with Intel CPU) works.
  21. Replies
    2
    Views
    1,866

    Re: Copying c++ classes for use in open CL

    The OpenCL C programming language is based on C99, and therefore has no support for C++ features and types. In particular, this means you cannot pass C++ objects to OpenCL.

    For your specific...
  22. Replies
    4
    Views
    7,017

    Re: OpenCL struct alignment on host and device

    The problem is, how do you guarantee that the host and device compiler will introduce the additional padding in the same place? Note that the spec says that padding may be added. Compilers will then...
  23. Replies
    3
    Views
    5,927

    Re: OpenCL profiling tools for Linux

    The AMD APP includes a command-line profiler that works in Linux as well. It produces CSV files that you can then open and analyze by hand.
  24. Replies
    1
    Views
    3,107

    Re: Well defined ways of detecting product ID

    All the major OpenCL platforms expose the relevant information in the form of macros that you can test for in the kernel. Quality and amount of documentation of these macros varies. For example, AMD...
  25. Replies
    6
    Views
    7,175

    Re: using printf in the .cl file

    I doubt they'll ever think about doing that, unless a sizeable number of users complain about the lack of support and possibly threatens to switch to the competition.
Results 1 to 25 of 46
Page 1 of 2 1 2