Search:

Type: Posts; User: OferRosenberg

Search: Search took 0.00 seconds.

  1. Replies
    1
    Views
    368

    Hi Zvika, Geforce 9400 GT is compute...

    Hi Zvika,

    Geforce 9400 GT is compute capability 1.0 (see here: https://developer.nvidia.com/cuda-gpus)

    Look at CUDA programming guide, Appendix G.3, for explanation on Compute Capability 1.x...
  2. Hi Sajjadul, As far as I understand, the...

    Hi Sajjadul,

    As far as I understand, the comment refers to the differences between memory allocation concepts between OpenCL and CUDA.

    In CUDA, cudaMalloc API call returns a pointer. This...
  3. Replies
    3
    Views
    1,051

    You didn't mention which implementation you're...

    You didn't mention which implementation you're using (AMD, Intel or NVIDIA).
    Try using CL_USE_HOST_PTR with a buffer allocated by the application - and have this buffer pinned/locked before the map...
  4. Hi, Few things to check: 1. Check the...

    Hi,

    Few things to check:
    1. Check the alignment of the host allocated buffer. Appendix C.3 provides the aligment rules.
    2. Note that when using CL_MEM_USE_HOST_PTR, implementations may cache...
  5. Extending clint's answer a little: The type of...

    Extending clint's answer a little:

    The type of image created is a single color per location - only R. (if you wish to work with RGBA, you need to modify the format). As such:
    1. The buffer that...
  6. In most examples of N-body that I'm familiar...

    In most examples of N-body that I'm familiar with, the usage of vector data type is somewhat reversed compared to your code - each particle is a float4 (or float3), and the kernel code has a "for"...
  7. Maybe it fails because you have two platforms...

    Maybe it fails because you have two platforms installed (Intel and AMD). Your code takes the first platform returned by clGetPlatformID, and tries to get a GPU device. If the first platform on the...
  8. The difference is that you don't need to enable...

    The difference is that you don't need to enable the extension via the compiler directive.

    Accordying to the spec, Section 9.1, if a developer wants to use an optional extension in his program, he...
  9. I did a presentation on that 3Y ago at SIGGRAPH...

    I did a presentation on that 3Y ago at SIGGRAPH 2010.
    Google for "Ofer Rosenberg SIGGRAPH" (or Bing. or Yahoo. choose your favorite...)
  10. Radeon HD6750 is VLIW5 architecture. Look at...

    Radeon HD6750 is VLIW5 architecture. Look at wikipedia or search the web for it (I tried to add a link to anandtech, but the forum system blocked me...)

    Basically, a workitem is executed on one SC...
Results 1 to 10 of 10