Search:

Type: Posts; User: Gopal_HC

Search: Search took 0.00 seconds.

  1. Replies
    2
    Views
    1,127

    Thanks Rob for replying ! Surely I will look...

    Thanks Rob for replying !
    Surely I will look into POCL and would like to contribute in this.

    I installed and built ICD loader from http://www.khronos.org/registry/cl/ and
    testing of ICD Test...
  2. Replies
    2
    Views
    1,127

    Adding support for a new hardware in OpenCL

    Hi,

    As of my project, I have been assigned to add support for a new hardware device in OpenCL Software stack, which has two major components :
    1. Host Runtime: to provide OpenCL host platform API...
  3. LLVM: Manually optimizing OpenCL/CUDA intermediate code !

    Hi,

    I am interested to optimize OpenCL code, in this regards i went through some OpenCL optmization guide book which says that there are following things you should consider while optimizing your...
  4. Replies
    3
    Views
    1,142

    Thanku ! 1. after trying rotate(A, (uint)5), my...

    Thanku !
    1. after trying rotate(A, (uint)5), my kernel compiled and i got correct result.
    3. My implementation of rotate function is :
    uint rotate1(int n, uint x)
    {
    return (x << n) | (x >>...
  5. Replies
    3
    Views
    1,142

    Questions on OpenCL Built-in functions?

    Hi,
    I am trying to use OpenCL Built-in "rotate" function in one of my kernel as given below, but i am getting following errors while compiling :

    clBuildProgram Error for -11 Error Number
    error:...
  6. How OpenCL __private address space is mapped on GPU?

    OpenCL spec says that "All variables inside a function (including __kernel functions), or passed into the function as arguments are in the __private or private address space. Variables declared as...
  7. Replies
    4
    Views
    1,675

    Thank you, i used __private address space and...

    Thank you,
    i used __private address space and got result little faster compare to __local
  8. Replies
    4
    Views
    1,675

    Thanks for helping me out !!! One more thing i...

    Thanks for helping me out !!!
    One more thing i want to know, lets assume number of elements in arr is 44 then would it be efficient to use address space __local in place of __private?
    I thought of...
  9. Replies
    4
    Views
    1,675

    __global vs __constant qualifier in OpenCL

    I want an array variable to have a program scope.

    One way I can do this by passing it as a function pointer throughout the program, which might be complex when we have multiple functions...
  10. Re: clEnqueueWriteBuffer causes segmentation fault

    I found the reason of getting segmentation fault in clEnqueuWriteBuffer() function.
    the first parameter, command queue was NULL value, due to that I was getting segmentation fault.
  11. Re: OpenCL implementation for Multiple platforms

    I cleared my doubt.
    I implemented myself OpenCL program to run on multiple platforms using :
    1. command queue as a single as well as double pointer. It worked correctly.
    2. device buffer as a...
  12. OpenCL implementation for Multiple platforms

    One doubt i have about OpenCL implementation for multiple platform to run my applications simultaneously on all available devices across all the platforms. Please correct me, if my implementation is...
  13. Re: clEnqueueWriteBuffer causes segmentation fault

    I checked the size parameter and buffer overflow, everything is fine.
    Regarding the start_ptr, size parameter need not to include this.

    One doubt i have about OpenCL implementation for multiple...
  14. clEnqueueWriteBuffer causes segmentation fault

    Hi,
    I am developing OpenCL code to run my application simultaneously on multiple platforms. For testing purpose I am using only one platform (AMD APP) with Cayman device (dual gpu card) and OpenCL...
  15. Error in CodeXL: Not able to produce resource usage info

    Hi,
    I am using AMD CodeXL to analyse OpenCL kernels on AMD GPUs. I have successfully installed CodeXL-Linux-1.0 in my ubuntu machine, which also includes AMD APP KernelAnalyzer2.

    For few of my...
  16. Replies
    4
    Views
    2,254

    Re: OpenCL slow compiling on AMD card

    Hi chippies,

    Thanx again !!

    Yes i was using loop unrolling for the large loop. After removing the loop unroll pragma from that large loop, my code is compiling well on AMD card. Basically it...
  17. Replies
    4
    Views
    2,254

    OpenCL slow compiling on AMD card

    Hi, I am doing some simple OpenCL tests and i found that my kernel code compiles faster on Nvidia GPU (GeForce GTX 295) rather than AMD GPU (Cayman).

    I am using a separate .cl file of 533 lines,...
  18. Replies
    0
    Views
    1,085

    OpenCL kernel crashes with -5 Error

    I am developing OpenCL program using MultiGPU.

    I have to launch very large number of threads. At a time i am launching only few threads for a kernel, based on number of resources(registers usage...
Results 1 to 18 of 18