Search:

Type: Posts; User: louiswu

Search: Search took 0.00 seconds.

  1. Replies
    4
    Views
    2,365

    Re: clBuffers, synchronised memory and flags

    So setting CL_MEM_COPY_HOST_PTR results in it being copied to and from the gpu on each run, I actually don't need to use clEnqueueWriteBuffer to copy the data over as it seems to copy it over for me,...
  2. Replies
    4
    Views
    2,365

    Re: clBuffers, synchronised memory and flags

    Sorry I probably haven't made myself very clear so a simplified version:

    Currently I've only learnt OpenCL using Buffer objects to transfer data between the host and the GPU, however as a buffer...
  3. Replies
    4
    Views
    2,365

    clBuffers, synchronised memory and flags

    Hi there,

    I'm testing my application with nVidias OpenCL Visual Profiler and I'm noticing that memory buffer I'm allocating for the device using this code:

    //_bins
    _cmDevBins =...
  4. trouble understanding when gld uncoalesced loads occur

    Hi there,

    I'm writing some code to perform an image registration between two images, basically this involves working with values from 2 64x64x64 3d images which I am passing into my kernel via...
  5. Replies
    2
    Views
    2,085

    Re: Obtaing error strings from codes

    Cheers that helped a lot
  6. Replies
    99
    Views
    65,014

    Re: OpenCL C++ Bindings

    Hi there,

    I'm trying to work out how to use the sampler from the c++ bindings however there is no constructor given for the class where I was expecting something similar to the current openCL...
  7. Replies
    2
    Views
    2,085

    Obtaing error strings from codes

    Hi there I'm using the C++ openCL bindings and my errors take the form of an int (-10, -54 etc) I was wondering how I can use this int to obtain the actual string error message?

    Thanks in advance.
  8. Need some help understanding how to obtain a threads warpid

    Hi there,

    Apologies for the newb question im having some trouble getting my head around openCL in relation to a 3d case.

    I'm wanting to obtain the relevant warpId of a given thread, in a 1D...
  9. Replies
    3
    Views
    2,606

    Re: Writing to shared global memory

    Sorry I should explain better, this was a simplification of my problem.

    I'm writing an algorithim which compares two images similarity by creating a histogram.

    What this essentially entails is...
  10. Replies
    3
    Views
    2,606

    Writing to shared global memory

    Hi there, I'm wondering how to do what seems like a fairly simple task. I want each thread to increment an integer, so that if 30 threads are run on a the gpu device, the counter will be 30.

    I'm...
  11. Replies
    1
    Views
    1,854

    Re: Enabling double precision

    Ah apologies, just realised my gfx card does not support cuda model 1.3 for double precision (using a 8800gts).

    On a related to note, is there an easy way to get meaning ful error strings from the...
  12. Replies
    1
    Views
    1,854

    Enabling double precision

    Hi there,

    I'm trying to enable double precision through my kernels however I get an error thrown when I include the extension enabling pragma:
    #pragma OPENCL EXTENSION cl_khr_fp64 : enable

    I'm...
Results 1 to 12 of 13