Search:

Type: Posts; User: RCL

Search: Search took 0.00 seconds.

  1. Re: Arrangement (order) of threads inside 2D work unit

    UPDATE: about the last possibility: it's basically ruled out because 16x1 or 1x16 work groups are significantly slower than 4x4 work groups, so the data I'm processing definitely exhibits 2D...
  2. Re: Arrangement (order) of threads inside 2D work unit

    My kernel threads perform better when they are executed in smaller square-like blocks (like 4x4), because then they are more likely to take the same branch. However, just decreasing work group size...
  3. Arrangement (order) of threads inside 2D work unit

    Hi,

    If underlying hardware can only operate on N threads simultaneously (like, say, warp or half-warp sizes on NVIDIAs current cards, which are 32/16 threads respectively), how do threads in a...
Results 1 to 3 of 3