Results 1 to 3 of 3

Thread: Scalability issues in openCL.

  1. #1
    Junior Member
    Join Date
    Jun 2012
    Posts
    4

    Scalability issues in openCL.

    I am new to openCL and trying to learn the basics. I have a doubt that if i specify the size of work-group and NDRange in a way so that the total number of work-items excede the processing elements in a device , whether it is possible to execute those work-items in parallel and if yes how it is implemented.

    I will also be grateful if someone can explain how the abstract model of openCL is mapped to hardware model with some example.

  2. #2
    Senior Member
    Join Date
    Aug 2011
    Posts
    271

    Re: Scalability issues in openCL.

    Quote Originally Posted by gtg_bansal
    I am new to openCL and trying to learn the basics. I have a doubt that if i specify the size of work-group and NDRange in a way so that the total number of work-items excede the processing elements in a device , whether it is possible to execute those work-items in parallel and if yes how it is implemented.

    I will also be grateful if someone can explain how the abstract model of openCL is mapped to hardware model with some example.
    This is covered in the introductory material for all GPU vendor's documentation. Much better than could be explained with a few words - the diagrams help.

    They also cover how specifying more work groups is beneficial - it's pretty much the design rationale of modern GPU's that allowed opencl to exist in the first place, so you can be sure it works - but you don't really need to know how it's done to use it.

    e.g. nvidia opencl programming guide, section 2.1
    amd accelerated paralel processing - opencl - programming guide, chapter 1.

    (if you don't already have them ... use google)

  3. #3
    Junior Member
    Join Date
    Jun 2012
    Posts
    4

    Re: Scalability issues in openCL.

    Quote Originally Posted by notzed
    Quote Originally Posted by gtg_bansal
    I am new to openCL and trying to learn the basics. I have a doubt that if i specify the size of work-group and NDRange in a way so that the total number of work-items excede the processing elements in a device , whether it is possible to execute those work-items in parallel and if yes how it is implemented.

    I will also be grateful if someone can explain how the abstract model of openCL is mapped to hardware model with some example.
    This is covered in the introductory material for all GPU vendor's documentation. Much better than could be explained with a few words - the diagrams help.

    They also cover how specifying more work groups is beneficial - it's pretty much the design rationale of modern GPU's that allowed opencl to exist in the first place, so you can be sure it works - but you don't really need to know how it's done to use it.

    e.g. nvidia opencl programming guide, section 2.1
    amd accelerated paralel processing - opencl - programming guide, chapter 1.

    (if you don't already have them ... use google)
    Thank you notzed for your suggestion.

Similar Threads

  1. Issues in OpenCL spec 1.2
    By himanshu.gautam in forum OpenCL
    Replies: 6
    Last Post: 12-22-2011, 04:10 PM
  2. OpenCL/OpenGL interop syncing issues
    By welford in forum OpenCL
    Replies: 0
    Last Post: 09-04-2010, 05:48 AM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •