Page 2 of 2 FirstFirst 12
Results 11 to 13 of 13

Thread: Optimisation tips for fetch intensive kernel on ATI

  1. #11
    Senior Member
    Join Date
    Aug 2011
    Posts
    271

    Re: Optimisation tips for fetch intensive kernel on ATI

    Quote Originally Posted by neFAST
    Thanks for your answer.
    So what would be your explanation in terms of hardware difference? Are the NVidia cards taking less cycles per global read? Or is it the cache system that is better?
    It's mostly the ALU packing on the VLIW units, and the way branches/fetches work (clauses). If you get bad ALU packing you can lose a lot of performance, and some code just can't be changed to improve it.

    This is why GCN departed from the VLIW ways of it's predecessors - it should still be fine for graphics, but will help a lot for some non-graphics code.

  2. #12
    Junior Member
    Join Date
    May 2012
    Posts
    7

    Re: Optimisation tips for fetch intensive kernel on ATI

    Quote Originally Posted by notzed
    Quote Originally Posted by neFAST
    This is why GCN departed from the VLIW ways of it's predecessors - it should still be fine for graphics, but will help a lot for some non-graphics code.
    My 7850 (Pitcairn) is using GCN, right?

  3. #13
    Senior Member
    Join Date
    Aug 2011
    Posts
    271

    Re: Optimisation tips for fetch intensive kernel on ATI

    Quote Originally Posted by neFAST
    Quote Originally Posted by notzed
    Quote Originally Posted by neFAST
    This is why GCN departed from the VLIW ways of it's predecessors - it should still be fine for graphics, but will help a lot for some non-graphics code.
    My 7850 (Pitcairn) is using GCN, right?
    ahh yeah, sorry missed that.

Page 2 of 2 FirstFirst 12

Similar Threads

  1. Newbie! Trying to learn how to do data intensive calculus?
    By seinecle in forum OpenCL - parallel programming of heterogeneous systems
    Replies: 0
    Last Post: 03-03-2012, 07:12 AM
  2. Kernel now working -- card to buy -- NVidia or ATI?
    By Photovore in forum OpenCL - parallel programming of heterogeneous systems
    Replies: 2
    Last Post: 07-25-2011, 02:15 PM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •