Page 2 of 2 FirstFirst 12
Results 11 to 13 of 13

Thread: Optimisation tips for fetch intensive kernel on ATI

  1. #11
    Senior Member
    Join Date
    Aug 2011
    Posts
    271

    Re: Optimisation tips for fetch intensive kernel on ATI

    Quote Originally Posted by neFAST
    Thanks for your answer.
    So what would be your explanation in terms of hardware difference? Are the NVidia cards taking less cycles per global read? Or is it the cache system that is better?
    It's mostly the ALU packing on the VLIW units, and the way branches/fetches work (clauses). If you get bad ALU packing you can lose a lot of performance, and some code just can't be changed to improve it.

    This is why GCN departed from the VLIW ways of it's predecessors - it should still be fine for graphics, but will help a lot for some non-graphics code.

  2. #12
    Junior Member
    Join Date
    May 2012
    Posts
    7

    Re: Optimisation tips for fetch intensive kernel on ATI

    Quote Originally Posted by notzed
    Quote Originally Posted by neFAST
    This is why GCN departed from the VLIW ways of it's predecessors - it should still be fine for graphics, but will help a lot for some non-graphics code.
    My 7850 (Pitcairn) is using GCN, right?

  3. #13
    Senior Member
    Join Date
    Aug 2011
    Posts
    271

    Re: Optimisation tips for fetch intensive kernel on ATI

    Quote Originally Posted by neFAST
    Quote Originally Posted by notzed
    Quote Originally Posted by neFAST
    This is why GCN departed from the VLIW ways of it's predecessors - it should still be fine for graphics, but will help a lot for some non-graphics code.
    My 7850 (Pitcairn) is using GCN, right?
    ahh yeah, sorry missed that.

Page 2 of 2 FirstFirst 12

Similar Threads

  1. Replies: 0
    Last Post: 03-03-2012, 08:12 AM
  2. Replies: 2
    Last Post: 07-25-2011, 02:15 PM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •