Results 1 to 2 of 2

Thread: vload_half and vloada_half

  1. #1
    Junior Member
    Join Date
    Nov 2009
    Posts
    1

    vload_half and vloada_half

    Hi all
    In OpenCL spec, there are 2 versions of this kind of build-in functions for half type. the only difference I found is that they have different requirement of alignment. does it mean that vloada_half() will have an higher performance? And what is the purpose of this differentiation. Thanks!
    shawn
    Thanks

  2. #2
    Senior Member
    Join Date
    Sep 2002
    Location
    Santa Clara
    Posts
    105

    Re: vload_half and vloada_half

    vload_halfn allow you to load a 1, 2, 4, 8 or 16 component half-vector where the alignment requirement is that p be aligned to a 16-bit i.e. size of a scalar half boundary.

    vloada_halfn allow you to load a 1, 2, 4, 8 or 16 component half-vector where the alignment requirement is that p be aligned to the size of half vector. vloada_halfn should, in most cases, give you better memory access performance compared to the unaligned vload_halfn version.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •