Results 1 to 2 of 2

Thread: Cache miss in kernel

  1. #1
    Junior Member
    Join Date
    Oct 2013
    Posts
    22

    Cache miss in kernel

    Hello,

    Should I consider the caches of a single core ?

    The input data is 2 3D matrices each contains 16x256x16 elements.

    When the core access the data is does it slowly.

    So I guess I caused a lot of cache miss.

    Where can I find information about the size of L1,L2 cache of a display card ?

    I'm using NVIDIA's GeForce 9400 GT: http://www.geforce.com/hardware/desk...specifications

    The spec does not contains this information.

    Thanks,
    Zvika

  2. #2
    Hi Zvika,

    Geforce 9400 GT is compute capability 1.0 (see here: https://developer.nvidia.com/cuda-gpus)

    Look at CUDA programming guide, Appendix G.3, for explanation on Compute Capability 1.x architecture, and how to access the memory (it's a split warp architectures). http://docs.nvidia.com/cuda/cuda-c-p...capability-1-x

    Regards,
    Ofer.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •