Thread: Reboot or killing jobs remotely after crash?

  1. #1
    Junior Member
    Join Date
    Feb 2011
    Posts
    1

    Reboot or killing jobs remotely after crash?

    Hi everyone!

    I work at a research university and I'm beginning to learn OpenCL for a numerical computations research project...

    The plan is to buy a desktop for the office to run computations...
    The issue is that I'll frequently need to work remotely, logging into the machine from home or from a different office.

    The question is: what happens when my program crashes?

    (1) If the GPU locks up, will I be able to log in remotely and kill the process, or just reboot the machine remotely?

    (2) If the machine has two GPUs (for example, one on-die with the CPU), is it possible to run my computations on one GPU while the other GPU does "normal" GPU duties (my programs won't be outputting to a monitor, just to data files)? And in that case, if I'm actually at the desktop, can program crashes be handled more easily?


    Any advice or suggestions welcome!

    Thanks!

    NoChance

  2. #2

    Re: Reboot or killing jobs remotely after crash?

    Quote Originally Posted by NoChance
    The issue is that I'll frequently need to work remotely, logging into the machine from home or from a different office.
    ATI published a guide to using Stream (and therefore their OpenCL implementation) over SSH:
    http://developer.amd.com/gpu_assets/App ... motely.pdf

    Quote Originally Posted by NoChance
    The question is: what happens when my program crashes?

    (1) If the GPU locks up, will I be able to log in remotely and kill the process, or just reboot the machine remotely?
    Chances are, you won't be able to just kill the process. In the lockups I've had, the host process (which runs on the CPU) could not pre-empt a kernel that was already executing on the GPU. A reboot would be a good option. You could also get remote power-up working, but sometimes you might need a hard reset. I haven't had too much trouble with OpenCL lockups; usually some kind of error pops up and crashes the app before everything locks up, but I have definitely played with this behavior on DirectX 11.
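
    If it helps, here is a rough sketch of a watchdog you could start over SSH so that a hung job at least gets a kill attempt without you being logged in. It is only an illustration, not something from the ATI guide: the function name `run_with_timeout`, the timeout values, and the status strings are all made up for this example. It uses only the Python standard library and knows nothing about OpenCL. If the process is stuck inside the GPU driver, the kill may never take effect, and the sketch just reports that so you know a reboot is probably needed.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_s, kill_grace_s=5.0):
    """Run cmd as a child process and try to kill it if it overruns.

    Returns a (status, returncode) tuple. The status strings are
    hypothetical labels chosen for this sketch:
      "finished"   - the job exited on its own within timeout_s
      "killed"     - the job overran and went away after kill()
      "unkillable" - the job did not exit even after kill()
                     (for example, blocked inside a driver call)
    """
    proc = subprocess.Popen(cmd)
    try:
        returncode = proc.wait(timeout=timeout_s)
        return ("finished", returncode)
    except subprocess.TimeoutExpired:
        # SIGKILL on POSIX, TerminateProcess on Windows.
        proc.kill()
        try:
            proc.wait(timeout=kill_grace_s)
            return ("killed", proc.returncode)
        except subprocess.TimeoutExpired:
            # The process ignored the kill; a reboot is likely the only way out.
            return ("unkillable", None)


if __name__ == "__main__":
    # Stand-in for a real compute job: a child that just sleeps too long.
    job = [sys.executable, "-c", "import time; time.sleep(30)"]
    status, code = run_with_timeout(job, timeout_s=1.0)
    print(status, code)
```

    In practice you would replace the stand-in job with your own program and pick a timeout comfortably longer than a normal run. Treat the "unkillable" result as the cue to reboot remotely.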


    Quote Originally Posted by NoChance
    (2) If the machine has two GPUs (for example, one on-die with the CPU), is it possible to run my computations on one GPU while the other GPU does "normal" GPU duties (my programs won't be outputting to a monitor, just to data files)? And in that case, if I'm actually at the desktop, can program crashes be handled more easily?
    You CAN run on one GPU and let the other one do whatever it does, but it probably won't help you at all with debugging a crash. It just means you have another GPU available.
