Some device support device fission, but it would be also very useful to have some kind of device fusion rather than have to manually handle several devices and command queues. Sure, for some algorithms fusing devices wouldn't give you the kind of fine grain control you need, but for embarassingly parallel algorithms this would be great!