Hello!

Please provide a way to call OpenCL kernels on the CPU through a simple function pointer, bypassing the threading engine.

Purpose:
- suitable for very short kernels, where the call overhead must be low
- useful with compilers that do not yet use the latest CPU instructions (including, but not limited to, MS VC++). In this case OpenCL could deliver faster-running functions, which would be the fastest possible on any platform regardless of the capabilities of the compiler or scripting language used.
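
To illustrate the kind of call I have in mind, here is a rough sketch in plain C. Nothing in it is an existing OpenCL API: `cpu_kernel_fn`, `get_cpu_kernel_pointer` and `vec_add_kernel` are hypothetical names, and the kernel body is an ordinary C function standing in for code that the OpenCL CPU compiler would generate.

```c
#include <stddef.h>

/* Hypothetical signature of the pointer the runtime would return. */
typedef void (*cpu_kernel_fn)(const float *a, const float *b,
                              float *out, size_t n);

/* Stand-in for the compiled kernel. With the requested feature this code
   would be produced by the OpenCL CPU compiler, not the host compiler. */
static void vec_add_kernel(const float *a, const float *b,
                           float *out, size_t n)
{
    for (size_t i = 0; i < n; ++i)
        out[i] = a[i] + b[i];
}

/* Hypothetical lookup: returns a directly callable pointer, with no
   command queue and no thread pool involved. */
static cpu_kernel_fn get_cpu_kernel_pointer(void)
{
    return vec_add_kernel;
}
```

The host would then fetch the pointer once and call it like any other function, e.g. `cpu_kernel_fn fn = get_cpu_kernel_pointer(); fn(a, b, out, n);`, running on the calling thread.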

Thanks!
Atmapuri