When running an OpenCL data-parallel kernel on an SSE-enabled CPU, does OpenCL automatically generate SSE code that maps work items to the channels of the SSE compute units? Or do you have to code using...