by zeryx on 9/16/15, 1:35 AM with 2 comments
by wyldfire on 9/16/15, 1:57 AM
I don't know if it's still the case, but in the past CUDA/OpenCL kernels would do all of the execution work for every path in the control-flow graph (CFG) and only write the results of the path actually taken to global memory.
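To make that concrete, here is a toy emulation in plain Python of the behavior described above. It is only a sketch, not GPU code: `WARP_SIZE`, `then_path`, `else_path`, and `run_warp` are made-up names for illustration, and real hardware may well skip a path when no lane in the group takes it.

```python
# Toy emulation of "execute every path, keep only the taken result".
# Not GPU code: all names below are hypothetical, chosen for illustration.

WARP_SIZE = 8  # assumption: a small group of lanes to keep the output short


def then_path(x):
    """Work done on the 'if' side of the branch."""
    return x * 2


def else_path(x):
    """Work done on the 'else' side of the branch."""
    return x + 100


def run_warp(inputs):
    """Emulate `if x % 2 == 0: then_path(x) else: else_path(x)` for a group of lanes."""
    mask = [x % 2 == 0 for x in inputs]            # per-lane branch predicate
    then_results = [then_path(x) for x in inputs]  # every lane does the 'then' work
    else_results = [else_path(x) for x in inputs]  # every lane does the 'else' work
    # Only the result of the path each lane actually took is written out.
    out = [t if m else e for m, t, e in zip(mask, then_results, else_results)]
    work_done = len(then_results) + len(else_results)
    return out, work_done


out, work_done = run_warp(list(range(WARP_SIZE)))
print(out)        # [0, 101, 4, 103, 8, 105, 12, 107]
print(work_done)  # 16 path evaluations for 8 results
```

The point of the sketch is the cost: 8 lanes produce 8 results, but 16 path evaluations are performed, because each lane pays for both sides of the branch.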