In this tute we'll delve into the crux of CUDA programming: threads, thread blocks and the grid. CUDA uses many threads to simultaneously do the work that would usually be given to the iterations of a C++ loop. I've included some code at the end to show how to add the elements of two arrays using CUDA; this is the "Hello world!" of CUDA programming. I've not set up the project this time, so please refer to the second tutorial for the steps to do this.
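To give a rough idea of what that array addition looks like, here is a minimal sketch. It is not the code from the tutorial itself: the names addArrays and addOnGpu, the array length of 1000 and the block size of 256 are all assumptions picked for illustration. Each thread works out its own index from its block and thread numbers, then does what a single iteration of the C++ loop would have done.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Kernel: each thread adds one pair of elements (one "loop iteration").
// The name addArrays is hypothetical, chosen for this sketch.
__global__ void addArrays(const int* a, const int* b, int* c, int n)
{
    // Global index of this thread within the whole grid.
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    // The last block may contain threads past the end of the arrays.
    if (i < n)
        c[i] = a[i] + b[i];
}

// Host helper (hypothetical): copies the inputs to the GPU, launches the
// kernel and copies the result back. Returns false if a CUDA call fails.
bool addOnGpu(const int* a, const int* b, int* c, int n)
{
    const size_t bytes = n * sizeof(int);
    int *devA = nullptr, *devB = nullptr, *devC = nullptr;
    bool ok = cudaMalloc(&devA, bytes) == cudaSuccess
           && cudaMalloc(&devB, bytes) == cudaSuccess
           && cudaMalloc(&devC, bytes) == cudaSuccess
           && cudaMemcpy(devA, a, bytes, cudaMemcpyHostToDevice) == cudaSuccess
           && cudaMemcpy(devB, b, bytes, cudaMemcpyHostToDevice) == cudaSuccess;
    if (ok) {
        const int threadsPerBlock = 256;  // assumed block size
        // Round up so the grid has at least n threads in total.
        const int blocks = (n + threadsPerBlock - 1) / threadsPerBlock;
        addArrays<<<blocks, threadsPerBlock>>>(devA, devB, devC, n);
        // The copy back waits for the kernel to finish before it runs.
        ok = cudaGetLastError() == cudaSuccess
          && cudaMemcpy(c, devC, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    cudaFree(devA);
    cudaFree(devB);
    cudaFree(devC);
    return ok;
}

int main()
{
    const int n = 1000;  // assumed array length
    std::vector<int> a(n), b(n), c(n);
    for (int i = 0; i < n; ++i) {
        a[i] = i;
        b[i] = 2 * i;
    }

    if (!addOnGpu(a.data(), b.data(), c.data(), n)) {
        std::printf("A CUDA call failed\n");
        return 1;
    }

    // Compare against the plain C++ loop the threads replaced.
    for (int i = 0; i < n; ++i) {
        if (c[i] != a[i] + b[i]) {
            std::printf("Mismatch at index %d\n", i);
            return 1;
        }
    }
    std::printf("All %d sums match\n", n);
    return 0;
}
```

With these assumed sizes the grid is 4 blocks of 256 threads, so 1024 threads are launched for 1000 elements, and the `if (i < n)` check keeps the 24 spare threads from writing past the end of the arrays.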
There's something wrong with my microp