Here are the results of my experiments with MATLAB's GPU capabilities -- these experiments all require the Parallel Computing Toolbox.
All timing results are from my HP Z420 PC, which has a 4-core Intel Xeon E5-1620 CPU running at 3.6GHz and an NVIDIA GeForce GTX 670 with 1344 cores running at 1GHz. Note that double precision performance on NVIDIA GeForce cards is much lower than single precision performance.
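The gap between single and double precision can be checked directly with a rough timing sketch like the one below. It is only an illustration and not one of the experiments listed on this page: the matrix size n and all variable names are my own assumptions, and the ratio will vary with the card and the operation being timed.

```matlab
% Rough sketch: time a matrix multiply on the GPU in single and double precision.
% The size n is an assumed value; reduce it if the GPU runs out of memory.
n   = 2048;
dev = gpuDevice;                 % handle to the currently selected GPU

As = gpuArray(rand(n, 'single')); % single precision data on the GPU
Ad = gpuArray(rand(n));           % double precision data on the GPU

% Warm-up so that one-off start-up costs are not included in the timings.
Ws = As*As;  Wd = Ad*Ad;  wait(dev);

tic;  Bs = As*As;  wait(dev);  ts = toc;   % wait() blocks until the GPU has finished
tic;  Bd = Ad*Ad;  wait(dev);  td = toc;

fprintf('single: %.4f s, double: %.4f s, ratio: %.1f\n', ts, td, td/ts);
```

The calls to wait(dev) matter because GPU operations can return before the work has finished, so a plain tic/toc pair without them may under-report the time.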
25/02/13:
26/02/13:
27/02/13:
28/02/13:
04/03/13:
Installing MATLAB was easy as usual, but trying to run GPU code produced an error, apparently because the Parallel Computing Toolbox does not come with a built-in copy of the CUDA toolkit. Installing CUDA (again a simple task) fixed this.
One oddity: after installing CUDA I was able to use the command "gpuDevice" to check which GPU I had, but running application code produced nonsense. Being used to the peculiarities of software installation, I rebooted the system, and now things seem to work correctly. This advice might have been included in the installation documentation, but of course I didn't read it.
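A quick sanity check along the following lines might catch this kind of problem straight after installation. It is a minimal sketch and not the application code mentioned above: the test vector, its length and the tolerance are assumptions chosen purely for illustration.

```matlab
% Minimal post-install check: query the GPU, then compare a GPU result with the CPU.
dev = gpuDevice;                          % errors if no usable GPU is found
fprintf('GPU: %s (compute capability %s, CUDA toolkit %g)\n', ...
        dev.Name, dev.ComputeCapability, dev.ToolkitVersion);

x      = rand(1000, 1);                   % assumed test data, generated on the CPU
cpuSum = sum(x.^2);                       % reference result computed on the CPU
gpuSum = gather(sum(gpuArray(x).^2));     % same computation on the GPU, copied back

% The two sums may differ in the last few bits because of summation order,
% so compare with a relative tolerance (1e-10 is an assumed value).
relErr = abs(cpuSum - gpuSum) / abs(cpuSum);
assert(relErr < 1e-10, 'GPU and CPU results disagree: relative error %g', relErr);
disp('GPU sanity check passed');
```

If the assertion fails even though gpuDevice reports the card correctly, a reboot is a cheap first thing to try.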
Initial experiments give disappointing performance, roughly a factor of 10 slower than my GTX 670. I'll need to investigate this and seek help from others.
06/03/13:
07/03/13: