These results show the performance of the GPU when calculating the matrix left division of an NxN matrix with an Nx1 vector. The number of operations is assumed to be 2/3*N^3 + 3/2*N^2.
This calculation is usually compute-bound, i.e. the performance depends mainly on how fast the GPU can perform floating-point operations.
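
As a rough illustration of how the assumed operation count turns a measured solve time into a throughput figure, the following Python sketch evaluates 2/3*N^3 + 3/2*N^2 and divides by the elapsed time. The function names `backslash_flops` and `gflops`, and the sample timing, are hypothetical and only serve the example; they are not taken from the benchmark itself.

```python
def backslash_flops(n: int) -> float:
    """Assumed operation count for left-dividing an n-by-n matrix
    into an n-by-1 vector: 2/3*n^3 + 3/2*n^2."""
    return (2.0 / 3.0) * n**3 + (3.0 / 2.0) * n**2


def gflops(n: int, seconds: float) -> float:
    """Convert a measured solve time (in seconds) into GFLOPS,
    using the assumed operation count above."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return backslash_flops(n) / seconds / 1e9


if __name__ == "__main__":
    # Hypothetical timing, chosen only to show the arithmetic;
    # it is not a measurement from any real GPU.
    n, seconds = 4096, 0.05
    print(f"N = {n}: {gflops(n, seconds):.1f} GFLOPS")
```

Because the N^3 term dominates for large N, doubling N multiplies the work by roughly eight, so reported throughput is most meaningful at sizes large enough to keep the GPU busy.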