These results show the performance of the GPU when calculating the matrix left division of an NxN matrix with an Nx1 vector. The number of operations is assumed to be 2/3*N^3 + 3/2*N^2.

This calculation is usually compute-bound, i.e. the performance depends mainly on how fast the GPU can perform floating-point operations.