Hi,

I am writing some test code to compare various matrix-vector multiplication routines.

Thus far my code is working, however the transposed multiplication is incredibly slow compared to the...