Modification to the Micro Kernel


The micro kernel has a huge influence on the overall performance of the GEMM operation. Before we investigate this further in the next session, we will do some simple experiments here.


In the micro kernel, swap the two innermost loops when computing

\[R \leftarrow A \cdot B\]

Re-run the benchmark!
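
To make the loop swap concrete, here is a minimal sketch of the two variants. It is not the session's actual micro kernel: the names `ugemm_ref` and `ugemm_swapped`, the block sizes `MR` and `NR`, the packed layouts of A and B, and the assumption that the original loop order is l, j, i are all hypothetical and chosen only for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Assumed register-block sizes; the values used in the session may differ. */
#define MR 4
#define NR 4

/* Assumed layouts (hypothetical):
   A is a packed MR x k panel, stored column by column,
   B is a packed k x NR panel, stored row by row,
   R is an MR x NR buffer, stored column by column. */

/* Assumed original order: l outermost, then j, then i innermost. */
void ugemm_ref(size_t k, const double *A, const double *B, double *R)
{
    for (size_t p = 0; p < MR * NR; ++p)
        R[p] = 0.0;
    for (size_t l = 0; l < k; ++l)
        for (size_t j = 0; j < NR; ++j)
            for (size_t i = 0; i < MR; ++i)
                R[i + j * MR] += A[i + l * MR] * B[l * NR + j];
}

/* Same computation with the two innermost loops swapped: l, then i, then j.
   The innermost loop now walks R with stride MR instead of stride 1. */
void ugemm_swapped(size_t k, const double *A, const double *B, double *R)
{
    for (size_t p = 0; p < MR * NR; ++p)
        R[p] = 0.0;
    for (size_t l = 0; l < k; ++l)
        for (size_t i = 0; i < MR; ++i)
            for (size_t j = 0; j < NR; ++j)
                R[i + j * MR] += A[i + l * MR] * B[l * NR + j];
}

/* Sanity check: both loop orders must produce the same R. */
void check_ugemm_variants(void)
{
    enum { K = 3 };
    double A[MR * K], B[K * NR], R1[MR * NR], R2[MR * NR];

    for (size_t p = 0; p < MR * K; ++p)
        A[p] = (double)(p + 1);
    for (size_t p = 0; p < K * NR; ++p)
        B[p] = 0.5 * (double)p;

    ugemm_ref(K, A, B, R1);
    ugemm_swapped(K, A, B, R2);

    /* Each entry accumulates over l in the same order in both variants,
       so an exact comparison is valid here. */
    for (size_t p = 0; p < MR * NR; ++p)
        assert(R1[p] == R2[p]);
}
```

Both variants compute the same result; only the memory access pattern of the innermost loop changes, and that is the effect the benchmark should reveal. Calling `check_ugemm_variants()` before benchmarking is a cheap way to confirm that the swap did not alter the result.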