=======================
GEMM (AVX Micro Kernel)
=======================

After studying the __SSE micro kernel__ of __BLIS__ we apply some of its
concepts to the __AVX__ instruction set.  Again you can use the
__Intel intrinsics guide__ for help on AVX.

- __Page 1__ Again we start with the naive approach of using AVX.

Note that all benchmarks were generated when doctool transformed the doc files
to HTML.  All this happened on my iMac, which has a 2.7 GHz Intel i5.  The
theoretical peak performance of one core is 21.6 GFLOPS (2.7 GHz × 8
double-precision floating point operations per cycle with AVX).

:links: SSE micro kernel       -> http://apfel.mathematik.uni-ulm.de/~lehn/sghpc/gemm/index.html
        BLIS                   -> https://code.google.com/p/blis/
        AVX                    -> http://en.wikipedia.org/wiki/Advanced_Vector_Extensions
        Intel intrinsics guide -> https://software.intel.com/sites/landingpage/IntrinsicsGuide/
        Page (\d)              -> doc:page0$1/index