============================ Packing Blocks from Matrix A [TOC] ============================ In the cache optimized GEMM-Operation $\beta C + \alpha A B \to C$ the matrix $A$ gets partitioned into blocks with maximal dimension $M_c \times K_c$. Each block $A_{i,l}$ of $A$ gets packed into col-major horizontal panels with $M_r$ rows. Exercise ======== - Implement function `dpack_A` in the test program below. Before you start coding: Make yourself familiar with the test program below. Simple Test Program =================== :import: session16/simple_test_packa_ex.c