============================================
Matrix-Matrix Product Experiments with uBLAS
============================================

+----------------+-------------------------------+
| __Session 1__  | Pure C++ Implementation       |
+----------------+-------------------------------+
| __Session 2__  | Some optimizations            |
+----------------+-------------------------------+
| __Session 3__  | Using OpenMP                  |
+----------------+-------------------------------+
| __Session 4__  | Taking advantage of uBLAS     |
+----------------+-------------------------------+
| __Session 5__  | Using GCC Vector-Extensions   |
|                | for Micro-Kernels             |
+----------------+-------------------------------+
| __Session 6__  | Notes on the GEMM Algorithm:  |
|                | - Developing a micro-kernel   |
+----------------+-------------------------------+
| __Session 7__  | Application for a fast        |
|                | Matrix-Matrix Product:        |
|                | LU-Factorization              |
+----------------+-------------------------------+
| __Session 8__  | There is still some work to   |
|                | do: Comparison with Intel MKL|
+----------------+-------------------------------+

The implementation of the GEMM algorithm is based on
__BLIS: A Framework for Rapidly Instantiating BLAS Functionality__ and adapted
from __ulmBLAS__.

The tarball __test_ublas.tgz__ contains the files:

---- SHELL ---------------------------------------------------------------------
tar cfz test_ublas.tgz session*/*.hpp session*/*.cc session*/plot*
tar tfvz test_ublas.tgz
--------------------------------------------------------------------------------

:links: Session (\d) -> doc:session$1/page01
        test_ublas.tgz -> http://www.mathematik.uni-ulm.de/~lehn/test_ublas/test_ublas.tgz
        BLIS: A Framework for Rapidly Instantiating BLAS Functionality -> http://www.cs.utexas.edu/users/flame/pubs/blis1_toms_rev3.pdf
        ulmBLAS -> http://apfel.mathematik.uni-ulm.de/~lehn/ulmBLAS/