News

Use the supplied SIMD SSE intrinsics code to run an array multiplication/reduction timing experiment. Run the same experiment a second time using your own C/C++ array multiplication/reduction code.
This repository contains a serial C++ implementation of matrix-matrix multiplication, also known as the Basic Linear Algebra Subroutine (BLAS3 operation). The code focuses on performance optimization ...