Giter Site home page Giter Site logo

jupyter-matrix-matrix's Introduction

Measuring the floating point performance of your machine

Like many people, I've had access to a bunch of machines over the years and I wanted to keep a record of their performance. Think of this as a personal diary of the performance of some machine I've played with.

Matrix-Matrix multiplication

Matrix-Matrix multiplication is often used to benchmark machines because the mathematics is such that it is one of the few operations where one can obtain close to theoretical peak performance in pratice.

The number of floating point operations(Flops) in a Matrix-Matrix multiplication of two $N$ x $N$ matrices is given by $2N^3-N^2$ (See http://www2.hawaii.edu/~norbert/CompPhys/compphysicsch10.html for details of how this was determined).

For this benchmark, we construct two random $N$ x $N$ matrices and time how long it takes to multiply them together, $t$. We then compute the number of Gigaflops/s exhibited by that operation via $\frac{2N^3-N^2}{t*10^9}$. We do this several times for any given $N$ and record the average and the best (peak) results obtained. $N$ is then varied from, for example, 250 to 10,000 in steps of 250 to see how performance varies with matrix size. I used to think that the best performance would be exhibited by the largest matrices -- the computation saturating the hardware somehow -- but that's not always what I've actually seen. On hardware such as laptops, thermal throttling may play a part for example.

For highest performance, you should use a version of numpy that has been linked against a high performance BLAS library such as OpenBLAS or the Intel MKL(https://software.intel.com/en-us/intel-mkl). The Anaconda Python distribution includes the Intel MKL by default on Windows and Linux (Mac includes its own high performance BLAS library).

Results

If you look at individual results notebooks you'll notice that the notebooks have evolved a little over time. The core computation is always the same though.

Cloud

  • Amazon c5x18xlarge, November 2017, Max size 10000x10000, 1366 Gflops
  • Azure Notebook, May 2017, Max size 1000x1000, 263 Gflops - This was a free service offered by Microsoft. Discussed at https://walkingrandomly.com/?p=6351
  • Amazon c4x4xlarge, September 2017, Max size 10000x10000, 333 Gflops

Laptops

HPC Clusters

Results from various traditional HPC Clusters.

jupyter-matrix-matrix's People

Contributors

michaelcroucher avatar mikecroucher avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

jupyter-matrix-matrix's Issues

Problem with the Iceberg result

The node tested appears to have had Hyperthreading turned on. This is not normal for the rest of the cluster. Need to run again.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.