Giter Site home page Giter Site logo

compression_benchmark's Introduction

Compression Testing

Usage

This set of scripts can be used to test the advantages of using parallel versions of compression programs on your own dataset.

We have written a bash script to perform the compressions, and a R script to create a graph of the data.

The bash script is called like this :

./compresstest.sh file_to_compress 2>&1 | tee `hostname`_`date +%Y%m%d_%H%M%S`.csv

After it has finished, the data needs to be filtered and formatted for use in the R script, like this:

cat dev1_20141220_095316.csv | grep -E -f matches.grep  | grep -v Pavlov | sed ':a;N;$!ba;s/%\n/\t/g' > compressionResults.csv

Then use the R script to generate a graph of the result :

R CMD BATCH --no-save --no-restore "--args compressionResults.csv blah.pdf pdf" compressionResults.R

Compression programs installation

Most of the programs are part of your Linux distribution, so for e.g. Ubuntu do this :

sudo aptitude install lzop p7zip pbzip2 pigz pixz zip liblz4-tool

TAMP has to be created from source, and added to your PATH.

First install the necessary libraries, as TAMP is a 32 bit program:

sudo aptitude install libc6-dev-i386

Then fetch TAMP and QuickLZ 1.41, and build:

wget https://blogs.oracle.com/timc/resource/tamp-2.5-source.zip && \
unzip tamp-2.5-source.zip && cd tamp-2.5 && \
wget http://www.quicklz.com/quicklz141.zip && unzip quicklz141.zip && \
make -f Makefile.linux && \
export PATH=`readlink -f .`:$PATH

Example output

Example output

compression_benchmark's People

Contributors

jvhaarst avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Forkers

jervenbolleman

compression_benchmark's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.