Giter Site home page Giter Site logo

prototype-jpeg's Introduction

build

A prototype JPEG compressor in Python.

preview

Getting Started

Environment

  • Visual Studio Code
  • Poetry 1.0.10 or later

Installation

Clone this repositroy.

git clone https://github.com/seanwu1105/prototype-jpeg

Open the root directory.

cd prototype_jpeg

Install the dependencies with Poetry.

poetry install --no-root

Uses

Show the compressed file results.

python main.py

Or to run some examples,

python example.py

Inside example.py, you could find how to compress a file with following script.

with open(spec['fn'], 'rb') as raw_file:
    original = np.fromfile(raw_file, dtype=np.uint8)
    raw_file.seek(0)
    compressed = compress(
        raw_file,
        # Set image spec.
        size=spec['size'],
        grey_level=spec['grey_level'],
        quality=spec['quality'],
        subsampling_mode=spec['subsampling_mode']
    )
with open('compressed.protojpg', 'wb') as compressed_file:
    compressed['data'].tofile(compressed_file)
header = compressed['header']  # Get compressed header (metadata).

And extract a compressed file.

with open('compressed.protojpg', 'rb') as compressed_file:
    extracted = extract(
        compressed_file,
        # Set extract spec (metadata).
        header={
            'size': header['size'],
            'grey_level': header['grey_level'],
            'quality': header['quality'],
            'subsampling_mode': header['subsampling_mode'],
            'remaining_bits_length': header['remaining_bits_length'],
            'data_slice_lengths': header['data_slice_lengths']
        }
    )

Image Spec

You can set the following image compression spec for compression and extraction.

Spec Type Details
size tuple(int, int) The size of image.
grey_level bool Grey level (True) or RGB (False) image.
quality int Baseline JPEG quality factor.
subsampling_mode 1, 2 or 4 Subsampling modes. Luminance:Chrominance = 4:subsampling_mode. The subsampling_mode in spec will not have any effect if grey_level is set True.

Compressing process would automatically generate the following 2 more items in the header. You should put these items into extract() as well.

Spec Type Details
remaining_bits_length int The length of remaining (appending) bits to fill up to 8n bits for minimal byte size in order to save into a file.
data_slice_lengths tuple(int, int, [, int, int]) Record the length of luminance and chrominance DC/AC bits for extracting process.

Development

Tests

Use the following command to run unit and integrating tests.

pytest

Compressed File Results

The PSNR gets higher, compressing/extracting time elapsing gets longer and file size gets smaller with higher QF.

analysis

For further details, see this report.

Algorithms and Implementation

Bits Storage

Since the algorithm (especially for the extracting procedure) require many times to remove the last element in an iterable, we use strings to save any bit sequence (including the data of Huffman table) for both developing and run-time speed. Actually, lists could be a better choice regarding deletion performance. However, strings have better performance and cleaner codes for the conversion between integer (decimal number) and bit sequence.

Saving bit sequence into strings or lists could lead to memory insufficiency as one bit need a byte (character or boolean in Python) to store.

RGB and Grey Level

The compression will treat grey level images as a luminance (Y) layer in RGB images. Thus, it would not be subsampled. Namely, the subsampling_mode in spec will not have any effect if grey_level is set True.

Level Offset

After the color space conversion, Cb and Cr would be in the range [-128, 127] but Y in [0, 255]. Hence, layer Y would minus 128 in order to have the same range.

Subsampling Modes

Only chrominance (Cb and Cr) would run through subsampling process. The following is the example of subsampled pixels.

subsampled pixel

def downsample(arr, mode):
    """Downsample an 2D array.
    Arguments:
        arr {2d numpy array} -- The target array.
        mode {1 or 2 or 4} -- Downsample ratio (4:mode).
    Returns:
        2d numpy array -- Downsampled array.
    """
    if mode not in {1, 2, 4}:
        raise ValueError(f'Mode ({mode}) must be 1, 2 or 4.')
    if mode == 4:
        return arr
    return arr[::3 - mode, ::2]

Slicing

In order to slice a 2D mn pixel array into 3D k * 8 * 8 blocks sequence evenly, the 2D pixel would be padded up to 8N * 8N with 0.

# Pad Layers to 8N * 8N
data[key] = np.pad(
    layer,
    (
        (0, (nrows // 8 + 1) * 8 - nrows if nrows % 8 else 0),
        (0, (ncols // 8 + 1) * 8 - ncols if ncols % 8 else 0)
    ),
    mode='constant'
)
def block_slice(arr, nrows, ncols):
    """
    Return an array of shape (n, nrows, ncols) where
    n * nrows * ncols = arr.size
    If arr is a 2D array, the returned array should look like n subblocks with
    each subblock preserving the "physical" layout of arr.
    `reshape` will raise a `ValueError` if `nrows` or `ncols` doesn't evenly
    divide the shape.
    """
    h, _ = arr.shape
    return (arr.reshape(h//nrows, nrows, -1, ncols)
               .swapaxes(1, 2)
               .reshape(-1, nrows, ncols))

Quality Factor

The range of quality factor QF is (0, 95] because:

If QF = 0, by the definition of quantization, Q = S * f(QF) / 100, where Q is quantization table, S is default quantization table (for baseline JPEG)

default quantization table

and

function

it would cause division-by-zero error.

If QF < 0, the quantization table would have negative elements.

If there is an element smaller than 1 in quantization table, after the division in quantization process, the corresponding element in the result image block would become larger, which violates the goal of quantization. Assume QF = 95, Q = S / 10. The minimal element in S is 10, which would be 1 in Q after the division, and if QF > 95, the minimal element in Q would be smaller than 1. Thus, QF should always be smaller than or equal to 95.

Baseline JPEG Huffman Tables

The baseline JPEG Huffman table could be found in http://dirac.epucfe.eu, which has the same Huffman coding for DC and AC luminance, as well as DC chrominance in the class slides. However, that Huffman table also includes the AC chrominance coding, which could decrease the compressed file size significantly. You can find the complete table in /prototype_jpeg/codec.py.

CHROMINANCE: bidict({
    EOB: '00',  # (0, 0)
    ZRL: '1111111010',  # (F, 0)
    (0, 1):  '01',
    (0, 2):  '100',
    (0, 3):  '1010',
    (0, 4):  '11000',
    (0, 5):  '11001',
    (0, 6):  '111000',
    (0, 7):  '1111000',
    (0, 8):  '111110100',
    (0, 9):  '1111110110',
    (0, 10): '111111110100',
    # …
})

Furthermore, for the high performance, we save the Huffman table as a bidirectional hash table since every key and value in the Huffman table is unique. For the uniqueness (unique decodable) test, you can find the test in /tests/tests_unit/test_codec.py.

prototype-jpeg's People

Contributors

dependabot-preview[bot] avatar seanwu1105 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

prototype-jpeg's Issues

Test image

Hello, could Hello, could you tell me please, when I try to change the test image, I have are a problem: cannot reshape array of size 786572 into shape (512,512,3). Please tell me, what I doing wrong.you tell me please, when I try to change the test image, I have are a problem: cannot reshape array of size 786572 into shape (512,512,3). Please tell me, what I doing wrong.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.