Giter Site home page Giter Site logo

fabiospampinato / base128-encoding Goto Github PK

View Code? Open in Web Editor NEW
11.0 3.0 0.0 1.18 MB

Base128 encoding, the intersection of latin1 and utf-8, which is basically ASCII, the most memory-efficient string encoding that can be written to disk as utf-8 without ballooning in size.

License: MIT License

TypeScript 55.71% JavaScript 44.29%
base128 encoding

base128-encoding's Introduction

Base128

Base128 encoding, the intersection of latin1 and utf-8, which is basically ASCII, the most memory-efficient string encoding that can be written to disk as utf-8 without ballooning in size.

Comparison

Note: This works differently from the "ascii" encoding of Node's Buffer, which isn't really an encoding since you can't convert back and forth from it without losing data in general.

In JavaScript there are 3 noteworthy encodings for encoding arbitrary binary data to string: base64, base128, and base256.

  • base64: it uses only 64 out of 256 characters that can be encoded with 1 byte, it's therefore memory inefficient, but it produces human-friendly and url-friendly strings.
  • base256: it uses all 256 characters that can be encoded with 1 byte, it's therefore the most memory efficient encoding, but it produces human-unfriendly strings and when persistented to disk it's only efficient as long as the latin1 encoding is used when writing the file, otherwise it will balloon in size. Most notably you can't import a latin1-encoded JS file, so this is not the most appropriate encoding for shipping a binary blob with your library.
  • base128: it uses the first 128 out of 256 characters that can be encoded with 1 byte, it's therefore not the most memory efficient encoding and it still produces human-unfriendly strings, but it's a subset of both latin1 and utf-8, that means it's the most efficient encoding for embedding a binary blob with your library in a JS file that you can just import without issues.

Install

npm install --save base128-encoding

Usage

import Base128 from 'base128-encoding';

// Uint8Array encoding & decoding

{
  const raw = 'Hello ๐Ÿ˜ƒ';
  const uint8 = new TextEncoder ().encode ( raw );
  console.log ( uint8 ); // => Uint8Array(10) [ 72, 101, 108, 108, 111,  32, 240, 159, 152, 131 ]

  const encoded = Base128.encode ( uint8 );
  console.log ( encoded ); // => '$\x19-Fc<ApOf\x100\x04'

  const decoded = Base128.decodeStr ( encoded );
  console.log ( decoded ); // => // => Uint8Array(10) [ 72, 101, 108, 108, 111,  32, 240, 159, 152, 131 ]
}

// String encoding & decoding

{
  const raw = 'Hello ๐Ÿ˜ƒ';
  const encoded = Base128.encodeStr ( raw );
  console.log ( encoded ); // => '$\x19-Fc<ApOf\x100\x04'

  const decoded = Base128.decodeStr ( encoded );
  console.log ( decoded ); // => 'Hello ๐Ÿ˜ƒ'
}

// Check if a string is base128-encoded

{
  console.log ( Base128.is ( 'Hello' ) ); // => true
  console.log ( Base128.is ( '๐Ÿ˜ƒ' ) ); // => false
}

License

MIT ยฉ Fabio Spampinato

base128-encoding's People

Contributors

fabiospampinato avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

base128-encoding's Issues

Streaming version

Hi, I have to encode large binary buffers and found out that using base128 give me the best results in terms of disk space (UTF-8).
However, I can't just send the whole buffer because Chrome crashes (out of memory), so I have to stream the buffer in chunks. Concatenating Buffer128.encode(chunk) does not work so I was thinking if you see possible a streamed version of the encoder/decoder.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.