Quack

An enhanced ALTO viewer for quality-assurance-oriented display of collections of scans, typically from books or newspapers.

Features

  • Smooth zoom & pan of large images thanks to OpenSeadragon
  • Marking of blown highlights & lowlights by colored overlays (toggleable)
  • TextBlock marking by boxes (toggleable)
  • Interactive inspection of OCR (hover the mouse over a TextBlock)
  • Grid lines for checking skewing and rotation (toggleable)
  • Folder overview with thumbnails
  • Histogram, optionally excluding the edges of the image from the analysis
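
As a rough illustration of the highlight and lowlight check, the sketch below computes the share of pixels at the darkest and brightest levels from a grayscale histogram. It is not taken from the Quack script: the function name `blown_share` and the two-column `level count` input format are assumptions made for this example, and converting ImageMagick's histogram output into that form is left out.

```bash
#!/usr/bin/env bash
# Sketch only: report how many pixels sit at the extreme ends of an
# 8-bit grayscale histogram. Input on stdin: one "level count" pair per
# line (hypothetical format, not Quack's own).

blown_share() {
    awk '
        { total += $2 }                 # sum all pixel counts
        $1 == 0   { low  += $2 }        # pure black: candidate lowlights
        $1 == 255 { high += $2 }        # pure white: candidate blown highlights
        END {
            if (total == 0) {
                print "lowlights=0.00% highlights=0.00%"
                exit
            }
            printf "lowlights=%.2f%% highlights=%.2f%%\n",
                   100 * low / total, 100 * high / total
        }'
}
```

For example, `printf '0 10\n128 80\n255 10\n' | blown_share` reports 10% of the pixels at each extreme.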

Quack screenshot 20131127

Requirements

  • At least 2 GB of free RAM for processing 30 MP scans
  • bash
  • GraphicsMagick
  • ImageMagick (as GraphicsMagick cannot create histograms)
  • openseadragon.min.js
  • A suitably powerful machine with a browser for display, depending on image sizes and ALTO complexity
  • deepzoom (only if tile based display is enabled)
  • Perl (required by deepzoom)

Verbiage

This is basically a simple bash script that grew to 600+ lines. It works on a collection of images with corresponding ALTO files containing OCR and segmentation markup. It creates an HTML page for each image and uses the relevant parts of the ALTO files to create TextBlock overlays with OCR inspection. No web server is required, as the pages can be opened directly from the file system.
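
The step from ALTO markup to overlays can be sketched as follows. This is a minimal illustration, not the code used by Quack: the helper names `attr` and `alto_overlays` are hypothetical, each `<TextBlock>` start tag is assumed to sit on a single line with `ID`, `HPOS`, `VPOS`, `WIDTH` and `HEIGHT` attributes, and the ALTO measurement unit is assumed to match the page width that is passed in. Coordinates are divided by the page width because OpenSeadragon's viewport coordinates treat the image width as 1.0.

```bash
#!/usr/bin/env bash
# Sketch only: list TextBlock rectangles from an ALTO file as
# "id x y width height", scaled so that the page width equals 1.0.

# Print the value of attribute $2 found in the tag text $1.
attr() {
    sed -n "s/.*[[:space:]]$2=\"\([^\"]*\)\".*/\1/p" <<< "$1"
}

# $1: ALTO file, $2: page width in the same unit as HPOS/VPOS.
alto_overlays() {
    local alto_file="$1" page_width="$2" tag
    grep -o '<TextBlock[^>]*>' "$alto_file" | while IFS= read -r tag; do
        awk -v id="$(attr "$tag" ID)" \
            -v x="$(attr "$tag" HPOS)"  -v y="$(attr "$tag" VPOS)" \
            -v w="$(attr "$tag" WIDTH)" -v h="$(attr "$tag" HEIGHT)" \
            -v pw="$page_width" \
            'BEGIN { printf "%s %.4f %.4f %.4f %.4f\n", id, x/pw, y/pw, w/pw, h/pw }'
    done
}
```

Calling `alto_overlays page.alto.xml 2000` on a 2000-unit-wide page prints one line per TextBlock, which a page generator could then turn into overlay definitions.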

Displaying large images with OpenSeadragon can be quite heavy for the browser when there are many ALTO-derived overlays. This is especially true if tiling is not used. Display has been tested with Firefox and Chrome; Chrome was markedly faster as of 2013-09-17.
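
To give a feel for what tiling means for a large scan, the sketch below estimates the size of a Deep Zoom pyramid. It relies on the general Deep Zoom layout, in which each level halves the resolution of the one above it, and is not derived from Quack or the deepzoom script. The function names are hypothetical and the 256-pixel tile size is an assumed default.

```bash
#!/usr/bin/env bash
# Sketch only: estimate Deep Zoom pyramid dimensions for an image.

# Highest pyramid level: the smallest n with 2^n >= max(width, height).
dzi_max_level() {
    local w="$1" h="$2" max level=0
    max=$(( w > h ? w : h ))
    while (( (1 << level) < max )); do
        level=$(( level + 1 ))
    done
    echo "$level"
}

# Number of tiles at full resolution; tile size defaults to 256 (assumption).
dzi_tile_count() {
    local w="$1" h="$2" tile="${3:-256}"
    echo $(( ((w + tile - 1) / tile) * ((h + tile - 1) / tile) ))
}
```

For a 6000x5000 pixel scan (30 MP), `dzi_max_level 6000 5000` gives 13 and `dzi_tile_count 6000 5000` gives 480, so with tiling the browser only needs to fetch the few tiles covering the current view instead of the whole image.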

Development

Developed primarily by Toke Eskildsen; partly as a personal project, partly as an employee at the State and University Library, Denmark.
