Giter Site home page Giter Site logo

summarizer's Introduction

summarizer

This summarizer intends to sum up pictures of annotated pdfs. It should group together all strokes which are in a spatial threshold and select only the siding text. For instance with the following image (red arrows are not part of the image and are here to show the strokes) :

image of a text with strokes on its left

We should sum it up as :

A l'opposé le Nord et l'Est de la Seine-Saint-Denis cumulent les nombreux handicaps sociaux et résidentiels. On retrouve aussi ces difficultés en deuxième couche de la basse Seine (Les Mureaux, Mantes-La-Jolie), dans certaines villes nouvelles (Cergy, Trappes, Evry, Grigny) et dans les villes secondaires,

Les processus de renforcement des ségrégations concernent aujourd'hui l'ensemble de l'Île-de-France comme l'indique la comparaison départementale. Du fait des blocages sociaux et résidentiels de ces dernières

Du fait de l'énorme bulle immobilière spéculative des dernières décennies, qui touche en particulier Paris et une partie de la première couronne, mais qui se répercute mécaniquement sur l'ensemble de l'espace régional

To-do

implement the grouping algorithm that would group and seperate strokes and text

Inspiration

As I want to distinguish shape versus text in hand-drawn strokes Using Entropy to Distinguish Shape Versus Text in Hand-Drawn Diagrams rose my interest. Yet it doesn't tell what the grouping algorithm they use to group only strokes which are a part of the drawings or the letter.

summarizer's People

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.