Giter Site home page Giter Site logo

occrp-attic / exactitude Goto Github PK

View Code? Open in Web Editor NEW
8.0 3.0 2.0 68 KB

Parsing and normalising for identifying text data (emails, domains, phone numbers, dates). Combines external libraries into a coherent API.

License: MIT License

Python 98.78% Makefile 1.22%
dates emails phone-number names countries languages addresses url

exactitude's Introduction

exactitude

Build Status

exactitude contains parsers and validators for common text data types, such as phone numbers, dates, URLs, domain names, email addresses and country names. It can be used to normalize source data before further processing.

Most of the functionality of exactitude is provided by third-party packages which are imported and exposed in a uniform way.

Dates

Dates in exactitude are handled as ISO 8601 prefixes. That means that valid dates can indicate a loss of precision by shortening dates down to either a year (2017), or a year-month combination (2017-02).

If a date is not recognized as a valid ISO 8601 prefix and no format has been specified, exactitude will attempt to recognize the format used.

Name

... In that Empire, the Art of Cartography attained such Perfection that the map of a single Province occupied the entirety of a City, and the map of the Empire, the entirety of a Province. In time, those Unconscionable Maps no longer satisfied, and the Cartographers Guilds struck a Map of the Empire whose size was that of the Empire, and which coincided point for point with it. The following Generations, who were not so fond of the Study of Cartography as their Forebears had been, saw that that vast map was Useless, and not without some Pitilessness was it, that they delivered it up to the Inclemencies of Sun and Winters. In the Deserts of the West, still today, there are Tattered Ruins of that Map, inhabited by Animals and Beggars; in all the Land there is no other Relic of the Disciplines of Geography.

purportedly from Suárez Miranda, Travels of Prudent Men, Book Four, Ch. XLV, Lérida, 1658

exactitude's People

Contributors

jcshea avatar pudo avatar rhiaro avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

reuf pombredanne

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.