Name: Patrick T. Rourke
Type: User
Company: @LibraryOfCongress
Bio: Software developer at @LibraryOfCongress working on loc.gov, the loc.gov API, and other LC projects; non-LC projects time permitting.
Location: Rockville, MD and Washington, DC
Blog: http://loc.gov/
Patrick T. Rourke's Projects
The data behind the President's 2016 Budget
Akara is an open-source (Apache2 license) Web framework specialized for RESTful data services, especially involving XML and other semi-structured formats. It's implemented in Python and C. You express data models and create transforms based on these data models, plugging required inputs and outputs (XML, JSON, CSV, Atom, etc.) together into pipelines which implement the desired services. You then make these services available on the Web using simple wrappers. The wrappers are based on REST concepts and among other things make it easy to discover and reuse the services, and to connect them to local and remote systems using Web triggers (AKA Web hooks). Akara simplifies extract-transform-load (ETL), data-driven integration of systems, and makes it easy to wrap Web sites and services, turning tag soup and painful APIs into clean, RESTful end-points.
Akara Site information
Amara 2.0. Amara XML toolkit is an open-source collection of Python tools for XML processing, not just tools that happen to be written in Python, but tools built from the ground up to use Python idioms and take advantage of the many advantages of Python over other programming languages.
Tools to analyze web archives
An implementation for Ahmed AlSum's "Thumbnail Summarization Techniques for Web Archives"
Archive Research Services Workshop
A mirror of David J. Perry's Cardo font, a Bembo-inspired font which is useful for scholars
Chef cookbook for Akara
A chef cookbook for installing Viewshare
Crowdsourcing platform for full text transcription and tagging. https://crowd.loc.gov
This repository contains CTSized Ancient Greek Literature texts
Library of Congress Labs Data Exploration Scripts
The Leek group guide to data sharing
django staticfiles bundle of Simile widgets
DROID (Digital Record and Object Identification)
Python module that allows one to easily write and run Hadoop programs.
Scripts for generating html files (which can then be loaded into Calibre to create ebooks) from OPenn and The Digital Walters.
Plotting Assignment 1 for Exploratory Data Analysis
Simile Widgets Exhibit 3 code repository
Akara modules suitable for freemix
trunk
A demo showcasing Require.js with multiple versions of dependencies and jQuery UI widgets (within the Require.js framework)
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
knitpy: Elegant, flexible and fast dynamic report generation with python
Data-Intensive Text Processing with MapReduce
Docker images for Apache/mod_wsgi.