Giter Site home page Giter Site logo

rfc-analysis's Introduction

Analyzing RFCs and I-Ds

This project is developing code for the automated analysis of the text of Requests for Comment (RFCs) published by the Internet Engineering Task Force, as part of a larger research project studying privacy in technical standard-setting.

For more information, if you want to use these tools or collaborate on their development, please contact Nick Doty.

Some basic graphs produced with this code are available online.

Usage

Scripts are not fully parameterized or user friendly. Current usage pattern:

  • clone the repository
  • download all RFCs (see "Getting the documents" below) as .txt into a RFC-all directory within the main directory of the repository
  • configure by copying config.ini.example to config.ini and pointing it to your downloaded RFCs
  • python search.py --rfc will create a file rfc-search.json with section titles and lengths and word search counts for every available RFC

Other functionality:

  • search.py can do basic string matching against all RFCs (or similar code for all W3C TRs)
  • search.py --id does the same parsing for Internet-Drafts if you've rsynced them (and added that directory to your config.ini)
  • the graphs/ directory contains d3.js visualizations of some of the measurements

Getting the documents

There are several thousand RFCs and many more drafts and other IETF docs. You can download some or all of those documents for easier local analysis.

Rsync all the documents via ietf-cli

Clone the ietf-cli, add the config file to an appropriate location (and specify where you want all the documents synced) and run ./ietf mirror to download all RFCs, drafts and some minutes and other documents. It's more than 2 GB of data and takes at least a few minutes to download.

Just download the RFCs

The RFC Editor maintains zip and tar files of all the RFCs, in TXT and PDF formats, for download with your browser. The compressed RFC-all.zip file is a couple hundred megabytes.

See also

rfc-analysis's People

Contributors

npdoty avatar sbenthall avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

Forkers

nllz sbenthall

rfc-analysis's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.