Giter Site home page Giter Site logo

thepigeonoftime / german-gov-domains Goto Github PK

View Code? Open in Web Editor NEW

This project forked from robbi5/german-gov-domains

0.0 0.0 0.0 573 KB

An incomplete listing of german government domains

License: Creative Commons Zero v1.0 Universal

Ruby 70.34% Python 6.50% Makefile 23.17%

german-gov-domains's Introduction

German Government Domains

An incomplete listing of german government domains (and the code for the scraper used to build the list).

You can download the list as a .csv file or view it with github's pretty formatting.

We try to use the same format as the US GSA (example), so the CSV file has a header of Domain Name,Domain Type,Agency,City,State and currently contains government agencies and cities.

Variants

If you only want a subset of the available data, variants filtered by Domain Type are provided:

Why?

There currently isn't a publicly available directory of all the domain names registered by the german government and its agencies. Such a directory would be useful for people looking to get an aggregate view of government websites and how they are hosted. For example, Ben Balter has been doing some great work analyzing the official set of US .gov domains.

This is by no means an official or a complete list. It is intended to be a first step toward a better understanding of how the government is managing its official sites.

What can I do with it?

  • Plug the CSV into 18F/domain-scan to get more data (like HTTPS support) about the domains
  • Check the IPv6 reachability
  • Test if the sites are reachable even without the www. subdomain
  • ...?

How to update

The list is populated by scrapers and static files and merged by a makefile. To run the process yourself, checkout this repository and run:

bundle install
make

After everything ran, you can look into data/domains.csv.

Scrapers and Sources

Contributing

I'd love to have some help with this! Please feel free to create an issue or submit a pull request if you notice something that can be better. Specifically, suggesting additional pages we can scrape and domains that are either not found or have incorrect organization names associated with them would be very helpful.

Ideas

Thanks

Thanks to @esonderegger for the dotmil domains project that served as an template for this repo.

german-gov-domains's People

Contributors

codedust avatar corvusmo avatar deknos avatar derhuerst avatar lucaswerkmeister avatar martinhartwig avatar robbi5 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.