Giter Site home page Giter Site logo

teletext-archive's Introduction

archive of german online teletexts

Or videotext, as we used to call it.

DEPRECATED: Collecting raw html files every 30 minutes is just too much:

  • for github: repo size is 800 mb after only 3 weeks
  • for parsing: it takes 6 single-thread hours to beautiful-soup through all files in each commit

A slimmer version runs at teletext-archive-unicode

Below is historical

------8<------8<------8<------8<------8<------

This repo exists mainly because it's just possible to scrape those online teletexts with github actions. And, you know, interesting stuff might evolve from historic beholding.

The data is collected raw in docs/snapshots. Each commit adds, overwrites or removes the individual files of each teletext page.

scraped stations:

station since type link
3sat 2022-01-28 html with font-map https://blog.3sat.de/ttx/
ARD 2022-01-28 html https://www.ard-text.de/
NDR 2022-01-27 html https://www.ndr.de/fernsehen/videotext/index.html
n-tv 2022-01-28 json https://www.n-tv.de/mediathek/teletext/
SR 2022-01-28 html https://www.saartext.de/
WDR 2022-01-28 html https://www1.wdr.de/wdrtext/index.html
ZDF 2022-01-27 html https://teletext.zdf.de/teletext/zdf/
ZDFinfo 2022-01-27 html https://teletext.zdf.de/teletext/zdfinfo/
ZDFneo 2022-01-27 html https://teletext.zdf.de/teletext/zdfneo/

related stuff

Oh boy, look what else exists on the web:

TODO

beyond the borders

teletext-archive's People

Contributors

defgsus avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.