Giter Site home page Giter Site logo

intro's People

Contributors

epogrebnyak avatar fastali avatar jarovojtek avatar zarak avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

intro's Issues

idea: promote a poll about machine readable data sources in economics

Do you write code that imports macroeconomic time series data?

  • all the time
  • once in a while
  • I don't but people around do a lot
  • I stay away from it
  • I knew how

Do you feel there is more better data available to work with?

  • yes
  • no

Do you use Excel as much as 3 years ago?

  • even more
  • just the same
  • I allocate a bit more time to other software
  • Excel is history now

Do you write code that imports macroeconomic time series data?

  • all the time
  • once in a while
  • I don't but people around do a lot
  • I stay away from it

What public sources do you use?

  • FRED
  • Quandl
  • OECD API
  • IMF API
  • World bank API
  • national statistics agency any
  • oh, here is a cool link: __________________________

How do you import data?

  • R
  • python pandas
  • econometrics package
  • Excel plugin

What is the nature of work you do macroeconomic data"

  • student assignments / excercises
  • descriptive statistics / visualisation
  • modelling/forecasting

Your affiliation:

  • university
  • central bank or government agency
  • bank or financial sector
  • other corporate sector
  • non-profit
  • other

open for comments: data pipeline demo

This file mimics the data pipline from parsers to end-user API:

  1. parsers make data from different sources available
  2. database stores the data
  3. standard (REST) API returns query results
  4. custom end-user API returns json readable by pd.read_json()

Questions and comments are highly appreciated.

additional text about parsers

Definitions:

  • 'source' is a file or API backend published on the web by Rosstat or other agency
    • 'clean source' is something we can truct for its structure, usually an API
    • 'messy source' is something that changes once in a while, eg Word files
  • 'scrapper' is a program that downloads the data without transforming it (download files, unpack from zip/rar)
  • 'parser' is a program that reads raw data and makes 'processed output'
  • 'processed output' is canonical result of parsing, importable to production database

In our pipleine:

  • Scrapper loads Source to Raw Database
  • Parser reads Source from Raw Database layer and produces Processed Output
  • Processed Output is imported into Production Database

Cleaner scrappers/parsers, adressing APIs or XML:

Messy parsers for Rosstat:

Gov interest rate curves:

USD/euro echange rate at ECB:

More source to follow. I think we need a template to describe sources and make sure everyone understands Raw Database layer.

docs: separate documentation

Must separate for easier update:

  • README.md (short entry and quick examples)
  • CHANGELOG.md (updated often)
  • project wiki (rather immutable)

other notes form README.md

Other notes

  • 'maas' - forecasting model as a service (experimental)

  • This pipeline may also apply to other countries macroeconomic data, or other kinds datasets (eg firm-level data, banking statistics).

  • May want to condense all of the above in one repo for tasks in other domains.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.