Comments (2)
We will also need to determine whether we can create an Airflow task that exports all ledger entries to BigQuery even if they are all in the same file, or whether we will need to separate the changes by type or batch.
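If the changes do need to be separated by type, one possible approach is to group the records after export. The sketch below assumes each exported line is a JSON object and that a field named `entry_type` identifies the kind of ledger entry; both the field name and the function `split_changes_by_type` are hypothetical and not taken from stellar-etl.

```python
import json
from collections import defaultdict


def split_changes_by_type(lines, type_field="entry_type"):
    """Group JSON-lines change records by their entry type.

    `type_field` is a hypothetical field name; adjust it to whatever
    the exported records actually contain.
    """
    grouped = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines in the export file
        record = json.loads(line)
        grouped[record[type_field]].append(record)
    return dict(grouped)
```

Each resulting group could then be loaded into its own BigQuery table by a separate task, while the export itself still writes a single file.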
from stellar-etl.
DAG organization
- History archive DAG
  - Exports ledgers, transactions, and operations
  - Loads exported files into Google Cloud Storage (GCS)
  - Loads GCS files into BigQuery (appending new rows)
- Long Running Core DAG
  - Exports accounts, trustlines, and offers
  - All in a single command; a long-running stellar-core instance exports incrementally
- Changes Export DAG
  - File sensors watch the location where ledger entry changes are exported in batches
  - As new files are created, they are uploaded to BigQuery and deleted from local storage
  - Files are used to update BigQuery tables (can create new rows, update existing rows, or delete rows)
- Catchup DAG
  - Exports accounts, trustlines, and offers
  - Everything in separate commands; uses the bucket list to catch up
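The Changes Export DAG flow (pick up new batch files, apply creates, updates, and deletes, then remove the local file) can be sketched in plain Python. This is only an illustration under stated assumptions: the table is modeled as an in-memory dict rather than a BigQuery table, batch files are assumed to be JSON lines ending in `.json`, and the field names `id`, `change_type`, and `row` are hypothetical.

```python
import json
import os


def apply_changes(table, changes):
    """Apply change records to `table`, a dict keyed by entry id.

    Assumed (hypothetical) record shape:
    {"id": ..., "change_type": "created" | "updated" | "removed", "row": {...}}
    """
    for change in changes:
        key = change["id"]
        if change["change_type"] == "removed":
            table.pop(key, None)  # delete the row if it exists
        else:
            table[key] = change["row"]  # create or update (upsert)
    return table


def process_batch_files(directory, table):
    """Apply every batch file in name order, then delete it locally."""
    processed = []
    for name in sorted(os.listdir(directory)):
        if not name.endswith(".json"):
            continue
        path = os.path.join(directory, name)
        with open(path) as f:
            changes = [json.loads(line) for line in f if line.strip()]
        apply_changes(table, changes)
        os.remove(path)  # mirrors "deleted from local storage"
        processed.append(name)
    return processed
```

In an actual Airflow deployment, a file sensor would trigger the equivalent of `process_batch_files`, and the upsert and delete steps would run against BigQuery instead of a dict. Processing files in name order matters here because a later batch may update or remove a row created by an earlier one.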
Related Issues (20)
- [2](2) Add history_operations_id field to trades
- [5](5) Change transformation errors to warnings
- [5](5) Fix infinite loop when getting ledger range from times
- [5](previous-3) Add export_orderbooks command
- Add automated testing
- Refactor `transform` code to use functions from Horizon
- Look into if stellar-etl doesn't need to create a file when exporting data
- investigate and fix cmd unit tests
- Broken Dockerfile
- Add feature for Exporter Interface(GCS, S3, Kafka and etc)
- Use local buckets
- [8](8) Dockerize CLI
- Investigate having a mandatory long-running stellar core
- [0.5](0.5) Add base and counter offer ids to trades
- [2](3) Update transforms to handle deleted entries
- [3](1) Check permissions of output files
- [1](1) Add get_ledger_range_from_times command
- [2](2) Fix infinite loop when converting early times
- [2](2) Change null slice to empty slices