Giter Site home page Giter Site logo

algoseeknotebooks's Introduction

Algorithmic Trading with Python and AlgoSeek on AWS

Introduction

This is a collection of notebooks, recipes, and scripts demonstrating how to use AlgoSeek as a data provider for Quantitative Finance, Algorithmic Trading, and Machine Learning. There are samples covering everything from data ingestion (real-time and batch), stock/universe selection, backtesting, and feature engineering. Additionally, there are examples showing how to manipulate financial data with AWS EMR, PySpark, and AWS Sagemaker for end-to-end ML pipelines and intraday strategy research.

1) Overview

The notebooks in the root of this repo are the starting point. These are the broad introductions to sections detailed in the notebooks portion. The rest of the subdirectories are as follows

Directory Description
algoseek Library for algoseek-specific functions
data data folder (gitignored but created in first notebook
Datasets Samples and descriptions of AlgoSeeks Datasets
eda Exploratory Data Analysis and Data Visualizations
integrations Using AlgoSeek with external data sources
ML Machine Learning Scripts
Notebooks Miscellaneous Notebooks
Strategies Trading Strategies
WIP Work In Progress Notebooks

2) AlgoSeek Datasets

Here are dataset-specific notebooks exploring data for daily and intraday frequencies.

2.1) Equity Market Data:

These datasets contain the actual stock movement data.

Dataset Description
BasicOHLCDaily
BasicAdjustedOHLCDaily
PrimaryOHLCDaily
PrimaryAdjustedOHLCDaily
StandardOHLCDaily
StandardAdjustedOHLCDaily
TradeAndQuote
TradeAndQuoteMinuteBar
TradeAndQuoteMinuteBarExcludingTRF
TradeOnly
TradeOnlyAdjusted
TradeOnlyAdjustedMinuteBar
TradeOnlyAdjustedMinuteBarBBG
TradeOnlyAdjustedMinuteBarExcludingTRF
TradeOnlyMinuteBar
TradeOnlyMinuteBarBBG
TradeOnlyMinuteBarExcludingTRF

2.2) Equity Reference Data:

These datasets provide more information about the securities and the Equity datasets.

Dataset Description
BasicAdjustments
DetailedAdjustments
LookupBase
SecMasterBase

3) Data Access

There are currently two different ways to access the data: using the AlgoSeek SDK or using Boto3. There are notebooks for both methods here, but you should start with the introductions for both.

4) Time Series Analysis

5) Getting Started with Machine Learning

Introductions to using AlgoSeek datasets with several popular libraries

Some sample machine learning models to get you started

Model Frequency Description
Keras Univariate LSTM Intraday Intraday Regression
Linear Regression Intraday
LightGBM Regression Intraday
Random Forest Regressor Intraday
XGBoost Intraday
Model Frequency Description
LightGBM Regressor Intraday [Link](ml/mlflow/lightgbm/

5) Strategies

Strategy Frequency Description

algoseeknotebooks's People

Contributors

julianwileymac avatar angelmvhill avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.