Giter Site home page Giter Site logo

elopezaguilera / automl Goto Github PK

View Code? Open in Web Editor NEW

This project forked from kxsystems/automl

0.0 1.0 0.0 85 KB

Automated Machine Learning Framework for kdb+

License: Apache License 2.0

q 88.95% Batchfile 1.73% OpenEdge ABL 0.31% Dockerfile 1.97% Raku 5.99% Perl 1.06%

automl's Introduction

Automated machine learning in kdb+

GitHub release (latest by date) Travis (.org) branch

Introduction

The automated machine learning library described here is built largely on the tools available within the machine learning toolkit available here. The purpose of this framework is to provide users with the ability to automate the process of applying machine learning techniques to real-world problems. In the absence of expert machine learning engineers this handles the following processes within a traditional workflow.

  • Data preprocessing
  • Feature engineering and feature selection
  • Model selection
  • Hyperparameter Tuning
  • Report generation and model persistence

Each of these steps is outlined in depth within the documentation for this platform here. This allows users to understand the processes by which decisions are being made and the transformations which their data undergo during the production of the output models.

At present the machine learning frameworks supported for this are based on:

  1. One-to-one feature to target non time-series
  2. FRESH based feature extraction and model production

The problems which can be solved by this framework will be expanded over time as will the available functionality.

Requirements

The following requirements cover all those needed to run the libraries in the current build of the toolkit.

  • embedPy
  • ML-Toolkit

A number of Python dependencies also exist for the running of embedPy functions within both the the machine-learning utilities and FRESH libraries. Install of the requirements can be completed as follows

pip:

pip install -r requirements.txt

or via conda:

conda install --file requirements.txt

Note: Tensorflow and Keras are required for the application of the deep learning models within this platform. However given the large memory requirements of tensorflow the platform will operate without tensorflow by not running the deep learning models. In order to access the full functionality of the interface keras and tensorflow will need to be installed separately by a user.

Installation

Place the library file in $QHOME and load into a q instance using automl/automl.q

This will load all the functions contained within the .ml namespace

$q automl/automl.q
q).automl.loadfile`:init.q

Documentation

Documentation for all sections of the automated machine learning library are available here.

Status

Automated machine learning in kdb+ is still in development and is available here as a beta release, further functionality and improvements will be made to the library in the coming months.

Any issues with the framework should be raised in the issues section of this repository. Functionality suggestions or more general questions should be submitted via email to [email protected]

automl's People

Contributors

cmccarthy1 avatar dianeod avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.