ashton-sidhu / aethos Goto Github PK

Automated Data Science and Machine Learning library to optimize workflow.

License: GNU General Public License v3.0

Python 100.00%

data-scientists engineers machine-learning-engineers analysts machine-learning machinelearning data-science data-visualization automl automation

aethos's Introduction

aethos's People

Contributors

Stargazers

Watchers

Forkers

vitasiku evgeny007 kostaschar nperera0 bacoco sghosh1999 eavidan peterhamfelt

aethos's Issues

Parallel Computing with native multiprocess

Investigate not using pathos and using native multiprocessing:

since i'm forming the function as a partial function, its an instance made at runtime as is a pyautoml.Model.function but needs to be pyautoml.modelling.model.Model.function instance

Add self.use_full_dataset condition to imputer values based off training set to ...

https://github.com/Ashton-Sidhu/py-automl/blob/3cd09a308f752020c5b39c1b544ed167e7f1f4d4/cleaning/numeric.py#L11-L14

This issue was generated by todo based on a `TODO:` comment in `3cd09a3`.

FileNotFoundError: [Errno 2] No such file or directory: '/home/<user>/.aethos/config.yml'

Describe the bug
After I did 'pip install aethos', I tried to create a data science folder structure with 'aethos create'. I also just tried to import using 'import aethos as at' and both yielded the same error:

FileNotFoundError: [Errno 2] No such file or directory: '/home//.aethos/config.yml'

It seems that aethos is looking in the wrong place for the config file. Everything is stored in my Anaconda distro:

To Reproduce
Try either 'aethos create' or importing in Jupyter Notebook (I have Jupyter 6.0.1, if that matters).

Expected behavior
I didn't expect to receive an error. I wanted to work with this promising package.

Screenshots

Desktop (please complete the following information):

OS: Linux pop-os 5.3.0-7625-generic
Browser: Mozilla Firefox 72.0.1

Standardize function documentation to Numpy format

Some are in Numpy, some are in Google and some are in this random format.

Add data validation on the custom_cols argument to make sure it is float or int

https://github.com/Ashton-Sidhu/py-automl/blob/106a7f8482881be3e921c8987f58e5c809313f36/cleaning/numeric.py#L9-L14

This issue was generated by todo based on a `TODO:` comment in `106a7f8`.

Add full dataset to data_properties object

This allows us to pass one object from state to state.

Implement KNN, and replacing with most common category

https://github.com/Ashton-Sidhu/py-automl/blob/b3c335d2b43a38a5839915c6f95dc2747799569a/cleaning/categorical.py#L6-L11

This issue was generated by todo based on a `TODO:` comment in `b3c335d`.

Figure out a better way to identify text that is categorical

Add conditional validation based off list_of_cols, data, train_data and test_dat...

https://github.com/Ashton-Sidhu/py-automl/blob/88c0db66a7327485e389611c7c30ed7f0f8879e2/cleaning/numeric.py#L14-L19

This issue was generated by todo based on a `TODO:` comment in `88c0db6`.

Implement a 'project metric'

Implement a project metric where a user can chose a metric(s) that they want to compare there models against. This should be shown when a user runs compare_models

Improve import times

Improve the time it takes to load the library.

Add experiment tracking

Add Numeric Validation

https://github.com/Ashton-Sidhu/py-automl/blob/106a7f8482881be3e921c8987f58e5c809313f36/data-validation/datavalidation.py#L6-L9

This issue was generated by todo based on a `TODO:` comment in `106a7f8`.

Find way to autoname reports

Log the input type classification as a report.

https://github.com/Ashton-Sidhu/py-automl/blob/6f0395bb2f15a4af83993ed8b5a3824eee2dc3e6/cleaning/utils.py#L87-L90

This issue was generated by todo based on a `TODO:` comment in `6f0395b`. It's been assigned to @Ashton-Sidhu because they committed the code.

Make functions adhere to Google coding standard

Implement Dask as normal back end computational framework

For at home labs/on prem clusters

Move making the directory on first time run to some config file

https://github.com/Ashton-Sidhu/py-automl/blob/25c48182719c29a0fc0888253b069c9b3230ce9f/report/report.py#L10-L15

This issue was generated by todo based on a `TODO:` comment in `25c4818` when #15 was merged.

Use Scikit Learns Column Transformer instead of reconstructing/concatting dataframes after applying Scikit Learn transformations

OSError when installing or creating folder structure

Describe the bug
After I did 'pip install aethos', I attempted the following

creating DS folder structure with 'aethos create'
installing NLP corpora with 'aethos install-corpora,
Enabling extensions with 'aethos enable-extensions'

All yielded the same error message

OSError: dlopen(/usr/local/lib/python3.7/site-packages/lightgbm/lib_lightgbm.so, 6): Library not loaded: /usr/local/opt/libomp/lib/libomp.dylib
Referenced from: /usr/local/lib/python3.7/site-packages/lightgbm/lib_lightgbm.so
Reason: image not found

To Reproduce
try installing, creating or enabling aextensions
Expected behavior
I wasn't expecting any error.I wanted to follow your example that you posted on Medium

OS: MacOS
Browser: Safari/Chrome

Add customization to BoW and TF-IDF through the parameters of the constructor

https://github.com/Ashton-Sidhu/py-automl/blob/098cc134f9b93cf00245c79dd8a0025e4a56e1cf/feature-extraction/text.py#L7-L10

This issue was generated by todo based on a `TODO:` comment in `098cc13`.

Figure out a better way to identify text that is not categorical

Remove pandas dependency and only use it for the "Experimental" code generation

aethos install-corpora failing to find python3 command on windows

Describe the bug
I ran the command >aethos install-corpora to setup but got the below error.
'python3' is not recognized as an internal or external command,
operable program or batch file.
'python3' is not recognized as an internal or external command,
operable program or batch file.
'python3' is not recognized as an internal or external command,
operable program or batch file.

To Reproduce
Steps to reproduce the behavior:

Setup a new virtual environment on Windows 2
pip install aethos
pip install aethos[ptmodels]
aethos install-corpora (errors out as above)

Windows does not have python3 as a command and has python which can be used to execute python program.

Expected behavior
Installatoon of corpora packages.

Desktop (please complete the following information):

OS: Windows 10
Anaconda prompt cmd line execution
python 3.8

Documentation of how to proceed post aethos create execution

Is your feature request related to a problem? Please describe.
I was referring to the installation documentation and I was not clear how to proceed once you execute aethos create. Also, documentation of what aethos create does is not very clear.

Describe the solution you'd like
Description of what aethos create command does so that it is clear for a user what to do next. Post execution of this command successfully, what is the next step that a user should do is again missing.

Describe alternatives you've considered
None

Additional context
None

Remove .apply and replace with Series vectorization

Implement multi processing running of models """

https://github.com/Ashton-Sidhu/py-automl/blob/6ea81da3e50bb91e7a1e3c0ab1ac745fbe81b682/pyautoml/modelling/model.py#L133-L138

This issue was generated by todo based on a `TODO` comment in `6ea81da` when #85 was merged. cc @Ashton-Sidhu.

Add String validation

https://github.com/Ashton-Sidhu/py-automl/blob/106a7f8482881be3e921c8987f58e5c809313f36/data-validation/datavalidation.py#L7-L9

This issue was generated by todo based on a `TODO:` comment in `106a7f8`.

Implement a better dev/test test split

Make it so that the dev and test set follow the same distribution.

Improve this detection function and to detect numeric categorical data

https://github.com/Ashton-Sidhu/py-automl/blob/6cb876c00b2e39f31423c57527ef498525341077/cleaning/utils.py#L30-L33

This issue was generated by todo based on a `TODO:` comment in `6cb876c`. It's been assigned to @Ashton-Sidhu because they committed the code.

Implement KNN, Interpolation, Extrapolation, Hot-Deck imputation for replacing m...

https://github.com/Ashton-Sidhu/py-automl/blob/88c0db66a7327485e389611c7c30ed7f0f8879e2/cleaning/numeric.py#L13-L18

This issue was generated by todo based on a `TODO:` comment in `88c0db6`.

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

ashton-sidhu / aethos Goto Github PK

aethos's Introduction

Aethos

Table of Contents

Introduction

Usage

Options

Analysis

Modelling

Running a Single Model

Running multiple models in parallel

Using Pretrained Models

Model Interpretability

Code Generation

Installation

Development Phases

Phase 1

Phase 2

Phase 3

Phase 4

Phase 5

Phase 6

Feedback

Contributors

Sponsors

Acknowledgments

For Developers

aethos's People

Contributors

Stargazers

Watchers

Forkers

aethos's Issues

This issue was generated by todo based on a TODO: comment in 3cd09a3.

This issue was generated by todo based on a TODO: comment in 106a7f8.

This issue was generated by todo based on a TODO: comment in b3c335d.

This issue was generated by todo based on a TODO: comment in 88c0db6.

This issue was generated by todo based on a TODO: comment in 106a7f8.

This issue was generated by todo based on a TODO: comment in 6f0395b. It's been assigned to @Ashton-Sidhu because they committed the code.

This issue was generated by todo based on a TODO: comment in 25c4818 when #15 was merged.

This issue was generated by todo based on a TODO: comment in 098cc13.

This issue was generated by todo based on a TODO comment in 6ea81da when #85 was merged. cc @Ashton-Sidhu.

This issue was generated by todo based on a TODO: comment in 106a7f8.

This issue was generated by todo based on a TODO: comment in 6cb876c. It's been assigned to @Ashton-Sidhu because they committed the code.

This issue was generated by todo based on a TODO: comment in 88c0db6.

Recommend Projects

Recommend Topics

Recommend Org

This issue was generated by todo based on a `TODO:` comment in `3cd09a3`.

This issue was generated by todo based on a `TODO:` comment in `106a7f8`.

This issue was generated by todo based on a `TODO:` comment in `b3c335d`.

This issue was generated by todo based on a `TODO:` comment in `88c0db6`.

This issue was generated by todo based on a `TODO:` comment in `106a7f8`.

This issue was generated by todo based on a `TODO:` comment in `6f0395b`. It's been assigned to @Ashton-Sidhu because they committed the code.

This issue was generated by todo based on a `TODO:` comment in `25c4818` when #15 was merged.

This issue was generated by todo based on a `TODO:` comment in `098cc13`.

This issue was generated by todo based on a `TODO` comment in `6ea81da` when #85 was merged. cc @Ashton-Sidhu.

This issue was generated by todo based on a `TODO:` comment in `106a7f8`.

This issue was generated by todo based on a `TODO:` comment in `6cb876c`. It's been assigned to @Ashton-Sidhu because they committed the code.

This issue was generated by todo based on a `TODO:` comment in `88c0db6`.