
feat's Introduction

Hi there 👋

My name is Tim. I clean and analyze data, and like to build tools to make that easier. I work at Wave. I'm at home programming in R, but I also really enjoy Python.

  • 🔭 I’m currently working on feat, a Python package to help with feature importances in sklearn.
  • ⏭️ Up next, I'm updating my AWS sagemaker to make it easy to productionize machine learning models in R.
  • 🌱 I’m currently learning the IPython API and auto-completion for siuba.
  • 📖 I recently learned Hugo and updated the layout of my blog.
  • 💬 Ask me about AWS and R.
  • 📫 How to reach me: @timmastny on Twitter, https://timmastny.com/about/
  • 😄 Pronouns: he/him

feat's People

Contributors

tmastny


feat's Issues

feature_selection cases

Single column transformations make sense

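from sklearn.feature_selection import VarianceThreshold
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

make_pipeline(
    StandardScaler(),
    VarianceThreshold()
)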

But how would the next case work? I don't think my current logic handles it, because OneHotEncoder turns one input column into several output columns. I'm guessing I could get a mask from the output of OneHotEncoder. I need to think more carefully about my pipeline logic.

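from sklearn.feature_selection import VarianceThreshold
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder

make_pipeline(
    OneHotEncoder(),
    VarianceThreshold()
)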

Test PCA with SelectorMixin

Good reason not to nest output.

If we nested the output of multi-column to multi-column transformations:

name               | feature
-----------------------------
["c1", "c2", "c2"] | ["pca1", "pca2", "pca3"]

Then it doesn't easily work with a `SelectorMixin`, because we need to explode it anyway to remove one of the features. So we want

name               | feature
-----------------------------
["c1", "c2", "c2"] | "pca1"
["c1", "c2", "c2"] | "pca2"
["c1", "c2", "c2"] | "pca3"

even if it seems a little redundant. Although we could always nest at the end of the transformation.
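
A small pandas sketch of the same idea, assuming the name/feature table is held in a DataFrame (the issue does not say how feat stores it). The values are copied from the tables above, and the mask is a made-up stand-in for what a selector's `get_support()` would return.

```python
import pandas as pd

# Nested form: one row, all PCA outputs packed into a single list.
nested = pd.DataFrame({
    "name": [["c1", "c2", "c2"]],
    "feature": [["pca1", "pca2", "pca3"]],
})

# Exploded form: one row per output feature, each row keeping
# the full list of input names.
exploded = nested.explode("feature", ignore_index=True)

# Hypothetical selector mask: drop the second component.
mask = [True, False, True]
selected = exploded[mask]
```

With one row per output feature, the mask lines up with the rows directly; in the nested form the list inside the cell would have to be unpacked first.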
