Comments (6)
I belefie the issue here is to understand the fundamental of Automatie ML methods, whcih is A transformation acts on a single table (thinking in terms of Python, a table is just a Pandas DataFrame ) by creating new features out of one or more of the existing columns. Like many topics in machine learning, automated feature engineering is a complicated concept built on simple ideas.
Using concepts of entitysets, entities, and relationships, featuretools can perform deep feature synthesis to create new features.
Deep feature synthesis in turn stacks feature primitives — aggregations, which act across a one-to-many relationship between tables, and transformations, functions applied to one or more columns in a single table — to build new features from multiple tables.
read more with basic example here
https://towardsdatascience.com/automated-feature-engineering-in-python-99baf11cc219
from open_source_demos.
Hi @shellwang ,
Can you provide us more details about your goals?
As Max says, you need more related tables to extract this kind of features.
from open_source_demos.
@billy-odera can you provide an example of a feature you would expect to get created using just that one table?
from open_source_demos.
@kmax12 This is the dataframe
customerid age outflows_amout inflows_amount
1 28.00 0 355.00
2 72.00 1 240.00
3 22.00 6 nan
I would expect to get count.outflow_amount, mean,skew etc
from open_source_demos.
@billy-odera not sure i follow your example. if you want to calculate the mean outflows_amount per customer, you would want to create a second entity for your customers that has a relationship to a the entity with multiple rows per customer with different outflow_amounts. let me know if that's helpful or please provide a complete example of what you want to generate so I can better help.
from open_source_demos.
Yes. I encounter the same problem.
from open_source_demos.
Related Issues (20)
- File Not Found Error HOT 11
- Hotfix: Broken link HOT 1
- Update notebook to use Featuretools Dask Implementation HOT 1
- AssertionError: target columns not found HOT 2
- fails to install for windows 10 computer and python 3.5.2 HOT 1
- Broken link to data HOT 1
- Credit Card Churn notebook does not have print outs
- where_primitives that are also not specified under agg_primitives don't get used and hence result in warnings.warn(warning_msg, UnusedPrimitiveWarning) HOT 1
- Update demos with Featuretools 1.0
- Problem with non-ASCII character in csv HOT 1
- NameError: name 'data_dir' is not defined HOT 2
- IndexError: list index out of range HOT 7
- ImportError: cannot import name 'infer_feature_types' from 'evalml.utils.gen_utils' HOT 1
- data leakage in predict_next_purchases HOT 2
- Add link to Featuretools Time Series guide back into Daily Temperature 2 - Featuretools Solution notebook
- Remove make_agg and make_trans functions from demos
- ModuleNotFoundError: No module named 'woodwork.serialize' HOT 3
- Attribute Error
- module 'dask' has no attribute 'config' HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from open_source_demos.