I think what <a class="user-mention notranslate" data-hovercard-type="user" data-hover

I agree this would be a great feature I think what <a c

Add "target" variable to ProfileReport and then add more graphs about ydata-profiling HOT 6 CLOSED

ydataai commented on May 21, 2024

Add "target" variable to ProfileReport and then add more graphs

from ydata-profiling.

Comments (6)

Aylr commented on May 21, 2024

How is this better than what's already there? If the column is categorical you'll get that histogram.

from ydata-profiling.

eamag commented on May 21, 2024

countplot and histogram are different things, we may see different distributions in countplot

from ydata-profiling.

conradoqg commented on May 21, 2024

I think what @eamag meant is that with a countplot you can compare side-by-side the distribution of different categorical fields, where the Y is the count. See https://seaborn.pydata.org/tutorial/categorical.html

One way to implement this feature is to generate a countplot on each categorical field against every other one. I'm sure this will hurt the performance and won't give meaningful value to the user.

In an exploratory process, usually, you need to choose rationally which categorical fields you want to compare (like the Titanic example in the above link).

Creating plots by comparing all categorical fields, like A vs B, B vs C, C vs A (2-fields) or A vs B vs C (3-fields) will create an exponential amount of plots (because it is a combinatory analysis).

In my opinion, we shouldn't implement this feature.

Best

from ydata-profiling.

romainx commented on May 21, 2024

Hello,

I still keep it open it could be studied.

from ydata-profiling.

bensdm commented on May 21, 2024

I agree this would be a great feature

I think what @eamag meant is that with a countplot you can compare side-by-side the distribution of different categorical fields, where the Y is the count. See https://seaborn.pydata.org/tutorial/categorical.html

One way to implement this feature is to generate a countplot on each categorical field against every other one. I'm sure this will hurt the performance and won't give meaningful value to the user.

In an exploratory process, usually, you need to choose rationally which categorical fields you want to compare (like the Titanic example in the above link).

Creating plots by comparing all categorical fields, like A vs B, B vs C, C vs A (2-fields) or A vs B vs C (3-fields) will create an exponential amount of plots (because it is a combinatory analysis).

In my opinion, we shouldn't implement this feature.

Best

I do not understand your point, is it really heavier to plot

instead of plotting ?

from ydata-profiling.

github-actions commented on May 21, 2024

Stale issue

from ydata-profiling.

Add "target" variable to ProfileReport and then add more graphs about ydata-profiling HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent