Giter Site home page Giter Site logo

Comments (6)

Aylr avatar Aylr commented on May 21, 2024

How is this better than what's already there? If the column is categorical you'll get that histogram.

from ydata-profiling.

eamag avatar eamag commented on May 21, 2024

countplot and histogram are different things, we may see different distributions in countplot

from ydata-profiling.

conradoqg avatar conradoqg commented on May 21, 2024

I think what @eamag meant is that with a countplot you can compare side-by-side the distribution of different categorical fields, where the Y is the count. See https://seaborn.pydata.org/tutorial/categorical.html

One way to implement this feature is to generate a countplot on each categorical field against every other one. I'm sure this will hurt the performance and won't give meaningful value to the user.

In an exploratory process, usually, you need to choose rationally which categorical fields you want to compare (like the Titanic example in the above link).

Creating plots by comparing all categorical fields, like A vs B, B vs C, C vs A (2-fields) or A vs B vs C (3-fields) will create an exponential amount of plots (because it is a combinatory analysis).

In my opinion, we shouldn't implement this feature.

Best

from ydata-profiling.

romainx avatar romainx commented on May 21, 2024

Hello,

I still keep it open it could be studied.

from ydata-profiling.

bensdm avatar bensdm commented on May 21, 2024

I agree this would be a great feature

I think what @eamag meant is that with a countplot you can compare side-by-side the distribution of different categorical fields, where the Y is the count. See https://seaborn.pydata.org/tutorial/categorical.html

One way to implement this feature is to generate a countplot on each categorical field against every other one. I'm sure this will hurt the performance and won't give meaningful value to the user.

In an exploratory process, usually, you need to choose rationally which categorical fields you want to compare (like the Titanic example in the above link).

Creating plots by comparing all categorical fields, like A vs B, B vs C, C vs A (2-fields) or A vs B vs C (3-fields) will create an exponential amount of plots (because it is a combinatory analysis).

In my opinion, we shouldn't implement this feature.

Best

I do not understand your point, is it really heavier to plot
image

instead of plotting ?

image

from ydata-profiling.

github-actions avatar github-actions commented on May 21, 2024

Stale issue

from ydata-profiling.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.