Giter Site home page Giter Site logo

r-advantages-over-python's People

Contributors

edvardoss avatar iyarlin avatar saintrod avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

r-advantages-over-python's Issues

Numpy can work with different types

I am referring to this part of your comparison:

Do note however that numpy can’t help you if the vector you’re working
with is made up of different types of elements (e.g. strings and
numbers). In R you can use lists and still enjoy vectorization:

It seems that you can easily do that:

values = np.array(['a', 1, (1, 2, 3), 'wow', ['sweet', 2]])
indices = np.arange(len(values))
values[indices > 2]
## array(['wow', list(['sweet', 2])], dtype=object)

Moving windows example

I'm not so sure it's objectively worse in Python. It's about the same number of lines, and the only difference seems to be a subjective judgement call about the ubiquitousness of functions. I'm not sure it's a strong enough example.

H2O.ai benchmark is out of date

Great repo by the way!

The benchmark has quite different results now that Polars and DataFrames.jl have matured a bit more, even datatable for python. I've been working on automl packages for both R and Python (the R one is much further along and the Python one is relatively new) and I can't state enough how almost everything I do to create these packages is more easily done in R.

Now that Python does have some promising data wrangling packages (polars / datatable) I think it's important to focus on the capabilities in those packages compared to data.table (which is still king in my book). The example you have with the case-when of dplyr vs pandas is a superb example of two things: the capabilities that exists and the ease of which to deploy them, and in both cases, R wins. I believe that you'll find this to be true to more you go digging into feature engineering capabilities from all those packages.

Anyway, great work

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.