This repository contains Jupyter notebooks that run and compare DataFrame benchmarks across the Pandas, Polars, and DataFrames.jl packages.
- Python 3.10.6
  - Polars 0.18.15
  - Pandas 2.0.3
- Julia 1.9.2
  - DataFrames.jl 1.6.1
- Hard drive S.M.A.R.T. data has been graciously made available for public use by Backblaze. The daily CSV files for each quarter are bundled into a single zip file, which can be downloaded from here. We use data for only a selected number of days.
- Steam games recommendation data has been obtained from Kaggle. More information is available here.
- Rotten Tomatoes movie review data has been obtained from Kaggle. More information can be found here.
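
As a rough illustration of using only a selected number of days from the Backblaze archive, the sketch below reads just the requested daily CSV files out of a zip file without extracting everything. It is not taken from the notebooks: the function name `read_selected_days` is hypothetical, and it assumes each day is stored as a file named `<YYYY-MM-DD>.csv` somewhere inside the archive.

```python
import csv
import io
import zipfile
from pathlib import PurePosixPath


def read_selected_days(zip_path, days):
    """Return {day: list of row dicts} for the requested days found in the archive.

    Assumption (not confirmed by this repository): one CSV per day,
    named "<YYYY-MM-DD>.csv", possibly inside a sub-folder of the zip.
    """
    wanted = set(days)
    tables = {}
    with zipfile.ZipFile(zip_path) as archive:
        for member in archive.namelist():
            path = PurePosixPath(member)
            # Skip anything that is not a CSV for one of the requested days.
            if path.suffix != ".csv" or path.stem not in wanted:
                continue
            with archive.open(member) as raw:
                # Decode the binary zip stream as text for the csv module.
                text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                tables[path.stem] = list(csv.DictReader(text))
    return tables
```

The sketch uses only the standard library so that it stays independent of any one DataFrame package; in a benchmark, the stream opened with `archive.open(member)` would more likely be handed to the CSV reader of the library being measured.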