Comments (2)
> there should probably be a separate path which allows for conversion to pandas to happen directly without going through pyarrow

Even so, there will be dataframes that require pyarrow for conversion to pandas, so it should be part of the extras. I will pick this up as part of #17064.
from polars.
Thanks for the report. To be honest, I think there should probably be a separate path that lets conversion to pandas happen directly without going through pyarrow, just as already happens in `from_pandas`:
In [6]: import pandas as pd
In [7]: pl.from_pandas(pd.DataFrame({'a': [1,2,3], 'b': [4,5,6]}))
Out[7]:
shape: (3, 2)
┌─────┬─────┐
│ a   ┆ b   │
│ --- ┆ --- │
│ i64 ┆ i64 │
╞═════╪═════╡
│ 1   ┆ 4   │
│ 2   ┆ 5   │
│ 3   ┆ 6   │
└─────┴─────┘
In [8]: import pyarrow
---------------------------------------------------------------------------
ModuleNotFoundError Traceback (most recent call last)
Cell In[8], line 1
----> 1 import pyarrow
ModuleNotFoundError: No module named 'pyarrow'
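
For illustration only, here is a rough sketch of what such a pyarrow-free fallback could look like from the user side. The helper names `module_available` and `to_pandas_fallback` are made up for this example and are not part of polars; the sketch assumes `DataFrame.to_dict(as_series=False)` is acceptable as the slow path, and that pandas re-inferring dtypes from plain Python lists is good enough for simple frames like the one above.

```python
from importlib.util import find_spec


def module_available(name: str) -> bool:
    """Return True if the top-level module `name` can be imported here."""
    return find_spec(name) is not None


def to_pandas_fallback(df):
    """Hypothetical helper (not a polars API): convert a polars DataFrame to pandas.

    Uses the built-in `to_pandas()` when pyarrow is installed. Otherwise it
    builds the pandas frame from plain Python lists, which is slower and lets
    pandas re-infer the dtypes.
    """
    if module_available("pyarrow"):
        return df.to_pandas()

    # Imported lazily so `module_available` stays usable without pandas.
    import pandas as pd

    # `as_series=False` yields a dict of column name -> list of Python values.
    return pd.DataFrame(df.to_dict(as_series=False))
```

Something like `to_pandas_fallback(pl.DataFrame({'a': [1, 2, 3], 'b': [4, 5, 6]}))` would then work in the pyarrow-less environment shown above. Nested or otherwise exotic dtypes would likely still need pyarrow, which is the point made in the other comment.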