Comments (4)
I cannot reproduce. What does valgrind say? And have you tried with latest Polars?
from polars.
I just downgraded to polars==0.20.15 and pyarrow==15.0.0 and still I can't reproduce although this env has python 3.11.9
--------Version info---------
Polars: 0.20.15
Index type: UInt32
Platform: Linux-5.15.146.1-microsoft-standard-WSL2-x86_64-with-glibc2.35
Python: 3.11.9 (main, Apr 6 2024, 17:59:24) [GCC 11.4.0]
----Optional dependencies----
adbc_driver_manager: <not installed>
cloudpickle: <not installed>
connectorx: 0.3.2
deltalake: <not installed>
fastexcel: <not installed>
fsspec: <not installed>
gevent: <not installed>
hvplot: <not installed>
matplotlib: 3.8.4
numpy: 1.26.4
openpyxl: <not installed>
pandas: 2.2.2
pyarrow: 15.0.0
pydantic: <not installed>
pyiceberg: <not installed>
pyxlsb: <not installed>
sqlalchemy: <not installed>
xlsx2csv: <not installed>
xlsxwriter: <not installed>
from polars.
Ok thanks for checking in on this, I may have something far more wrong with my system then.
❯ python3
Python 3.12.3 (main, Apr 9 2024, 08:09:14) [GCC 13.2.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import polars as pl
>>> pl.DataFrame({"a": [1,2,3], "b":[4,5,6]})
shape: (3, 2)
┌─────┬─────┐
│ a ┆ b │
│ --- ┆ --- │
│ i64 ┆ i64 │
╞═════╪═════╡
│ 1 ┆ 4 │
│ 2 ┆ 5 │
│ 3 ┆ 6 │
└─────┴─────┘
>>> pl.show_versions
<function show_versions at 0x7f343ed68cc0>
>>> pl.show_versions()
fish: Job 1, 'python3' terminated by signal SIGSEGV (Address boundary error)
I went ahead and upgraded my versions and same issue. Seemingly works fine from an jupyter notebook however. I fear something is really wrong with my system. Ill go ahead and close this.
from polars.
I have narrowed it down to, in my Nix configuration, when I have PyArrow enabled, the error happens as soon as I try to use Polars for certain activities. However, what is odd is I do not get any of these errors when running in a Jupyter notebook.
(python312.withPackages (ppkgs: with ppkgs; [
polars
pyarrow
numpy
pandas
scipy
matplotlib
seaborn
boto3
tqdm
pyyaml
requests
ipython
ipykernel
humanize
]))
Is there a certain configuration i need to pass to valgrind to get this to report something useful?
❯ valgrind --leak-check=yes python3 -c "import polars as pl; pl.show_versions()"
==39677== Memcheck, a memory error detector
==39677== Copyright (C) 2002-2022, and GNU GPL'd, by Julian Seward et al.
==39677== Using Valgrind-3.22.0 and LibVEX; rerun with -h for copyright info
==39677== Command: python3 -c import\ polars\ as\ pl;\ pl.show_versions()
==39677==
fish: Job 1, 'valgrind --leak-check=yes pytho…' terminated by signal SIGSEGV (Address boundary error)
from polars.
Related Issues (20)
- pl.struct with no arguments triggers a panic
- Multiple expr.head(n).max()/min()/etc operations in with_columns causing ShapeError
- Add `repeat` and `tile` for Series/Expr
- `SchemaFieldNotFoundError` when chaining `select` and `collect`
- Series is ignoring the dtype argument, series.to_numpy() dtype depends on values passed
- Problem filtering categorical string columns with lazy frame and scan_parquet HOT 4
- PanicException: validity must be equal to the array's length HOT 6
- Polars-lts-cpu fails to import on older CPU (no SSSE3/SSE4 support)
- Release GIL on `collect_schema`
- Error when using struct expression with `with_fields` in an `over` context HOT 6
- How to write a UDF for polars that run concurrently?
- `write_excel`: write column formats for column, not individual cells within column HOT 1
- Horizontal concat execution time is quadratic in the number of columns
- Incorrect results from `Series.__rtruediv__` HOT 1
- Add SQL feature of ORDER BY RANDOM()
- Expressions support in insert_column (like with_columns)
- Passing a `Series` to `DataFrame.sort` gives "literal expressions are not allowed for sorting" error HOT 3
- polars' ingestion of decimal.Decimal values fails if all values do not have the same number of decimal places HOT 3
- Series.is_in called with a mixed list of Python integers and floats fails HOT 4
- Where is indexing `__getitem__` (e.g. `df[...]`) documented for polars DataFrame and Series? HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from polars.