Comments (2)
I agree that this is a great feature. We should add this natively on the Rust side.
from polars.
I'm not very experienced with Polars, and I understand this may already be known and that you'd like a more straightforward interaction (apologies for the noise if so). Just in case it's useful to anyone: it is possible to write Parquet files directly to GCS (as well as to other storage providers):
import polars as pl
import gcsfs

# df = pl.read_parquet('file_path')
# Assuming `df` is your Polars DataFrame, and that the GOOGLE_APPLICATION_CREDENTIALS env variable is correctly set
fs = gcsfs.GCSFileSystem()

# Define your GCS bucket and file path
destination = "gs://bucket/folder/file.parquet"

# Write the DataFrame to a Parquet file directly in GCS
with fs.open(destination, mode='wb') as f:
    df.write_parquet(f)
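For the "other storage providers" part, the same pattern can be wrapped in a small helper. This is only a sketch, not anything Polars ships: `write_parquet_to_fs` is a hypothetical name, and it assumes the filesystem object follows the same `open(path, mode=...)` interface that `gcsfs.GCSFileSystem` exposes above (other fsspec-based filesystems such as `s3fs.S3FileSystem` should behave the same way, but I have not tested them here).

```python
from typing import Any


def write_parquet_to_fs(df: Any, fs: Any, destination: str) -> None:
    """Write `df` as Parquet to `destination` through a filesystem object.

    Hypothetical helper. Assumptions:
    - `df` has a `write_parquet(file)` method, as a Polars DataFrame does.
    - `fs.open(path, mode="wb")` returns a writable binary file object that
      works as a context manager, as gcsfs.GCSFileSystem does.
    """
    # Open the remote object for binary writing and let the DataFrame
    # stream its Parquet bytes into it.
    with fs.open(destination, mode="wb") as f:
        df.write_parquet(f)
```

With the snippet above it would be called as `write_parquet_to_fs(df, fs, destination)`; switching providers would then only mean constructing a different filesystem object and using that provider's URL scheme.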