Comments (5)
I think conversion between parquet and arrow is a valid use case. The parquet-java provides built-in row-level interfaces like avro/thrift/protobuf. Other parquet (Java) implementations (Presto/Trino/Spark) simply leverage the page & metadata reader/writer from this library to build extensions. Extending native arrow support would be a welcome extension to this library, IMO.
from parquet-java.
If possible to use C++, I think parquet-cpp in the Apache Arrow is the best solution to your case: https://arrow.apache.org/docs/cpp/parquet.html
from parquet-java.
Yes, We know there is a c++ implementation here, but I was wondering if there is a corresponding implementation for java, since all our code is java .
from parquet-java.
parquet-arrow The library looks like it's meant to do this, But I don't know why it's always just the schema part.
from parquet-java.
So, the library parquet-arrow hasn't been used yet? because it only has schema mappings.
And We've looked at transformations in Spark, which are missing some types due to limitations in Spark, such as uint.So it's not really a universal conversion.
from parquet-java.
Related Issues (20)
- Bump Spotless to 2.43.0
- Remove Jackson JDK8 module
- missing changelog in release note
- Parquet check-stats command only supports checking BINARY type columns.
- Migrate Parquet Jira issues to GitHub
- Old Parquet files with wrong Compressed Size not Readable
- Why doesn't Parquet currently support writing multiple row groups simultaneously? HOT 1
- Introduce issue templates
- Adjust PR template to reference GitHub issue tracker HOT 5
- Double close of ParquetFileWriter in ParquetWriter
- Out of the box support for LocalOutputFile with ParquetWriter? HOT 7
- Update PULL_REQUEST_TEMPLATE
- Remove support for Hadoop <2.9
- Remove support for Hadoop <3.3
- Support Java 21
- NullPointerException when using the AvroParquetReader.Builder with a LocalInputFile HOT 2
- Create a comparator utility to compare 2 parquet files and give a clear message for the differences
- Add maven wrapper
- parquet-avro 1.14 not compatible with avro 1.8
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from parquet-java.