Comments (10)
I got it working with:
- parquet-mr (master 6e0cc729d)
- spark (branch-2.3 aa1764ba1a)
I'm not that familiar with gradle and probably misread the error message. I agree that core doesn't look like it has a dependency on parquet. :)
from iceberg.
Also, which git hash should I be using for spark? The tip of branch-2.3 (75131ee867bc) didn't work, even after I changed build.gradle to use spark 2.3.1-SNAPSHOT. sigh
from iceberg.
Core shouldn't have a dependency on Parquet, I was trying to keep that contained to just iceberg-parquet. I'll have a look.
Spark's DataSourceV2 implementation has been changing, so I've been avoiding an update... for example, lots of the classes were just recently renamed in time for the next RC. I plan to base this on 2.3.0 when it is released.
from iceberg.
Can you tell me which hash you use for spark?
from iceberg.
version=2.3.0-SNAPSHOT
user=blue
revision=aa1764ba1addbe7ad79344d5640bf6426267a38c
branch=master
date=2017-12-13T20:22:53Z
url=https://github.com/rdblue/spark.git
from iceberg.
Thanks, Ryan. That did the trick.
from iceberg.
Do you know how iceberg-core depends on parquet-avro? I'm looking at the dependencies and I don't see it:
gradle iceberg-core:dependencies --configuration compile
compile - Dependencies for source set 'main' (deprecated, use 'implementation ' instead).
+--- project :iceberg-api
+--- project :iceberg-avro
| +--- project :iceberg-api
| +--- project :iceberg-common
| \--- org.apache.avro:avro:1.8.2
| +--- org.codehaus.jackson:jackson-core-asl:1.9.13
| +--- org.codehaus.jackson:jackson-mapper-asl:1.9.13
| | \--- org.codehaus.jackson:jackson-core-asl:1.9.13
| +--- com.thoughtworks.paranamer:paranamer:2.7
| +--- org.xerial.snappy:snappy-java:1.1.1.3
| +--- org.apache.commons:commons-compress:1.8.1
| +--- org.tukaani:xz:1.5
| \--- org.slf4j:slf4j-api:1.7.7
+--- com.fasterxml.jackson.core:jackson-databind:2.6.7
| +--- com.fasterxml.jackson.core:jackson-annotations:2.6.0
| \--- com.fasterxml.jackson.core:jackson-core:2.6.7
\--- com.fasterxml.jackson.core:jackson-core:2.6.7
Iceberg's Spark support does depend on Parquet, which I agree sucks for now. We'll get a Parquet release out to fix this.
from iceberg.
You might be able to get at least core working with the latest parquet-avro release. It won't work with column ids, but it might compile and unblock you guys for Avro and ORC.
from iceberg.
#16 updated Spark to the 2.3.0 release. Thanks, @julienledem!
We'll get a Parquet release out soon, too.
from iceberg.
Just updated to Parquet 1.10.0.
from iceberg.
Related Issues (20)
- Allow Specifying Partitioning Function for External Mappings HOT 1
- Document Spark Issues Affecting the Iceberg Implementation
- Snapshot-Level Metrics and Statistics
- Cannot convert unknown primitive type: required int96 `timestamp` HOT 2
- Add More Documentation HOT 4
- Custom metadata in data files
- Custom InputFile / OutputFile providers for Spark HOT 4
- Support for AWS Glue as an alternative Hive metastore implementation HOT 5
- .travis.yml: The 'sudo' tag is now deprecated in Travis CI
- Port to Azure or Google Cloud?
- Any way to integrate Iceberg with AWS EMR Glue metastore?
- Problem inserting data into a table with structs (iceberg-spark)
- java.lang.AbstractMethodError: Method org/apache/iceberg/spark/source/SparkTable.newWriteBuilder HOT 1
- Long wait time for retry when refreshing table metadata HOT 1
- Support Customizing The Location Of Data Files Written By The Spark Data Source HOT 3
- Upgrade to Spark 2.4.0 HOT 2
- iceberg-runtime jar on JitPack is empty HOT 4
- Snapshot cryptographic integrity HOT 1
- Replace Literals with Stateless Functions
- Predicate Compiler for Evaluators
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from iceberg.