
Comments (5)

chrisbennight commented on September 16, 2024

Is there a separate 'spark-yarn'? I think the regular 'spark-core' package should work (from a quick glance, it seems to have been built with YARN support):

http://archive-primary.cloudera.com/cdh5/redhat/6/x86_64/cdh/5.3/RPMS/noarch/spark-core-1.2.0+cdh5.3.2+369-1.cdh5.3.2.p0.17.el6.noarch.rpm

from mrgeo.

pliguori commented on September 16, 2024

Sorry, maybe I wasn't clear enough. I'm talking about the Maven artifacts that are defined in the mrgeo pom file.
In the cdh532 profile, the pom tries to download the following spark-yarn dependency:

<dependency>
  <groupId>org.apache.spark</groupId>
  <artifactId>spark-yarn_2.10</artifactId>
  <version>${spark.version}</version>
</dependency>

Here spark.version is defined as <spark.version>1.2.0-cdh5.3.2</spark.version>.
But this version does not exist in the Cloudera Artifactory repository, as you can see here:

https://repository.cloudera.com/artifactory/cloudera-repos/org/apache/spark/spark-yarn_2.10/


ttislerdg commented on September 16, 2024

For some reason, Cloudera has left the spark-yarn_2.10 jar out of its CDH5.3.x repos. It does exist in previous and more recent versions. To get around the problem, I had to download the Spark release from Cloudera (located at http://archive.cloudera.com/cdh5/cdh/5/, specifically http://archive.cloudera.com/cdh5/cdh/5/spark-1.2.0-cdh5.3.2.tar.gz for CDH5.3.2). The unpacked tarball contains all the Spark jars. Then take the spark-yarn_2.10 jar and load it into your local Maven repo using a typical mvn install:install-file command.
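
As a rough sketch of that last step, the install command might look like the following. The groupId, artifactId, and version are taken from the pom snippet quoted earlier in the thread; the jar path is a placeholder, since the thread does not say where the jar sits inside the unpacked tarball. The run helper and the DRY_RUN variable are illustrative additions, not part of Maven: by default the script only prints the command so it can be reviewed first.

```shell
#!/bin/sh
# Placeholder path (assumption): point this at the spark-yarn jar
# found in the unpacked Spark tarball.
JAR_PATH="${JAR_PATH:-/path/to/spark-yarn_2.10-1.2.0-cdh5.3.2.jar}"

# Illustrative helper: print the command when DRY_RUN=1 (the default),
# execute it when DRY_RUN=0.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$@"
  else
    "$@"
  fi
}

# Install the jar into the local Maven repository under the coordinates
# that the cdh532 profile of the pom asks for.
install_spark_yarn() {
  run mvn install:install-file \
    "-Dfile=$1" \
    -DgroupId=org.apache.spark \
    -DartifactId=spark-yarn_2.10 \
    -Dversion=1.2.0-cdh5.3.2 \
    -Dpackaging=jar
}

install_spark_yarn "$JAR_PATH"
```

Running the script with DRY_RUN=0 and JAR_PATH set to the real jar location performs the actual install; after that, the build should resolve the dependency from the local repository.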

Let me know if this works for you.


pliguori commented on September 16, 2024

I will try it soon. In the meantime, I spoke to Cloudera support, and they recommended using spark-network-yarn instead of spark-yarn.


pliguori commented on September 16, 2024

I can confirm that your trick worked. The workaround suggested by Cloudera doesn't work at all, since spark-network-yarn depends on spark-yarn.
However, it seems that Cloudera will redeploy the 1.2 artifact to their repository soon.

