Giter Site home page Giter Site logo

thedatasociety / lab-hadoop Goto Github PK

View Code? Open in Web Editor NEW
3.0 2.0 7.0 4.71 MB

Shell 0.37% Dockerfile 0.04% Jupyter Notebook 2.38% Python 0.19% PLpgSQL 97.02%
hadoop hbase hive sqoop hadoop-mapreduce hadoop-hdfs hadoop-streaming mrjob hadoop-yarn flume

lab-hadoop's Introduction

The Data Science and Engineering Society


github organization our docker hub organization our slack our twitter quilt packages

lab-hadoop

A laboratory for learning Hadoop.

Table of contents

Launching this laboratory on a binderhub instance

Binderhub uses repo2docker for launching and serving the computational environments. The list below contains the known binderhub instances.

The list below provides the badges and the hyperlinks for launching one of the specific interfaces in one of the binderhub instances.

JupyterLab

  • launch @ gke.mybinder.org

  • launch @ ovh.mybinder.org

  • launch @ gesis.org

  • launch @ pangeo.io

Jupyter

  • launch @ gke.mybinder.org

  • launch @ ovh.mybinder.org

  • launch @ gesis.org

  • launch @ pangeo.org

Launch this laboratory locally using Docker

We offer here two options for launching this lab locally: via our Docker image or via repo2docker. Before firing up your container, make sure your local user is in the docker group. Please refer to this Docker documentation for more details. It is strongly advised to not to run the container as root.

Via our Docker image

You can run a containerized instance of this lab from our own Docker image. Please refer to this link for installing Docker. Then, try:

docker run -it -p 8888:8888 thedatasociety/lab-hadoop \
           jupyter lab --ip 0.0.0.0 --NotebookApp.token='dstoken1234567' 

Alternatively, you can map you local home folder into the container:

docker run -it -p 8888:8888  \
           -v $(echo ~):/jupyter/data/local-home \
           thedatasociety/lab-hadoop \
           jupyter lab --ip 0.0.0.0 --NotebookApp.token='dstoken1234567'

The Dockerfile used to build the image can be found here.

Please read the section Accessing your Docker container for more details about running/accessing your container.

You can also launch this laboratory locally using Docker and repo2docker. Please refer to this link for installing Docker and this link for installing repo2docker. Use the commands below to build a Docker image and start a container from it.

repo2docker -p 8888:8888 \
            -v $(echo ~):$(echo ~)/local-home \
            https://github.com/thedatasociety/lab-hadoop \
            jupyter lab --ip 0.0.0.0 --NotebookApp.token='dstoken1234567'

Please read the section Accessing your Docker container for details about running/accessing your container.

Accessing your Docker container

Both options above will launch a container on port 8888. They will also create a Docker volume that maps the user's home into the container (in the container, look for the local-home folder).

Please also be aware that the --ip 0.0.0.0 directive will start a sever which will accept connections from any ip. For security purposes the --NotebookApp.token='dstoken1234567' directive forces the need of a token for accessing any interface. Use the dstoken1234567 to login or feel free to set a stronger token.

Each interface will be available at a specific path, as follows:

See the repo2docker documentation for more details regarding the use of multiple interfaces.

Contributing

License

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.