Giter Site home page Giter Site logo

ging / edvl Goto Github PK

View Code? Open in Web Editor NEW
1.0 11.0 0.0 72.94 MB

Educational Data Virtual Lab

License: GNU Affero General Public License v3.0

Shell 28.32% Python 11.98% Dockerfile 4.09% Scala 7.23% XSLT 1.33% Jupyter Notebook 47.06%
education fiware data notebooks zeppelin zeppelin-notebook spark streaming-data fiware-orion fiware-ngsi fiware-cosmos fiware-draco fiware-keyrock upm ipynb notebook apache-zeppelin big-data-platform human-data-interaction

edvl's Introduction

Educational Data Virtual Lab (EDVL)

The Educational Data Virtual Lab (EDVL) is a component of the ADA project that will be used for the delivery of the practical and hands-on part of the Urban Mobility Data Science courses.

It is based on Apache Zeppelin and the European FIWARE platform, in which the specific components of Data Science applied to Urban Mobility will be integrated.

Apache Zeppelin is a new and upcoming web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more. It provides data exploration, visualization, sharing and collaboration features and supports a plethora of languages and technologies.

FIWARE is a curated framework of open source components to accelerate the development of smart solutions, which enable the connection to IoT with Context Information Management and Big Data services in the Cloud. Furthermore, it provides standard APIs for data management and exchange, as well as harmonised data models.

Requirements

  • Docker and Docker-compose

Installation

  • Clone this project
git clone https://github.com/ging/edvl
cd edvl
  • Run the whole scenario
docker-compose up

Example notebooks

EDVL comes with a curated set of notebooks that can be use to get started in data science training. They are available in the notebookdirectory. To run any of the notebooks you just need to:

  • Click "Import note". Pick a name and choose the "Select JSON File/IPYNB File" option

  • Choose the notebook that you want to explore from the notebook directory

  • Open the notebook and run all of the chunks one by one in order.

Below is a description of the notebooks available.

MongoDB with native visualizations

Notebook 1. ExampleMongo.zpln showcases Apache Zeppelin's native visualizations when querying a Mongo database. It can be seen how data can be explored in an interactive way through many graphs and visualizations.

SparkML

Stepping up from mere data exploration, notebook 2. ExampleSparkML.zpln shows how EDVL can be used for the complete lifecycle of machine learning, from data acquisition and storage provided by FIWARE Generic Enablers, to model training and prediction thanks to the SparkML library.

MongoSpark with Scala

Instead of directly querying a Mongo database, notebook 3. ExampleMongoSpark.zpln shows how MongoSpark can be used to query data using the Scala language, and how the data retrieved can be ploted using web visualization libraries.

Python Pandas

Apache Zeppelin supports one of the most common languages for data analysis (i.e., Python). In notebook 4. ExamplePandas.zpln, a common workflow of analyzing a CSV file using Python Pandas is provided.

Spark streaming

Not only batch analysis is supported, but also real-time. Thanks to Spark Streaming and the FIWARE Cosmos Spark Connector, data can be analyzed as soon as it arrives from the FIWARE Context Broker and plotted in real time using web visualization libraries (5. ExampleStreamingPrint.zpln and 6. ExampleStreamingGraph.zpln).

Legacy Jupyter Notebook

Apache Zeppelin allows to import Jupyter Notebooks and reuse existing code. This way, users who are migrating from Jupyter can resume their work immediately. An example is provided in notebok 7. Jupyter2Zeppelin.ipynb

edvl's People

Contributors

anmunoz avatar sonsoleslp avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

edvl's Issues

Concurso Software Libre

Buenas, desde el CUSL nos gustaría que miráseis el último correo que os hemos enviado.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.