Giter Site home page Giter Site logo

brunocalou / bigdatatime Goto Github PK

View Code? Open in Web Editor NEW
1.0 9.0 2.0 6.12 MB

[College Project] Big Data project using Spark for the Big Data class at UFRJ

License: MIT License

Java 0.18% Scala 0.15% HTML 0.14% CSS 92.18% JavaScript 7.34% PHP 0.01%

bigdatatime's Introduction

BigDataTime

[College Project] Big Data project using Spark for the Big Data class at UFRJ

Dependencies

Install the following dependencies in order to build and run the project

Project

SOM

Self-organizing Map

  • Location: src/som
  • Language: Scala

BTC Variation Calculator with DataFrame

This programs reads the Bitcoin valuation csv file and calculates the variation from a day to another. It also implements DataFrame, so we can easily get the variation of the currency by calling the method getVariation() by passing the date as a parameter.

  • Location: src/variation
  • Language: Java

Tools

news-finder

This program is used to retrieve news from a few websites and save them on disk

  • Location: tools/news-finder
  • Language: Javascript

bitcoin-market-price-downloader

This program is used to retrieve the bitcoin price history and save it as a CSV file

  • Location: tools/bitcoin-market-price-downloader
  • Language: Javascript

Build

SOM

cd src/som/
sbt package

BTC Variation Calculator with DataFrame

cd src/variation/
mvn package

Creating the jar application use maven to create a jar: add this to to your pom.xml file:

        <plugin>
            <artifactId>maven-assembly-plugin</artifactId>
            <configuration>
                <archive>
                    <manifest>
                        <mainClass>fully.qualified.MainClass</mainClass>
                    </manifest>
                </archive>
                <descriptorRefs>
                    <descriptorRef>jar-with-dependencies</descriptorRef>
                </descriptorRefs>
            </configuration>
        </plugin>

and call:

cd src/variation
mvn clean compile assembly:single

news-finder tool

cd tools/news-finder/
npm install

bitcoin-market-price-downloader

cd tools/bitcoin-market-price-downloader/
npm install

Run

SOM

cd src/som/
<your-spark-folder>/bin/spark-submit target/scala<version>/som_project_<version>.jar

BTC Variation Calculator with DataFrame

cd src/variation
<your-spark-folder>/bin/spark-submit VBBigData-1.0-SNAPSHOT.jar variation "date(yyyy-mm-dd)"

news-finder tool

Download all the news from all the available sites

cd tools/news-finder/
./run.sh

If you can't execute the script, change its permission (on linux)

chmod u+x ./run.sh

Download specific sites using the command

node index.js -s <site>

You can also choose the keyword, the initial and final pages

node index.js -s <site> -k <keyword> -f <from-page> -t <to-page>

Need help? Use the -h parameter

node index.js -h

bitcoin-market-price-downloader

Just run it using

cd tools/bitcoin-market-price-downloader/
node index.js

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.