
msbx5420spring2019's Introduction

MSBX 5420 - Spring 2019

Unstructured and Distributed Data Modeling and Analysis

Leeds School of Business, University of Colorado Boulder

Instructor: Dr. Spencer Stirling

Contact us

Schedule (subject to change)

Week 1 (January 14)
  • VM Installation
  • Course Overview
  • What is Virtualization?
  • Virtualization-lite: Docker
  • HOMEWORK: Linux Basics and Bash

Week 2 (January 21)
  • MLK Day: NO CLASS
  • HOMEWORK: Intro to Python

Week 3 (January 28)
  • COMPLETE BEFORE CLASS: Install standalone Spark
  • HOMEWORK: Stats in Python
  • SOLUTIONS: Stats in Python

Week 4 (February 4)
  • Intro to Spark
  • HOMEWORK: NASA weblogs

Week 5 (February 11)
  • Spark architecture
  • EXAM: Exam 1

Week 6 (February 18)
  • Install Spark and HDFS clusters
  • JSON tutorial
  • Intro to HDFS
  • HOMEWORK: Manage source code with git
  • NO OTHER HOMEWORK THIS WEEK

Week 7 (February 25)
  • Intro to DataFrames
  • HOMEWORK: CWL analysis

Week 8 (March 4)
  • Working with DataFrames
  • HOMEWORK: COMBINED WITH WEEK 9

Week 9 (March 11)
  • More DataFrames
  • HOMEWORK: ETL CWL data

Week 10 (March 18)
  • Install Kafka cluster
  • Intro to Kafka
  • EXAM: Exam 2

March 25
  • Spring Break: NO CLASS

Week 11 (April 1)
  • REST APIs
  • HOMEWORK: Ingest CitiBike into Kafka

Week 12 (April 8)
  • Consuming from Kafka
  • HOMEWORK: COMBINED WITH WEEK 13

Week 13 (April 15)
  • Intro to Elasticsearch and Kibana
  • HOMEWORK: CitiBike Elasticsearch analysis

Week 14 (April 22)
  • Ask Me Anything lecture

Week 15 (April 29)
  • Elasticsearch query language: Lucene
  • FINAL: Final

msbx5420spring2019's People

Contributors

sstirlin


msbx5420spring2019's Issues

Import Error: No module named numpy

I wanted to note an issue I had when following the Python tutorials for the week 2 homework, in case anyone else in the class runs into the same problem.

When importing NumPy in the "intro_to_python02.mp4" video, I got an error saying that no module named numpy could be found: ImportError: No module named numpy

This is easy to fix from the terminal of the Vagrant machine, but before you install numpy, make sure you run conda activate py37 so that the install goes into the correct environment. After that, run either pip install numpy or conda install numpy, and numpy will be found as a module in Jupyter notebooks.
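If it is unclear which environment a notebook or shell is actually using, a quick check from Python can help. The following is a minimal sketch, not part of the course materials: the helper names `module_available` and `describe_environment` are made up for illustration, and it assumes only the Python standard library. It reports the interpreter path and whether a module such as numpy is visible to that interpreter.

```python
import importlib.util
import sys


def module_available(name):
    """Return True if the top-level module `name` can be found by this interpreter."""
    return importlib.util.find_spec(name) is not None


def describe_environment(module_name="numpy"):
    """Summarize which interpreter is running and whether the module is visible to it."""
    return {
        "interpreter": sys.executable,  # path of the running Python binary
        "prefix": sys.prefix,           # root directory of the active environment
        "module": module_name,
        "available": module_available(module_name),
    }


if __name__ == "__main__":
    info = describe_environment()
    print("Interpreter:", info["interpreter"])
    print("Environment prefix:", info["prefix"])
    if info["available"]:
        print(info["module"], "is importable from this environment")
    else:
        print(info["module"], "is NOT importable; activate the intended "
              "environment and install it there")
```

If the printed prefix does not point at the py37 environment, the package was probably installed into a different environment than the one running the code.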
