Giter Site home page Giter Site logo

gallettilance.github.io's Introduction

layout title nav_exclude permalink seo
home
CS 506
true
index.html
type name
Course
Data Science Tools and Applications

Data Science Tools and Applications

Getting Started Checklist

  1. Join Piazza and Discord
  2. Create a GitHub account
  3. Create a Kaggle account
  4. Fill out this form (requires BU email) with your GitHub and Kaggle account username
  5. Install Python and Jupyter Notebook
  6. Sign up for GradeScope (code: ZWZ4X5)

About

The goal of this course is to provide students a hands-on understanding of classical data analysis techniques and to develop proficiency in applying these techniques in modern programming languages (Python) while also learning about the social and ethical challenges of collecting and mining data by studying real world examples.

The course introduces students to a wide range of techniques that are commonly used in the analysis of data such as clustering, classification, regression, and neural networks.

Note that this is not a Python (or an introduction to programming) course, so self-study will be necessary for those students who do not already know the language.

Prerequisites

Students taking this class must have some prior familiarity with programming at the level of CS 105, 108, or 111, or equivalent. CS 132 or equivalent (MA 242, MA 442) is required. CS 112 is also helpful.

Emails

If emailing the CS506 staff, or creating a private Piazza post, please always CC or include the instructor, the CF, and all TAs.

Final Project

BU Spark! offers students an opportunity to work on technical projects provided by companies or organizations in the Greater Boston area through our experiential learning lab (X-Lab). For this semester, Spark! has partnered with CS506 to offer a diverse selection of external data science projects scoped to support the course’s learning outcomes and enhance the student experience. To learn more about Spark!, please visit their website.

Spark! projects are a great opportunity for students to get real-world project experience to highlight on their github and CV. These projects have already been curated and will be presented during “Pitch Day”. Project descriptions will be made available at the start of the semester. BU Spark! projects will be led by one of the Spark! Project managers. Each project will be assigned a Spark! Technical Engineer to review PRs, review code, and provide technical support.

Teams will have ~5 students. Students that decide to take on the role of team lead will receive extra credit. Teams will be formed based on availability and a project preference form that you will be asked to submit after Pitch Day.

At the end of the semester, one team will be selected to present their project on the Spark Demo day stage. One team per project will be selected to present a poster of their project on Demo Day.

Homeworks

The homework assignments will be due throughout the semester as relevant material is covered. There will be 7 assignments in total. Homeworks are collaborative. You can submit them individually or as a group. I highly encourage you to create a study group, share knowledge, and work on and submit the homeworks together.

Late Policy

Late homeworks will be accepted up to 48h after the due date. Late homeworks will incur a 12% penalty for the first 24h and a 20% penalty for next 24h. No points will be awarded after 48h. The lowest homework grade will be dropped at the end of the semester.

Re-Grades

If you notice an issue with a grade you’ve received, please don't email the teaching staff. Instead, please submit a regrade on Gradescope within 48h of receiving the grade. Anything beyond 48h will not be accepted for a re-grade.

Midterm

The midterm will be a Kaggle Data Science competition among the students in the class with a live leaderboard. Students will need to submit predictions based on a training dataset and a report detailing the methods used and decisions made.

Worksheets

Every lecture is accompanied with a worksheet due 24h after lecture. To receive credit, you need to reasonably complete them reasonably well. There will be no partial credit for worksheets or late deadlines. These are meant to help you further develop practical skills learned in class and only the highest 15 worksheet grades will count toward your grade (so about 7 or so will be dropped / optional).

Labs

Labs will be dedicated to helping you with your project. Team Leads must attend labs to provide an update on the project which will count toward the project grade. Technical Experts (TEs) as well as TAs will be available during labs to help.

Grading

Grading
15% midterm
30% homeworks
15% worksheets
40% final project
5% extra credit
Letter Grade
A 95% +
A- 90% - 95%
B+ 87% - 90%
B 83% - 87%
B- 80% - 83%
C+ 77% - 80%
C 73% - 77%
C- 70% - 73%
D 60% - 70%
F below 60%

Extra Credit

Extra credit can be earned by consistently:

  • Attending class and labs.
  • Asking and answering questions on Piazza.
  • Submitting PRs to our class repository with code or class notes.
  • Contributing to our class repository or course website (by fixing typos, providing clarification edits etc.)

gallettilance.github.io's People

Contributors

fahdfareed avatar funkyvoong avatar gallettilance avatar gavintanbu avatar haoranhu01 avatar hipatil avatar jankihc avatar krith-man avatar nataliaclark01 avatar realazee avatar rechhabra avatar showndarya avatar sueda-n avatar superkakayong avatar vedikasrivastava avatar vivian271 avatar yabazah avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.