Giter Site home page Giter Site logo

novalegra / explore-jaeb-data-old Goto Github PK

View Code? Open in Web Editor NEW

This project forked from tidepool-org/data-science-explore-jaeb-data

0.0 1.0 0.0 327 KB

Exploratory work performed with the Jaeb Loop Study data

License: BSD 2-Clause "Simplified" License

Python 100.00%

explore-jaeb-data-old's Introduction

Tidepool Data Science Project Template

Creating a new repository from this template

Manually create a new repo from this template in github; github directions are here.

New Repo Checklist & Instructions

  • Update repo settings in github (manual process)
    • Update Settings/Options/Repository name
      • Name follows the <team (optional)> - <type(optional)> - <one-to-three-word-description> - <initials (optional)> in lowercase-dash-format. Examples:
      • icgm-sensitivity-analysis is used by all of Tidepool so no team is needed and is considered production code so no type is needed.
      • data-scence-donor-data-pipeline is only used by Data Science
      • data-science-template-repository is a template (type) used by Data Science Team
      • data-science-explore-<short-description> type of work is exploratory
      • data-science-explore-<short-description>-etn exploratory solo work has initials at the end
    • Update Settings/Options/Manage access
      • Invite data-science-admins team and give admin access
      • Invite Data Science team and give write access
    • Update Settings/Options/Manage access/Branch protection rules
      • Set Branch name pattern to master
      • Check Require pull request reviews before merging
      • Set Required approving reivews: to 1 for non-production code and 2 for production code
      • Check Dismiss stale pull request approvals when new commits are pushed
      • TODO: add in travis ci instructions via Require status checks to pass before merging
  • Fill in this readme. Everything in [ ]'s should be changed and/or filled in.
  • After completing this checklist, move the completed checklist to the bottom of the readme
  • Delete everything above the [Project Name]

[Project Name]

-- Project Status: [Active, On-Hold, Completed]

-- Project Disclaimer: This work is for [Exploration, Development, Production]

Project Objective

The purpose of this project is to [___].

Definition of Done

This phase of the project will be done when [___].

Project Description

(Add a short paragraph with some details, Why?, How?, Link to Jira and/or Confluence) In order to learn/do [], we did [].

Technologies (Update this list)

  • Python (99% of the time)
  • Anaconda for our virtual environments
  • Pandas for working with data (99% of the time)
  • Google Colab for sharing examples
  • Plotly for visualization
  • Pytest for testing
  • Travis for continuous integration testing
  • Black for code style
  • Flake8 for linting
  • Sphinx for documentation
  • Numpy docstring format
  • pre-commit for githooks

Getting Started with the Conda Virtual Environment

  1. Install Miniconda. CAUTION for python virtual env users: Anaconda will automatically update your .bash_profile so that conda is launched automatically when you open a terminal. You can deactivate with the command conda deactivate or you can edit your bash_profile.
  2. If you are new to Anaconda check out their getting started docs.
  3. If you want the pre-commit githooks to install automatically, then following these directions.
  4. Clone this repo (for help see this tutorial).
  5. In a terminal, navigate to the directory where you cloned this repo.
  6. Run conda update -n base -c defaults conda to update to the latest version of conda
  7. Run conda env create -f conda-environment.yml --name [input-your-env-name-here]. This will download all of the package dependencies and install them in a conda (python) virtual environment. (Insert your conda env name in the brackets. Do not include the brackets)
  8. Run conda env list to get a list of conda environments and select the environment that was created from the environmental.yml file (hint: environment name is at the top of the file)
  9. Run conda activate <conda-env-name> or source activate <conda-env-name> to start the environment.
  10. If you did not setup your global git-template to automatically install the pre-commit githooks, then run pre-commit install to enable the githooks.
  11. Run deactivate to stop the environment.

Maintaining Compatability with venv and virtualenv

This may seem counterintuitive, but when you are loading new packages into your conda virtual environment, load them in using pip, and export your environment using pip-chill > requirements.txt. We take this approach to make our code compatible with people that prefer to use venv or virtualenv. This may also make it easier to convert existing packages into pypi packages. We only install packages directly in conda using the conda-environment.yml file when packages are not available via pip (e.g., R and plotly-orca).

Getting Started with this project

  1. Raw Data is being kept [here](Repo folder containing raw data) within this repo. (If using offline data mention that and how they may obtain the data from the froup)
  2. Data processing/transformation scripts are being kept [here](Repo folder containing data processing scripts/notebooks)
  3. (Finishing filling out this list)

Contributing Guide

  1. All are welcome to contribute to this project.
  2. Naming convention for notebooks is [short_description]-[initials]-[date_created]-[version], e.g. initial_data_exploration-jqp-2020-04-25-v-0-1-0.ipynb. A short _ delimited description, the creator's initials, date of creation, and a version number,
  3. Naming convention for data files, figures, and tables is [PHI (if applicable)]-[short_description]-[date created or downloaded]-[code_version], e.g. raw_project_data_from_mnist-2020-04-25-v-0-1-0.csv, or project_data_figure-2020-04-25-v-0-1-0.png.

NOTE: PHI data is never stored in github and the .gitignore file includes this requirement as well.

Featured Notebooks/Analysis/Deliverables

Tidepool Data Science Team

Name (with github link) Tidepool Slack
Ed Nykaza @ed
Jason Meno @jason
Cameron Summers @Cameron Summers

Known TODO items

  • automate the process of finding all of the the TODO: comments in the code and put link here.

Initial Setup Checklist

explore-jaeb-data-old's People

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.