Giter Site home page Giter Site logo

gauthampughazhendhi / strapvizpy Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ubc-mds/strapvizpy

0.0 0.0 0.0 804 KB

A python package to automate bootstrapping and produce statistics and plots for reporting purposes.

License: MIT License

Python 100.00%

strapvizpy's Introduction

StrapvizPy

example workflow codecov Documentation Status

Summary

Performs bootstrapping of a sample to produce plots and statistics for use in final reports and documents.

The purpose of this package is to simplify and automate the process of creating simple bootstrap distributions of numerical samples. The package has a module which intakes a sample and relevant parameters such as the desired confidence bounds and number of simulations. The module will perform the simulation statistics to generate the bootstrap distribution and relevant statistics such as the sample mean and bootstrapped confidence interval. The package also has a module for visualization of the bootstraped confidence interval, and for creating a professional publication-ready table of the relevant statistics.

Package context within the Python ecosystem

The package builds on NumPy and Matplotlib packages, and is designed to conduct bootstrap sampling and visualization using them. scikit-learn has a utils module with a resample method which has bootstrapping functionality. But StrapvizPy streamlines the process of bootstrapping from data to visualization and embedding in documents which is not available as a single bundle. Some tutorials on bootstrap confidence intervals from machinelearningmastery.com and towardsdatascience.com encourage the reader to plot the results manually.

Installation

$ pip install strapvizpy

Usage

To import strapvizpy and check the version:

import strapvizpy
print(strapvizpy.__version__)

To import the suite of functions:

from strapvizpy import bootstrap
from strapvizpy import display

Please view our packaged documentation here.

Functions

  • bootstrap_distribution: Returns a sampling distribution of specified replicates is generated for a specified estimator with replacement for a given bootstrap sample size.
  • calculate_boot_stats: Calculates a confidence interval for a given sampling distribution as well as other bootstrapped statistics.
  • plot_ci: Creates a histogram of a bootstrapped sampling distribution with its confidence interval and observed sample statistic.
  • tabulate_stats: Generates a table that contains a given sampling distribution's mean and standard deviation along with relevant statistics as well as a summary table of the bootstrap distributions parameters. The code automatically saves the tables as html documents.

Contributing

Julien Gordon, Gautham Pughazhendhi, Zack Tang, and Margot Vore.

License

StrapvizPy was created by Julien Gordon, Gautham Pughazhendhi, Zack Tang, Margot Vore. It is licensed under the terms of the MIT license.

Credits

StrapvizPy was created with cookiecutter and the py-pkgs-cookiecutter template.

strapvizpy's People

Contributors

voremargot avatar zackt113 avatar gauthampughazhendhi avatar booleanjulien avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.