Giter Site home page Giter Site logo

coursera_cleaning_data's Introduction

Cleaning_data

Course project for Coursera Getting and Cleaning Data

The run_analysis code performs the following tasks:

Imports three test files (subjects, activities and data), three training files (subjects, activities and data) as well as two files with data lables (activity names and variable names).

For both the test and training files, the analysis first matches the activities and activity names, creating a new column labeling each activity. Then it renames the columns in the data files using the variable names. Finally it merges the data with the subjects and activities.

Next, the training and test data are combined, to create one data file (alldata)

Then just the columns with the mean and standard deviation are extracted, and a new file is created with just these columns, plus the subjects and activities).

Finally, I subsetted the data to split out by subject, then by activity, to find the mean of all variables for each activity for each subject. I used loops here, although i'm sure there's a better way. I had each loop add a row to the final dataset, and then wrote the dataframe to a txt file.

coursera_cleaning_data's People

Contributors

kpazoles avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.