Giter Site home page Giter Site logo

us-census-data-engineering-project's Introduction

US-census-Data-Engineering-Project

Overview

Welcome to the US Census Data Engineering project, where I analyze the United States Census Bureau's 2017 Basic Monthly CPS using Apache Spark (Python).

Objective

  • Determine the count of responders per family income range.
  • Identify the top 10 counts of responders based on geographical division/location and race.
  • Assess the number of responders without a telephone at home but with access elsewhere, accepting telephone interviews.
  • Evaluate the number of responders with access to a telephone, but rejecting telephone interviews.

Datasets Used:

  • December 2017 United States Census Bureau’s Basic Monthly CPS Record (.dat)
  • January 2017 Basic Monthly CPS Data Dictionary File (.txt)

Process:

The project leverages Python to extract specified information from the dataset using the provided data dictionary file. Apache Spark is then employed for in-depth dataset analysis, addressing key questions outlined in the project objectives.

Feel free to explore the code, data, and insights in the repository. If you have any questions or suggestions, please don't hesitate to reach out.

us-census-data-engineering-project's People

Contributors

brightosas avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.