Giter Site home page Giter Site logo

cpsenrollment's Introduction

Introducing CPS Enrollment Datasets

Charlotte Mack March 15, 2019

CPSenrollment

Exploring enrollment data from the Chicago Public Schools

This repository contains exploratory analysis and visualizations of enrollment data that were extracted from Chicago Public Schools (CPS) public records. The prepared data are available at this repository in Rds and csv formats; at present there are only high school data, with elementary school data forthcoming. The sets span the school years from 2006--2007 through 2018--2019, and will be periodically updated.

A glimpse of the high schools data:

## Warning in gzfile(file, "rb"): cannot open compressed file './
## enrollment_all_hs.Rds', probable reason 'No such file or directory'

## Error in gzfile(file, "rb"): cannot open the connection

## Error in eval(lhs, parent, parent): object 'enrollment_hs' not found

The original CPS data are available in a series of spreadsheets at CPS website[1] Downloadable data files that I have prepared are in this repository with Rds and csv extensions, downloadable through the links below. The Rds files, which are used in R language programming, may have some type designations that are not in the csv files, but there should be no other difference.

Rds file for all CPS high schools, September 2006 to September 2018

csv file for all CPS high schools, September 2006 to September 2018

Rds files of high school locations as of 2014--2015 school year, with enrollments from 2016--2017 school year, and from 2006--2007 school year These data were used to construct the school maps linked below.

A more detailed overview of the all-years enrollment data is in the file CPSenrollment.md.

A brief article with maps of District high schools under regular and other governance for the 2006--2007 and 2016--2017 school years is in the file school_maps.md.

The preparation of the data 2006-2017 is described in data.R and clean.R. data.R is a protocol in the form of a script of the first interactive steps that I took in preparing the unified data set. The protocol here takes the data from the original set of annual membership rosters, published in spreadsheets and having various sets and arrangements of variables, to R data.frames of uniform layout. I undertook the project when I was fairly new to R, so it is not as automated as it might be; on the other hand, each year's roster from the Chicago Public Schools has idiosyncrasies that can only be found and worked out manually. clean.R examines missing data more closely and creates a uniform set of names for each school to be used throughout the data.frame.

The protocol for the 2018-2019 update is in data_2018.R, for those who want a more compact overview.

Miscellaneous exploratory analysis, mostly in graphs is in explorechange a subset with dplyr transmute.R.

[1] Membership data are under the heading "Demographics."

cpsenrollment's People

Contributors

cymack avatar justinmshea avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

cpsenrollment's Issues

To do: restore some data

To do: Design and create a data frame or table that preserves the network, area, etc designations of schools in different years.

Correction: re RCurl

Get RCurl working to download from the CPS site directly in data_2018.R and elsewhere. Currently I get either a hang or an error.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.