Giter Site home page Giter Site logo

finalcapstone's People

Contributors

fiendskrah avatar sjsrey avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

Forkers

sjsrey jchen321

finalcapstone's Issues

exported environment.yml will not work on Linux machines.

Preliminary googling seems to indicate that the error is a result of exporting an environment.yml file from Windows to Linux. Upon running conda env create -f environment.yml, I get a short failure message with little guidance, but googling keyphrases seems to indicate that the file cannot be used to create the environment because certain package dependencies are 'windows only'.

Esda bug with local Moran's i

When attempting to calculate local Moran's, esda returns the following error:

"ValueError: cannot assign slice from input of different size"

See 400

Cell 06: constructing queen weight matrix
Cell 14: setting variable
Cell 23: attempting to calculate local Moran's i
@sjsrey

Index project

Data work and analysis for MPP entirely contained within project.ipynb and Analysis.ipynb. We can split the work up into smaller, more digestible notebooks. This makes data work easier to reference later and will allow the project to be easier to adapt to other purposes. Use numerical naming convention.

selecting districts for regression discontinuity design

Beginning investigation in RDD.ipynb. Considering that we still need both treatment and control districts:

  1. What criteria should be used to determine treatment vs. control districts?
  • Breakpoint approach makes the most sense, as the policy being evaluated has a strict implementation point (upc > 55%)
  1. Does it make sense to also evaluate the most disadvantaged districts (upc >~80%) vs. the most advantaged districts (upc < ~20%)? Is this a valid approach?

  2. How can we use geosnap/segregation for this project?

  • geographically select districts?
  • dissolve income data from geosnap census tracts into school districts?

understand missing grade data

in 500_matching.ipnyb:

When slicing 'master' dataframe for df[df['e_curr ALL'].isnull()], pandas returns 120 records with no grade data. I would like a list of these districts to then return to and query against the original unmerged ela and math datasets to ensure no grade data is being erroneously discarded.

Create a better README for 'front' of project

README should:

  • give a concise description of the policy (CAL SB-97 Concentration grant)
  • discuss research questions
  • explain applicability to other projects (use this as a reference guide for other policy analyses)
  • explain workflow
    • conda environment
    • notebook flow

pandas multi-indexing

One issue here is that I've had trouble merging both english and math scores onto one row of data corresponding to a specific student group within a district. I've tried left, right, inner, and outer merges.

I'd like the dataframe to be structured like so:

  • County
    • District
      • Student Group
        • English Score
        • Math Score

Instead, the student group gets duplicated for each set of scores.

  • County
    • District
      • Student Group
        • English score
      • Student Group
        • Math score

I got around this for the capstone by simply duplicating the analysis for a second dataframe.

  • County
    • District
      • Student Group
        • English score

and

  • County
    • District
      • Student Group
        • Math score

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.