Giter Site home page Giter Site logo

rsmahabir / applied-data-science-finalproj Goto Github PK

View Code? Open in Web Editor NEW

This project forked from charlie-moffett/applied-data-science-finalproj

0.0 1.0 0.0 245 KB

Natural language toolkit, scikit-learn and web scraping for topic modeling analysis of DCP's Greenpoint-Williamsburg Rezoning

Jupyter Notebook 99.45% Python 0.55%

applied-data-science-finalproj's Introduction

News Coverage on Williamsburg, Brooklyn and its Underlying Textual Themes

A Topic Modeling Study Using LDA and NMF

Samantha Currie, Lingyu Jin, Charlie Moffett

New York University

ABSTRACT

Williamsburg has a reputation as a beacon of modern-day gentrification in New York and beyond. Using the 2005 Rezoning of a low-density manufacturing section in Greenpoint’s and Williamsburg’s waterfront as a reflection point, this research aims to examine whether there are any observable changes in the topics found in articles from The New York Times on Williamsburg before and after the rezoning. We used two methods of topic modeling: Latent Dirichlet Allocation and Non-negative Matrix Factorization and applied it to a corpus containing all articles from January 1, 2000-December 31, 2010 derived from keyword searches on Williamsburg and 4 themes based on development and cultural consumption. The resulting topics highlighted the underlying context found within the corpus and our results displayed an overall increase in the number of articles focused on our topics, with Neighborhood Development as a dominant theme and Entertainment & Leisure as a supportive theme in most years. Comparing our results with a counterfactual analysis, we find the trends present in our Williamsburg corpus to be distinctive.

Keywords: Topic Models, Latent Dirichlet Allocation, Non-Negative Matrix Factorization, Neighborhood and Urban Development, Gentrification, Cultural Consumption, Text Analysis

applied-data-science-finalproj's People

Contributors

charlie-moffett avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.