Giter Site home page Giter Site logo

csv-analyzer's Introduction

CSV Analyzer Demo

Screenshot

Usage

This project was built using Yarn as a package manager, it can be installed globally using Homebrew.

brew install yarn

Clone or download the repo to your local machine

git clone https://github.com/mattddonovan/csv-analyzer.git && cd csv-analyzer

Install dependencies with yarn

yarn install

See below for more script options.

Some notes

This is a very rough proof of concept single page application that attempts to take a comma-separated value (.csv) file consisting of user data / contact information. The only required fields right now are first_name and last_name as they're used to generate dynamic keys for sorting and comparison. Upon uploading a file, it automatically parses each row in the normal.csv test file and identifies exact duplicates as well as fuzzy matched duplicates using the Levenshtein Distance (LD) algorithm.

The output of the analysis is displayed in a sortable and paginated table, currently showing select fields for each row for the sake of keeping the UI uncluttered. The full output is displayed in the developer console and contains all values associated with each record. The code itself, specifically the CSVAnalyzer component is filled with comments explaining what's going on, so definitely take a look.

The UI is leveraging the Material-UI component library for its basic style and formatting.

There are a few known issues / todos:

  • The app is able to properly identify the duplicate values within a given dataset and remove them from the result set into a separate bucket. The console output correctly reflects this, but instead of outputting the potential duplicates in pairs (as asked in the assignment), it reports the duplicated rows that were omitted from the unique result set only. This is only a minor issue with the display output.
  • Some additional UI customization (transitions, notifications, etc) would be a nice to have.
  • Unit tests should be added for the CSVAnalyzer component, but for the sake of timeboxing the task they were omitted.

This project was bootstrapped with Create React App.

Available Scripts

In the project directory, you can run:

yarn start

Runs the app in the development mode.
Open http://localhost:3000 to view it in the browser.

The page will reload if you make edits.
You will also see any lint errors in the console.

yarn test

Launches the test runner in the interactive watch mode.
See the section about running tests for more information.

yarn build

Builds the app for production to the build folder.
It correctly bundles React in production mode and optimizes the build for the best performance.

The build is minified and the filenames include the hashes.
Your app is ready to be deployed!

See the section about deployment for more information.

csv-analyzer's People

Contributors

thebristolsound avatar

Stargazers

 avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.