Giter Site home page Giter Site logo

Hello! 👋

My name is Kevin Kho. I am currently working on Fugue, a minimal interface to bring Python, Pandas, and SQL code to Spark, Dask, and Ray. Most recently, I was at Prefect as an Open Source Community Engineer where I managed the Slack community and created content. Before working on open-source tooling, I was a data scientist for four years across Paylocity and Itron.

I am currently contracting part time with Citi helping them scale compute workflows to distributed computing. I am looking for more contract opportunities around big data.

📭    Contact me!

Feel free to reach out to me for anything data related. I talk to people about big data, data artichecture, data engineering, and data careers. Always happy to speak at meetups or company meetings about the things I'm working on.

Website: https://kevinkho.com/

Email: [email protected]

LinkedIn: https://www.linkedin.com/in/kvnkho

🌎    Location

I am currently based out of Chicago. Always happy to meet people in person.

📝    Blogs

I mainly write about the things I am working on. Here are some:

My Medium Profile will have all of my articles.

📢    Conference Talks

I've given a couple of talks about Fugue, Prefect, and distributed computing. Here are some:

🎤    Podcasts

💙    Community

I am involved in some other things:

  • DataKind - I volunteered for two projects helping non-profits with data science/data engineering work
  • Orlando Machine Learning and Data Science - I organized/co-organized this Meetup for 4 years
  • Adventurous Analytics - I advise non-profit data science consulting projects, primarily around the Florida foster care system
  • Conference Involvement:
    • SciPy 2022 Data Life Cycle Track Co-chair
    • PyData Seattle 2023 Volunteer

🤓    Other Interests

  • Mechanical Keyboards
  • Basketball
  • Kpop

Kevin Kho's Projects

dask-sql icon dask-sql

Distributed SQL Engine in Python using Dask

datacompy icon datacompy

Pandas and Spark DataFrame comparison for humans

demos icon demos

Collection of code snippets for blogs, conferences, and talks

ds-optimus icon ds-optimus

How to do data science with Optimus, Spark and Python.

fugue icon fugue

An abstraction layer that ports Python, pandas and SQL code to Spark or Dask, making big data projects small.

pandera icon pandera

A light-weight, flexible, and expressive pandas data validation library

prefect icon prefect

The easiest way to automate your data

prefect-docker-compose icon prefect-docker-compose

A simple guide to understand Prefect and make it work with your own docker-compose configuration.

scaled icon scaled

Scaled Protocol Python Implementation

triad icon triad

A collection of python utility functions

unstructured icon unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.