Giter Site home page Giter Site logo

Hi there πŸ‘‹

I'm interested in Large-scale Engineering, Data Engineering, Representation Learning, Multi-modal Understanding, Training Optimization, Data Curation

Experience

πŸ› οΈ LLM Data Engineer (now) - 42dot

πŸ” Research Intern - Kakaobrain

🌿 Research Intern @kakaobrain

πŸ‡ΊπŸ‡Έ Intern as a UI developer - Wavity

Education

πŸ‡°πŸ‡· Bachelor degree of Computer Science Engineering at Sogang University (2012 - 2019)

πŸ‡°πŸ‡· Master degree of Computer Science Engineering at Sogang University (2020 - 2022)

Competitions

πŸ₯ˆ 2020 Korea Health Dataton 2nd Prize (Binary Classification on Breast Cancer Pathology Image)

πŸ₯‡ 2020 Naver AI Rush Challenge, 1st Prize on 3 Areas (Auto Tagging on Naver Shopping Image, Mood Classification on Music, Genre Classification on Japanese Music)

Projects and Publications

πŸ“š coyo-700M Dataset: A large-scale dataset aimed at enhancing data curation and multi-modal understanding, publicly released for the research community. Check it out here: coyo-700M.

✍️ ViT Alignment Blog Post on Hugging Face: Based on the coyo-700M dataset, this blog post discusses the reproduction of Vision Transformer (ViT) models. Read the blog post: vit-align.

sungjun lee's Projects

datasets icon datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

dex icon dex

Index and query analyzer for MongoDB: compares MongoDB log files and index entries to make index recommendations

dotfiles icon dotfiles

My personal collection of various dotfiles

face.evolve.pytorch icon face.evolve.pytorch

πŸ”₯πŸ”₯High-Performance Face Recognition Library on PyTorchπŸ”₯πŸ”₯

face_recognition icon face_recognition

The world's simplest facial recognition api for Python and the command line

facenet-pytorch icon facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and recognition (InceptionResnet) models

generative-inpainting-pytorch icon generative-inpainting-pytorch

A PyTorch reimplementation for paper Generative Image Inpainting with Contextual Attention (https://arxiv.org/abs/1801.07892)

glimpse_clouds icon glimpse_clouds

Pytorch implementation of the paper "Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points", F. Baradel, C. Wolf, J. Mille , G.W. Taylor, CVPR 2018

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.