
Welcome to Yang's GitHub

Hi there! I am an MS student from Cornell, focusing on scalable language modeling, data generation, and agent systems.

I am happy to chat and discuss potential collaborations. Feel free to reach out via:

Linkedin Twitter Gmail WeChat

🌟 Studying Zone

I am collaborating with BigCode, Cornell ICPC and Millennium to build efficient LLMs for code and data generation.

  • This work is called ALICE (Aligning Language models for Interactive Code Execution); find out more at alicellm.github.io.
  • ALICE is a meta-agent collaboration system that generates high-quality data through multi-turn interactions and feedback without human intervention.
  • It produces multimodal data with traces from agent strategies like ReAct and Reflexion, which are scarce but offer potential for aligning advanced LLMs.

Previously, with Cornell XRC, I led Voice2Action, the predecessor of ALICE and a Unity package for real-time code execution in VR.

I am also working on large-scale generation-augmented retrieval systems (as opposed to RAG) at Cornell NLP.

I used to work on graph machine learning at AWS AI Lab (2021-2022) and contribute to the open-source Deep Graph Library.

👀 Chilling Zone

I like programming! I lead the "Cornell Tech" group at Cornell ICPC and placed in the top 20% at the 2023 Regional!

LeetCode CodeForces

I enjoy cooking, listening to music of all forms, playing ping-pong, reading science fiction, and more!

⚡ Developing Zone

📈 "Accepted" Zone

Yang Su's Projects

code-library

Templates, algorithms and data structures implemented and collected for programming contests.

cp-practice

survive-blue-tier-not-dropping competition now begins!

dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

dgl-ke

High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.

game-of-chess

Manual implementation of MVC in C++ using the observer pattern. Built in a group of three; credit to Vinh Phu Nguyen and Daniel Yang.

gomoku-ai-from-scratch

Tackles the game of Gomoku using a symbolic network and a redesigned minimax algorithm with alpha-beta pruning. Credit to Yumeng Yao and Yu Li.

gradcache

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

lightning

Deep learning framework to train, deploy, and ship AI products Lightning fast.

nlp-notebook

NLP fundamentals and re-implementations of popular NLP architectures.

state-space-interpretability

Investigation of state space model interpretability using SHAP (SHapley Additive exPlanations). Co-authored with Yin Li and Lancaster Wu.

stock-prediction

Stock prediction using PySpark and TensorFlow; analyzed the RF predictor's results with the Twitter Developer API. Co-authored with Emily Ye and Jack Zhu.

voice2action

ALICE and its prior work, Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in Virtual Reality
