Giter Site home page Giter Site logo

nlp-notebook's Introduction

Welcome to Yang's GitHub

Hi there! I am an MS student from Cornell, focusing on scalable language modeling, data generation, and agent systems.

I am happy to chat and discuss potential collaborations, feel free to reach out by

Linkedin Twitter Gmail WeChat

๐ŸŒŸ Studying Zone

I am collaborating with Cornell ICPC and Millennium to build efficient LLMs for code and data generation.

  • This work is called ALICE (Aligning Language models for Interactive Code Execution), find more about it at alicellm.github.io.
  • ALICE is a meta-agent collaboration system that generates high-quality data through multi-turn interactions and feedback without human intervention.
  • It produces multimodal data with traces from agent strategies like ReAct and Reflexion, which are scarce but offer potential for aligning advanced LLMs.

Previously, I led the prior work of ALICE called Voice2Action with Cornell XRC, an Unity Package for real-time code execution in VR.

I am also working on large-scale generation augmented retrieval systems (opposed to RAG) at Cornell NLP.

I used to work on graph machine learning at AWS AI Lab (2021-2022) and contribute to the open source Deep Graph Library.

๐Ÿ‘€ Chilling Zone

I like programming! I lead the "Cornell Tech" Group at Cornell ICPC and won the Top 20% in 2023 Regional!

LeetCode CodeForces Visitors

I enjoy cooking, listening to music of all forms, playing ping-pong, reading science fiction, and more!

โšก Developing Zone

๐Ÿ“ˆ "Accepted" Zone

nlp-notebook's People

Contributors

yang-su2000 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.