sharkwyf's Projects

agenta

The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

dify

An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.

dreamerv3

Mastering Diverse Domains through World Models

frozenbilm

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

ivr

Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.

llama-factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

minerl

MineRL Competition for Sample Efficient Reinforcement Learning - Python Package

motionclip

Official PyTorch implementation of the paper "MotionCLIP: Exposing Human Motion Generation to CLIP Space"

neuralmmo

Baselines for Neural MMO -- new users should treat this repo as a starter project

pdt

Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer

repoagent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

stable-alignment

An efficient, effective, and stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

swinbert

Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"

trl

Train transformer language models with reinforcement learning.

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
