sharkwyf,github

agenta

The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.

block-recurrent-transformer

PyTorch implementation of "Block Recurrent Transformers" (Hutchins & Schlag et al., 2022)

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.

dreamerv3

Mastering Diverse Domains through World Models

frozenbilm

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

ivr

Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.

llama-factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

minedojo

MineDojo with the action space modified to MineRL style

minerl

MineRL Competition for Sample Efficient Reinforcement Learning - Python Package

motionclip

Official PyTorch implementation of the paper "MotionCLIP: Exposing Human Motion Generation to CLIP Space"

neuralmmo

Baselines for Neural MMO -- new users should treat this repo as a starter project

notion-feeder

🕸 A Node app for creating a Feed Reader in Notion.

online-dt

Online Decision Transformer

pdt

Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer

repoagent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

stable-alignment

An efficient, effective, and stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

swinbert

Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"

trajectory-transformer

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

trl

Train transformer language models with reinforcement learning.

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

sharkwyf's Projects