aniki-ly

followers: 16 following: 36 repos: 28 gists: 0

Name: Aniki

Type: User

Company: University of Technology Sydney

Bio: Philosophy Matters

Location: Sydney

Blog: yulu.net.cn

Aniki's Projects

aniki-ly.github.io

arxiv_sanity

awesome-cross-modal-video-moment-retrieval

A continuously updated list of cutting-edge papers on video moment localization, temporal language grounding, and video clip retrieval.

awesome-language-model-with-vision

Resources related to vision and language models

awesome-segment-anything

A collection of resources on Segment Anything (SAM), including the latest papers and demos

awesome-source-free-test-time-adaptation

[2022] A curated list of papers in Test-time Adaptation, Test-time Training and Source-free Domain Adaptation

awesome-video-diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

awesome-video-diffusion-models

[Arxiv] A Survey on Video Diffusion Models

cris.pytorch

An official PyTorch implementation of the CRIS paper

datacomp

DataComp: In search of the next generation of multimodal datasets

direct2v

flowzero

FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax

gen-l-video

The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".

if

langchain

⚡ Building applications with LLMs through composability ⚡

layoutgpt

Official repo for LayoutGPT

llm-in-vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

maskclip

Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral)

medsegdiff

Official implementation of paper "MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model"

palm-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

rpg-diffusionmaster

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

scienceplots

Matplotlib styles for scientific plotting

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

stable-diffusion

videox

VideoX: a collection of video cross-modal models

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

visual-chatgpt

Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
