
he yong jun's Projects

ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

audioclassification-pytorch

A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.

awesome-ai-books

A collection of AI-related books and PDFs for learning and downloading, along with some playground models for learning.

awesome-llm-and-aigc

🚀🚀🚀 A collection of awesome public projects about Large Language Models, Vision Foundation Models, and AI-Generated Content.

cogvideo

Text-to-video generation. The repo for the ICLR 2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers".

easyanimate_imgtovideo

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

golangbooks

A list of the best books for becoming a specialist in Golang.

gorgonia

Gorgonia is a library that helps facilitate machine learning in Go.

holmes

self-aware Golang profile dumper

localai

🤖 The free, open-source OpenAI alternative. Self-hosted, community-driven, and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers, and many more model architectures. It can generate text, audio, video, and images, and also has voice cloning capabilities.

postgres-operator

Postgres operator creates and manages PostgreSQL clusters running in Kubernetes

text2video-zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

textstovideo

Uses the Stable Diffusion API to generate images and combines them with a set of Python media-processing libraries to produce very simple novel-promotion posts.

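videowater_videocut

Batch video processing: bitrate settings, format conversion, adding subtitles, adding watermarks, scrolling text marquees, watermark removal, resolution changes, video cropping, playback speed adjustment, video splitting, video merging, video mirroring, background music, inserting background images, Gaussian blur, blurred border extension, picture-in-picture, subtitles, translation, film and TV commentary, film and TV mashups, Douyin product promotion, fully automatic video editing, and batch video editing.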

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
