yuan-manx Goto Github PK

followers: 268.0 following: 338.0 repos: 426.0 gists: 0.0

Name: Yuan-Man

Type: User

Bio: Sound/Music/AI/Game/Code/Design

Location: Shanghai, China

Blog: [email protected]

Hi there，Welcome to my Art and Technology Creative Space！ 🌏🌌🌊

I'm Derrick / Yuan Man（袁满）！👋

Research & Hobbies： Sound, Music, AI, Game Development, Code, Design, etc. 🎸🎹🥁🎻🎺🎤🎧

SouPyX - Audio Toolkit 🎵
ArtNex - Deep Learning Framework 🚀
SoundHub - AI Audio Framework 🛸
DataForm - Data Processing Toolkit 🔥
NexEngine - Game Engine 🎮
MultiClip - MultiModal Clip 🤖

AI Research:

AI Resources
- Game : AI Game DevTools (AI-GDT) 🎮, Game Engine 🎮,
- Dataset : AI Audio Datasets (AI-ADL) 🎵,
- LLM : LLM App Stack, 🤖 Awesome ChatGPT,
- Agent : AI Agent Roadmap, AI Voice Agents, Audio AI Agent,
- Multimodal : AI Multimodal Timeline,
- Audio : Audio AI Timeline, AI Audio Startups, Audio Development Tools (ADT) 🔥,Large Audio Models, 🔱 Speech Trident, Open-Source Audio Plugins & Apps, Awesome Music Informatics, AudioLLMs, Awesome Large Language Models in Audio AI,
- ComfyUI : ComfyUI Tools Roadmap,
- Mamba : Awesome State-Space Resources for ML,
- AI-Startups : AI-Startups 🚀,
AI Project
- AI OS : 01 Project,
- LLama : LLama Agentic System,
- Audio : GPT-SoVITS, ChatTTS, StableTTS, SoundHub,
- Video/Image : Open-Sora, DiffSynth Studio,
- AI Search Engine : MindSearch, ScrapeGraphAI, RAGoon, SearchPhi,
- LLM/Dataset : ArtNex, DataForm,
- Tool : FastHTML,

I love everything I love！

“日日行不怕千万里，时时做不惧千万事。”

Yuan-Man's Projects

3d-game-development

3D game development project demo - reference Doom style related materials.

3dti_audiotoolkit_unitywrapper

abclib

Faust code for ambisonic and multi-channel mixed music

abletonparsing

Parse an Ableton ASD clip file (warp markers and more) in Python

ace_phonemes

a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

ai-audio-processing-methods

ai audio processing methods

ai-audio-startups

Community list of startups working with AI in audio and music technology

ai-collection

The Generative AI Landscape - A Collection of Awesome Generative AI Applications

ai-development-model

This is a model library for AI development, containing many algorithms and models for artificial intelligence.

ai-game-devtools

Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥

ai-multimodal-timeline

Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥

ai-rpi-detection

AI Raspberry Pi cat detection and notification: get a text when your cat does something it's not supposed to do, and have AI narrate what it sees. Generalizable across other use cases outside of cats

ai-startups

AI Startups are all you need! Here we will track the latest AI Startups, including AI Applications, AI Developer Tools, AI Infrastructure and AI Hardware. 🔥

ai-voice-agents

AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! 🎙️🤖🎧

ai-vtuber

AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain（本地/llm）/chatglm/text-generation-webui/闻达】驱动的虚拟主播【Live2D】，可以在【Bilibili/抖音/快手/斗鱼】直播中与观众实时互动或直接在本地进行聊天。它使用自然语言处理和文本转语音技术【edge-tts/VITS/elevenlabs/bark-gui】生成对观众问题的回答并可以选择【so-vits-svc/DDSP-SVC】变声；通过特定指令协同Stable Diffusion进行画图展示。并且可以自定义文案进行循环播放。

ai_beatmap_generator

尝试使用神经网络生成音乐游戏Malody的谱面。

aigc-development-diffusion

This is a library of models for artificial intelligence-generated art, containing many Diffusion algorithms and models for artistic content creation.

aigc-development-gan-list

This is a library of models for artificial intelligence-generated art, containing many algorithms and models for artistic content creation.

aigc-development-vae-list

This is a library of models for artificial intelligence-generated art, containing many VAE algorithms and models for artistic content creation.

aisfx

Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.

alien-isolation-audio-extractor

A simple tool to export and name sound files within Alien: Isolation.

ambisonics2binaural_simple

A simple Python script to convert FOA audio to binaural.

amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

yuan-manx Goto Github PK

Hi there，Welcome to my Art and Technology Creative Space！ 🌏🌌🌊

I'm Derrick / Yuan Man（袁满） ！👋

Research & Hobbies： Sound, Music, AI, Game Development, Code, Design, etc. 🎸🎹🥁🎻🎺🎤🎧

AI Research:

I love everything I love！

“日日行不怕千万里，时时做不惧千万事。”

Yuan-Man's Projects

Recommend Projects

Recommend Topics

Recommend Org

I'm Derrick / Yuan Man（袁满）！👋