Yuan-Man's Projects
The open-source language model computer
3D game development project demo - reference Doom style related materials.
Faust code for ambisonic and multi-channel mixed music
Parse an Ableton ASD clip file (warp markers and more) in Python
a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine
Explore the latest AI Agent Framework!
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
ai audio processing methods
Community list of startups working with AI in audio and music technology
The Generative AI Landscape - A Collection of Awesome Generative AI Applications
This is a model library for AI development, containing many algorithms and models for artificial intelligence.
Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥
Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥
AI Raspberry Pi cat detection and notification: get a text when your cat does something it's not supposed to do, and have AI narrate what it sees. Generalizable across other use cases outside of cats
AI Startups are all you need! Here we will track the latest AI Startups, including AI Applications, AI Developer Tools, AI Infrastructure and AI Hardware. 🔥
AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! 🎙️🤖🎧
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain(本地/llm)/chatglm/text-generation-webui/闻达】 驱动的虚拟主播【Live2D】,可以在 【Bilibili/抖音/快手/斗鱼】 直播中与观众实时互动 或 直接在本地进行聊天。它使用自然语言处理和文本转语音技术【edge-tts/VITS/elevenlabs/bark-gui】生成对观众问题的回答并可以选择【so-vits-svc/DDSP-SVC】变声;通过特定指令协同Stable Diffusion进行画图展示。并且可以自定义文案进行循环播放。
尝试使用神经网络生成音乐游戏Malody的谱面。
This is a library of models for artificial intelligence-generated art, containing many Diffusion algorithms and models for artistic content creation.
This is a library of models for artificial intelligence-generated art, containing many algorithms and models for artistic content creation.
This is a library of models for artificial intelligence-generated art, containing many VAE algorithms and models for artistic content creation.
Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & Non-UCS-compliant datasets.
A simple tool to export and name sound files within Alien: Isolation.
A simple Python script to convert FOA audio to binaural.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
AMY - the Additive Music synthesizer librarY
Local & Open Source Alternative to CharacterAI
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Plugin for testing connection to an API using JUCE