Kedreamix's Projects
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
A curated collection of high-quality, interesting open-source projects on GitHub.
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
TTS
CRNN-pytorch
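Exercise solutions for "Dive into Deep Learning" (《动手学深度学习》); read online at the following address:
Deep learning for image processing, including classification, object detection, etc.
Deep Learning Paper Reading (extensive paper reading)
Here I record my step-by-step journey through machine learning. I will keep working at it, bit by bit, haha.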
[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"
Eat That TensorFlow 2.0 in 30 Days
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
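GAN Step By Step (GSBS): as the name suggests, I hope to learn GANs one step at a time. A GAN, or generative adversarial network, is an unsupervised learning approach that has been very popular in recent years and can generate highly realistic photos, images, and even videos. GANs opened up a new area of image generation; since their introduction in 2014 they have played an increasingly important role in computer vision, producing a wide variety of new work every year, and there is a great deal of GAN theory and development to cover. Many people seem to be studying GANs lately, but material in China on GAN theory and implementation may still be scarce, so I would like to learn together with everyone and provide implementation tutorials in the mainstream frameworks **pytorch, tensorflow, keras**. At a seminar in 2016, `Yann LeCun` described generative adversarial networks as "`the coolest idea in machine learning in the last twenty years`".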
Official repository for 'GaussianHead: Impressive Head Avatars with Learnable Gaussian Diffusion'
A detailed explanation of the formulae behind Gaussian splatting
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
Solutions to Huawei OD algorithm problems. Goal: add C++ solutions and learn the related concepts.
Image Web-App using Streamlit
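Image classification with Keras, covering a range of models such as AlexNet, VGG, GoogleNet, ResNet, and DenseNet, with complete runnable code. Colab notebooks are also provided, so the code can be run online in Colab and the results viewed directly. The models can also be applied to your own dataset for transfer learning.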
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬