xzm2004
Name: xzm2004
Type: User
Bio: speech synthesis, TTS
Location: Xiamen
Regressing Robust and Discriminative 3D Morphable Models with a very Deep Neural Network
The authors' implementation of Unsupervised Adversarial Learning of 3D Human Pose from 2D Joint Locations
some code for nlp tour
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna
A TensorFlow Implementation of "Deep Multi-Scale Video Prediction Beyond Mean Square Error" by Mathieu, Couprie & LeCun.
The classical papers and codes about generative adversarial nets
A beauty-filter effect in a single line of code (based on GPUImage)
Audiogen Codec
Community list of startups working with AI in audio and music technology
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
160+ Algorithm & Data Structure Problems using C++
API for alignment of singing voice to lyrics as used in www.voicemagix.com. Core Machine Learning Algorithms are MLP neural networks and hidden markov models. Based on Django Rest Framework
License Plate Detection and Recognition in Unconstrained Scenarios
Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed library for building Deep Learning (DL) machine learning (ML) models
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
🧑🏫 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit), optimizers (adam, radam, adabelief), gans (dcgan, cyclegan, stylegan2), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, etc. 🧠
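Ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both exceed the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries
A Deep-Learning-Based Chinese Speech Recognition System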
Official Repository for Assem-VC @ INTERSPEECH 2021 SUBMITTED
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Attention OCR Based On Tensorflow
[WIP] Attention Is All You Need (Vaswani et al. 2017) by Chainer.
a library for audio and music analysis
A collection of Audio and Speech pre-trained models.
Audio super resolution using neural networks
Audio Coding Notebooks and Tutorials