Giter Site home page Giter Site logo

MaxMax's Projects

only-noisy-training icon only-noisy-training

** A self-supervised speech denoising strategy named Only-Noisy Training (ONT), which solves the speech denoising problem with only noisy audio signals in audio space for the first time.

open-llms icon open-llms

经典 📋 A list of open LLMs available for commercial use.

openit icon openit

致力于打造免费无感的翻墙环境

openmmd icon openmmd

MMD舞蹈 OpenMMD is an OpenPose-based application that can convert real-person videos to the motion files (.vmd) which directly implement the 3D model (e.g. Miku, Anmicius) animated movies.

opensvip icon opensvip

歌声合成工程转换 An open framework and intermediary model for converters among project files of various singing voice synthesizers

opentts icon opentts

语音合成服务器开发 Open Text to Speech Server

openutau icon openutau

OpenUTAU renderer for diffsinger / 适用于diffsinger的OpenUTAU渲染器,使用方法:https://github.com/xunmengshe/OpenUtau/wiki/%E4%BD%BF%E7%94%A8%E6%96%B9%E6%B3%95%EF%BC%88%E4%B8%AD%E6%96%87%EF%BC%89

overflow icon overflow

Probabilistic speech syntheses by mixing neural HMM TTS with normalising flows

pafx icon pafx

音效 Python Audio Effects

palm-rlhf-pytorch icon palm-rlhf-pytorch

对话模型 Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

paper2gui icon paper2gui

实用工具 Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术

paralip icon paralip

Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code

parallel-tacotron2 icon parallel-tacotron2

可微时长模型 PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

parselmouth icon parselmouth

【Praat音频分析】 in Python, the Pythonic way

pasd icon pasd

Pixel-Aware Stable Diffusion

pats icon pats

数字人手势生成 PATS Dataset. Aligned Pose-Audio-Transcripts and Style for co-speech gesture research

penn icon penn

基音预测 Pitch Estimating Neural Networks (PENN)

percepnet icon percepnet

RNNoise升级版,比赛实时赛道第二名 (Work In Progress) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

pesto icon pesto

轻量级音高估计,Self-supervised learning for fast pitch estimation

pesto-full icon pesto-full

音高检测,Full models and training code for PESTO

phaseaug icon phaseaug

相位声码器 Submitted to ICASSP 2023

phoneix icon phoneix

歌声合成 PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor

phonelm icon phonelm

(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.

phonemizer icon phonemizer

支持中文 Simple text to phones converter for multiple languages

phonmatchnet icon phonmatchnet

自定义唤醒词,Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.