ishine's Projects
Predict prosody labels for Chinese sentences.
code of ProsodySpeech: Towards Advanced Prosody Model for Neural Text-to-Speech (ICASSP2022)
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
An official implement of "PSSRF: Learning to restore Pitch-Scaled Speech without reference"
clone / cvs-import of pthread-win32 + local tweaks (including MSVC2008, MSVC2010 and MSVC2012 project files)
Parallel TTS web demo based on Flask + Vue (Vuetify). 基于 Flask + Vue 的语音合成单网页演示项目。
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))
Punctuation restoration using TensorFlow
Support tools for punctuation and boundary detection for ASR output.
A TensorFlow Implementation of Punctuation Restoration.
A Pytorch based LSTM Punctuation Restoration Implementation/A Simple Tutorial for Leaning Pytorch and NLP
Token-Level Supervised Contrastive Learning for Punctuation Restoration
Punctuation prediction
JP puncuator
A small seq2seq punctuator tool based on DistilBERT
官方py3AIML基于英文,现为其增加中文支持,并将代码注释翻译为中文。实测可正常解析带中文pattern和template的aiml文件。
simple and efficient python implemention of a series of adaptive filters (lms、nlms、rls、kalman、Frequency Domain Adaptive Filter、Partitioned-Block-Based Frequency Domain Adaptive Filter、Frequency Domain Kalman Filter、Partitioned-Block-Based Frequency Domain Kalman Filter) for acoustic echo cancellation.
Speaker diarization python system based on binary key speaker modelling
PyTorch implementation of LF-MMI for End-to-end ASR
Some Python code for researching concatenative synthesis
simple to use, pretrained/training-less models for speaker diarization
Python toolkit for likelihood-ratio calibration of binary classifiers
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
Cython implement LPC net
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
A python implementation of Speech intelligibility in bits (SIIB)