allensmile / speech_dataset

This project is forked from double22a/speech_dataset.

The dataset of Speech Recognition

License: Apache License 2.0

Datasets for Speech Recognition

Chinese

name	duration/h	address	remark
THCHS-30	30	https://openslr.org/18/
Aishell	150	https://openslr.org/33/
ST-CMDS	110	https://openslr.org/38/
Primewords	99	https://openslr.org/47/
aidatatang	200	https://openslr.org/62/
MagicData	755	https://openslr.org/68/
ASR&SD	160	http://ncmmsc2021.org/competition2.html	if available
Aishell2	1000	http://www.aishelltech.com/aishell_2	if available
TAL ASR	100	https://ai.100tal.com/dataset
Common Voice	63	https://commonvoice.mozilla.org/zh-CN/datasets	Common Voice Corpus 7.0
ASRU2019 ASR	500	https://www.datatang.com/competition	if available
2021 SLT CSRC	398	https://www.data-baker.com/csrc_challenge.html	if available
aidatatang_1505zh	1505	https://datatang.com/opensource	if available
WenetSpeech	10000	https://github.com/wenet-e2e/WenetSpeech	if available

English

name	duration/h	address	remark
Common Voice	2015	https://commonvoice.mozilla.org/zh-CN/datasets	Common Voice Corpus 7.0
LibriSpeech	960	https://openslr.org/12/
ST-AEDS-20180100	4.7	http://www.openslr.org/45/
TED-LIUM Release 3	430	https://openslr.org/51/
Multilingual LibriSpeech	44659	https://openslr.org/94/	limited supervision
SPGISpeech	5000	https://datasets.kensho.com/datasets/scribe	if available
Speech Commands	10	https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/data
2020AESRC	160	https://datatang.com/INTERSPEECH2020	if available
GigaSpeech	10000	https://github.com/SpeechColab/GigaSpeech	if available
The People’s Speech	31400	https://openreview.net/pdf?id=R8CwidgJ0yT
Earnings-21	39	https://arxiv.org/abs/2104.11348
VoxPopuli	24100+543	https://arxiv.org/pdf/2101.00390.pdf	24100 (unlabeled), 543 (transcribed)

Chinese-English

name	duration/h	address	remark
TAL CSASR	587	https://ai.100tal.com/dataset
ASRU2019 CSASR	200	https://www.datatang.com/competition	if available

Japanese (ja-JP)

name	duration/h	address	remark
Common Voice	26	https://commonvoice.mozilla.org/zh-CN/datasets	Common Voice Corpus 7.0
Japanese_Scripted_Speech_Corpus_Daily_Use_Sentence	18	https://magichub.io/cn/datasets/japanese-scripted-speech-corpus-daily-use-sentence/
LaboroTVSpeech	2000	https://arxiv.org/pdf/2103.14736.pdf
CSJ	650	https://github.com/kaldi-asr/kaldi/tree/master/egs/csj

Korean (ko-KR)

name	duration/h	address	remark
korean-scripted-speech-corpus-daily-use-sentence	4.3	https://magichub.io/cn/datasets/korean-scripted-speech-corpus-daily-use-sentence/
korean-conversational-speech-corpus	5.22	https://magichub.io/cn/datasets/korean-conversational-speech-corpus/

Russian (ru-RU)

name	duration/h	address	remark
Common Voice	148	https://commonvoice.mozilla.org/zh-CN/datasets	Common Voice Corpus 7.0
OpenSTT	20000	https://arxiv.org/pdf/2006.08274.pdf	limited supervision

Noise & nonspeech

name	duration/h	address	remark
MUSAN	-	https://openslr.org/17/
Room Impulse Response and Noise Database	-	https://openslr.org/28/
AudioSet	-	https://ieeexplore.ieee.org/document/7952261

Datasets for Speech Synthesis

Chinese

name	duration/h	address	remark
Aishell3	85	https://openslr.org/93/

English

name	duration/h	address	remark
Hi-Fi Multi-Speaker English TTS Dataset	291.6	https://openslr.org/109/
LibriTTS corpus	585	https://openslr.org/60/

Datasets for Speech Recognition & Speaker Diarization

Chinese

name	duration/h	address	remark
Aishell4	120	https://openslr.org/111/	8-channel, conference scenarios
ASR&SD	160	http://ncmmsc2021.org/competition2.html	if available

English

name	duration/h	address	remark
CHiME-6	-	https://chimechallenge.github.io/chime6/download.html	if available

Datasets for Speaker Recognition

Chinese

name	duration/h	address	remark
CN-Celeb	-	https://openslr.org/82/

English

name	duration/h	address	remark
VoxCeleb Data	-	http://www.robots.ox.ac.uk/~vgg/data/voxceleb/
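
The tables in this list are tab-separated (name, duration/h, address, remark), so they can be read with a few lines of Python, for example to total the hours in one table. The sketch below is only an illustration and not part of the original project: the function names (`parse_duration`, `parse_table`, `total_hours`) are hypothetical, `SAMPLE` holds four rows copied from the Chinese speech recognition table, and it assumes that `-` means "duration not listed" and that a cell such as `24100+543` should be read as a sum.

```python
import csv
import io


def parse_duration(cell):
    """Return hours as a float, or None when the table lists '-'."""
    cell = cell.strip()
    if cell in ("", "-"):
        return None
    # Assumption: a cell such as '24100+543' is the sum of its parts.
    return sum(float(part) for part in cell.split("+"))


def parse_table(text):
    """Parse one tab-separated table (header row included) into dicts."""
    reader = csv.reader(io.StringIO(text), delimiter="\t")
    header = next(reader)
    rows = []
    for fields in reader:
        if not fields:
            continue  # skip blank lines
        # Pad rows whose trailing 'remark' cell is missing.
        fields = fields + [""] * (len(header) - len(fields))
        row = dict(zip(header, fields))
        row["hours"] = parse_duration(row["duration/h"])
        rows.append(row)
    return rows


def total_hours(rows, skip_remark=None):
    """Sum known durations, optionally skipping rows with a given remark."""
    return sum(
        row["hours"]
        for row in rows
        if row["hours"] is not None and row["remark"] != skip_remark
    )


# Four rows copied from the Chinese speech recognition table.
SAMPLE = (
    "name\tduration/h\taddress\tremark\n"
    "THCHS-30\t30\thttps://openslr.org/18/\n"
    "Aishell\t150\thttps://openslr.org/33/\n"
    "ST-CMDS\t110\thttps://openslr.org/38/\n"
    "Aishell2\t1000\thttp://www.aishelltech.com/aishell_2\tif available\n"
)

if __name__ == "__main__":
    rows = parse_table(SAMPLE)
    print(total_hours(rows))                              # all four rows
    print(total_hours(rows, skip_remark="if available"))  # unconditional rows only
```

Passing `skip_remark="if available"` leaves out the entries whose remark is "if available", which gives a rough count of the hours listed without that condition.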
