I read the FAQ on page . But

Missing transcripts? about 3d-speaker HOT 4 CLOSED

chenht2021 commented on July 30, 2024

Missing transcripts?

from 3d-speaker.

Comments (4)

GeekOrangeLuYao commented on July 30, 2024

Currently, our text annotations are only available for audio clips recorded with DIRECTIONAL devices. The reason for this is that we focus on annotating clear and distinct audio rather than using audio data that is not as clear, such as those from far-field recordings or in dialects. Our dataset is more focused on speaker-related tasks. If further text annotation releases, we will update the information on our website.

from 3d-speaker.

chenht2021 commented on July 30, 2024

Thanks for your explanation.
Ok, maybe off topic, if not appropriate, pls close it.
I read LAURAGPT, It says the the trainning data of TTS is LibriTTS and 3D-Speaker, and copied it 2 times, so the number of samples is 5.0M.
LibriTTS train set is about 206K, and all 3D-Speaker's train set is about 643k, if count annotations, it will be less.
So the number of samples for trainning TTS is wrong? should be 500k?

from 3d-speaker.

GeekOrangeLuYao commented on July 30, 2024

In the experiment with LauraGPT, data from the highest quality device of 3D-Speaker Datasets was utilized, and certain data augmentation was performed. For specific data details, please refer to the original paper.

from 3d-speaker.

GeekOrangeLuYao commented on July 30, 2024

After double-checking with the authors, it appears that the LibriTTS data you provided seems to be smaller than expected. Additionally, we have also utilized data from aishell-1,2,3 in the TTS tasks, which was inadvertently omitted in the current preprint version of our paper. We will rectify this detail in our subsequent revisions.

from 3d-speaker.

Recommend Projects

Missing transcripts? about 3d-speaker HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent