Giter Site home page Giter Site logo

tmhint-qi-voicemos2023's Introduction

TMHINT-QI VoiceMOS2023

This repository aims to publically share the training and test data of TMHINT-QI version II, which was used as one of the tracks in VoiceMOS Challenge 2023.

TMHINT-QI version II

The TMHINT-QI version II is the updated version of the original TMHINT-QI dataset. The version I dataset has no unseen scenarios for the evaluation set. Therefore, the version II dataset aims to accommodate such concerns by modifying the training set and providing the unseen systems for the evaluation set.

Training Set

The training set consists of four scene environments: clean, babble, white, and pink noises. It also consists of noisy, clean, and enhanced utterances from four systems, including Minimum-mean Square Error (MMSE), Deep Denoising Autoencoder (DDAE), Fully Convolutional Network (FCN), and Transformer. The training set consists of 11053 utterances with corresponding quality (0-5) and intelligibility (0-10) scores.

Please refer to the following link for the updated split for the training set, and please download the corresponding utterances in the following link.

Test Set

The TMHINT-QI version II test set consists of five scene environments: clean, babble, white, and pink noises, and street noise for the unseen environment. It also consists of noisy, clean, and enhanced utterances consisting of three seen enhanced systems (FCN, MMSE, and Tranformer) and two unseen enhanced systems (Conformer-based Metric Generative Adversarial Network (CMGAN) and DEMUCS). In total, the test set of TMHINT-QI version II consists 1960 utterances.

The corresponding test set of TMHINT-QI version II can be downloaded at the following link. In addition, the corresponding utterances can be downloaded here.

Current benchmark score

Capture

From the VoiceMOS Challenge 2023, our system (T02), which is based on the improved version of MOSA-Net achieved the top performer among the other systems. The detail explanation regarding our systems can be found in the following link

Citation

Please kindly cite our paper, if you use the dataset in your research.

@misc{zezario2023study,
      title={A Study on Incorporating Whisper for Robust Speech Assessment}, 
      author={Ryandhimas E. Zezario and Yu-Wen Chen and Szu-Wei Fu and Yu Tsao and Hsin-Min Wang and Chiou-Shann Fuh},
      year={2023},
      eprint={2309.12766},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
}

Acknowledgment

BioASP Lab, Academia Sinica

VoiceMOS Challenge Organizers 2023

tmhint-qi-voicemos2023's People

Contributors

dhimasryan avatar

Stargazers

yhzhouowo avatar  avatar Nickolay V. Shmyrev avatar Eric Lam avatar Kazushi Nakazawa avatar Hui-Guan Yuan (袁惠冠) avatar RoyChao avatar Wei-Lun Chen avatar  avatar  avatar Dyah Ayu M. G. Wisnu avatar  avatar  avatar

Watchers

 avatar Nickolay V. Shmyrev avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.