Giter Site home page Giter Site logo

yoruba-voice's Introduction

YorùbáVoice

Landing page for data, code and publications for this project sponsored by an Imminent Research Grant.

In 2022, we launched the curation and recording of 40 hours of high-fidelity speech data for the Yorùbá language, the third most widely spoken language in Africa with over 40 million L1 speakers. We partner with the YorubaName organization in Nigeria to encourage volunteers both online and offline to record their voices.

BibTeX entry and citation info

If you make use of our dataset, please cite the our paper.

@misc{ogunremi2023iroyinspeech,
      title={\`{I}r\`{o}y\`{i}nSpeech: A multi-purpose Yor\`{u}b\'{a} Speech Corpus}, 
      author={Tolulope Ogunremi and Kola Tubosun and Anuoluwapo Aremu and Iroro Orife and David Ifeoluwa Adelani},
      year={2023},
      eprint={2307.16071},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

yoruba-voice's People

Contributors

ruohoruotsi avatar dadelani avatar

Stargazers

Sam avatar DavidAA avatar  avatar Nickolay V. Shmyrev avatar

Watchers

 avatar Nickolay V. Shmyrev avatar Tolúlọpẹ́ Ògúnrẹ̀mí avatar Kola Tubosun avatar

yoruba-voice's Issues

Add g2p code to generate IPA & XSAMPA

  • Adapt code from yo-asr project to do g2p using epitran
  • Print out IPA & XSAMPA phonetic spellings
  • Clean up comments, add README and requirements.txt

[Infrastructure] Setup Githhub repo, project & invite members

  • Setup Repository for code & data: yoruba-voice (lowercase, no diacritics because of Github URL limitations)
  • Setup Github Project for Yorùbá Voice. Use Kanban with small automation
  • Setup Github team (since not everyone in NV-LTI is working on Yorùbá Voice). TBD how exactly we use it for discussions, etc
  • Invite members. I don't have the github handle of Àrẹ̀mú
  • Start making cards & assigning to people

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.