Topic: speech-translation Goto Github
Some thing interesting about speech-translation
Some thing interesting about speech-translation
speech-translation,Revisiting End-to-End Speech-to-Text Translation From Scratch
User: bzhanggo
speech-translation,Zero -- A neural machine translation system
User: bzhanggo
speech-translation,This repository contains the data resources for the LacunaFund supported project, Multimodal datasets for the Bemba Language of Zambia.
User: csikasote
speech-translation,A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
User: dadangdut33
speech-translation,The dataset of Speech Recognition
User: double22a
speech-translation,Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Organization: echogarden-project
speech-translation,End-to-End Speech Processing Toolkit
Organization: espnet
Home Page: https://espnet.github.io/espnet/
speech-translation,PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.
User: george0828zhang
speech-translation,A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.
User: george0828zhang
speech-translation,A corpus that can be used to train English-to-Italian End-to-End Speech-to-Text Machine Translation models
User: giuseppe-della-corte
speech-translation,Speech to text and translation client-server using Google cloud
User: hagarz
speech-translation,Repository containing the open source code of works published at the FBK MT unit.
Organization: hlt-mt
speech-translation,Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".
Organization: ictnlp
Home Page: https://arxiv.org/abs/2305.08709
speech-translation,Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
Organization: ictnlp
Home Page: https://arxiv.org/abs/2305.08706
speech-translation,Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
Organization: ictnlp
Home Page: https://arxiv.org/abs/2310.07403
speech-translation,Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
Organization: ictnlp
speech-translation,Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"
Organization: ictnlp
speech-translation,Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
Organization: ictnlp
speech-translation,Real time caption generator using Microsoft Azure speech services
User: jadenchun
speech-translation,🖍️ This project combines multiple operations in Microsoft Azure Cognitive Services into one GUI, including QnA Maker, LUIS, Computer Vision, Custom Vision, Face, Form Recognizer, Text To Speech, Speech To Text and Speech Translation. It's very user-friendly for users to implement any operation mentioned above.
User: jeffwang0325
speech-translation,A hobby project. Online translator service. This service helps you to translate a text or speech from any languages in the world to any other.
User: jojijacobk
Home Page: https://thetranslator.live/
speech-translation,Tracking the progress in end-to-end speech translation
User: kahne
speech-translation,🚀 Seamlessly fine-tune and deploy Whisper model on a multi-lingual dataset.
User: kevkibe
speech-translation,Whisper Transcription Service
User: ksquarekumar
Home Page: https://github.com/ksquarekumar/whisper-stream
speech-translation,Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"
User: liamdugan
Home Page: https://arxiv.org/abs/2306.01201
speech-translation,Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Organization: microsoft
speech-translation,ESO speech dataset: an English-language speech corpus of the oncology domain for ASR training and benchmarking and MT benchmarking.
Organization: mllp-research-group
Home Page: http://www.mllp.upv.es/eso-dataset
speech-translation,Systems submitted to IWSLT 2022 by the MT-UPC group.
Organization: mt-upc
speech-translation,Efficient Speech Translation with Dynamic Latent Perceivers
Organization: mt-upc
speech-translation,SEGAUGMENT: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Organization: mt-upc
speech-translation,SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Organization: mt-upc
speech-translation,Pushing the Limits of Zero-shot End-to-End Speech Translation
Organization: mt-upc
speech-translation,A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Organization: nvidia
Home Page: https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
speech-translation,Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Organization: paddlepaddle
Home Page: https://paddlespeech.readthedocs.io
speech-translation,code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
User: reneeye
speech-translation,This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)
User: reneeye
speech-translation,List of direct speech-to-speech translation papers.
User: rongjiehuang
speech-translation,SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Java for Linux
Organization: think-a-move
speech-translation, SPEAR-ASR and SPEAR-WakeUp Software Development Kit in Java for Windows
Organization: think-a-move
speech-translation,Code for the paper "Does Joint Training Really Help Cascaded Speech Translation?" (EMNLP 2022)
User: tran-khoa
speech-translation,Limit the use of end-to-end data for Speech Translation (by leveraging Automatic Speech Recognition and Machine Translation data instead) using zero-shot multilingual text translation techniques.
User: tuanh23
speech-translation,A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
Organization: vinairesearch
speech-translation,The project for speech translation
User: xuchennlp
speech-translation,unsupervised spoken utterances scoring
User: yaya-sy
speech-translation,Speech-To-Text is a C# desktop app that uses Azure Cognitive Services to convert and translate speech. You can copy or show the text on the screen, and choose the language of the speech or the translation.
User: yousef0sa
speech-translation,Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.
User: zhangshaolei1998
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.