Comments (4)
I wanted to know what I should pass as the speech units argument (<speech_units>) in wav, sr = translator.synthesize_speech(<speech_units>, <tgt_lang>).
from seamless_communication.
Your sample synthesizes speech from the output of the translation step. Check https://github.com/facebookresearch/seamless_communication/blob/main/scripts/m4t/predict/predict.py, which has an example of predicting with code.
https://github.com/facebookresearch/seamless_communication/blob/main/src/seamless_communication/models/inference/translator.py#L150 has more details
Yeah, I am stumbling over this one as well. When I plug in the tensor response from predict as follows, it does generate a short 2-second audio file, but without any content. But I just started playing around 2 hours ago ...
import os
import uuid

import torchaudio

# translator, content, mode, target_lang, source_lang, and folder_path are defined elsewhere.
translated_text, wav, sr = translator.predict(
    content, mode, target_lang, src_lang=source_lang
)

# wav, sr = translator.synthesize_speech(wav, "eng")

# Build a unique output path for the generated audio.
unique_filename = str(uuid.uuid4())  # Generate a UUID and convert it to a string
file_extension = ".wav"  # Replace with the desired file extension
file_path = os.path.join(folder_path, unique_filename + file_extension)

# Save the translated audio generation.
torchaudio.save(
    file_path,
    wav[0].cpu(),  # TODO test gpu
    sample_rate=sr,
)
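To narrow down the "audio without any content" symptom, it can help to look at the samples before saving them. The following is a minimal sketch, not part of seamless_communication: describe_waveform and its silence_threshold default are hypothetical names and values chosen for illustration, and the helper works on a plain sequence of floats so that it does not depend on any particular tensor library.

```python
from typing import Sequence, Tuple


def describe_waveform(
    samples: Sequence[float],
    sample_rate: int,
    silence_threshold: float = 1e-4,  # hypothetical cutoff, tune for your data
) -> Tuple[float, float, bool]:
    """Return (duration_seconds, peak_amplitude, is_silent) for a mono waveform."""
    if sample_rate <= 0:
        raise ValueError("sample_rate must be positive")
    duration = len(samples) / sample_rate
    # Peak absolute amplitude; an empty waveform counts as silent.
    peak = max((abs(s) for s in samples), default=0.0)
    return duration, peak, peak < silence_threshold


# Two seconds of zeros at 16 kHz: plausible length, but nothing audible.
duration, peak, is_silent = describe_waveform([0.0] * 32000, 16000)
print(duration, peak, is_silent)
```

Assuming wav is a torch tensor as in the snippet above, something like describe_waveform(wav[0].cpu().flatten().tolist(), sr) should give the numbers for the generated audio. A near-zero peak would suggest the problem is in generation rather than in the torchaudio.save call.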