Error : Detected language: Gujarati 100%|██████████| 8533/8533 [00:16<00:00

mn = "skylord/wav2vec2-large-xlsr-hindi" #<a class="user-mention notranslate" data-hov

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Language issues [No default align-model for language: gu] about whisper-diarization HOT 7 CLOSED

mahmoudashraf97 commented on May 14, 2024

Language issues [No default align-model for language: gu]

from whisper-diarization.

Comments (7)

MahmoudAshraf97 commented on May 14, 2024

Not all languages are supported right now, I'm actively working on supporting more languages

from whisper-diarization.

alloc7260 commented on May 14, 2024

I am also willing to contribute for the same.
Just wanted little guidance.

from whisper-diarization.

alloc7260 commented on May 14, 2024

Can you tell me how many languages are supported right now?

from whisper-diarization.

MahmoudAshraf97 commented on May 14, 2024

Right now word timestamps are generated using WhisperX, languages that are not supported in whisperx can be generated using Whisper Dynamic Time Warping, you can find tutorals for that on the original whisper repo, and supported languages are in the code

from whisper-diarization.

alloc7260 commented on May 14, 2024

mn = "skylord/wav2vec2-large-xlsr-hindi" #@param
alignment_model, metadata = whisperx.load_align_model(
language_code=whisper_results["language"], device=device, model_name=mn
)

I have changes this line
it is used to take language specific model from hugging face

there are many language model available for many languages there

take model name from there that suits your language and put it in mn variable

and continue running...

WER will vary according to model you choose

from whisper-diarization.

MahmoudAshraf97 commented on May 14, 2024

You can modify this in whisperX repo, we import supported languages from there

from whisper-diarization.

MahmoudAshraf97 commented on May 14, 2024

@alloc7260 Hello, all languages that are supported in whisper are supported in the code now

from whisper-diarization.