Long-Form transcription with Faster Whisper about distil-whisper HOT 3 OPEN

9throok commented on May 12, 2024

Long-Form transcription with Faster Whisper

from distil-whisper.

Comments (3)

sanchit-gandhi commented on May 12, 2024

Hey @9throok - cool to see that you're using Distil-Whisper in combination with Faster-Whisper! I believe the .transcribe method in Faster-Whisper handles the long-form generation algorithm: https://github.com/guillaumekln/faster-whisper#usage Is this the API that you've been using? If you could share a reproducible code snippet that showcases the behaviour you're seeing that would be great, thanks!

from distil-whisper.

mehrdad-es commented on May 12, 2024

@9throok, any update on the issue that you mentioned?

from distil-whisper.

Purfview commented on May 12, 2024

Hi, I have been working on faster whisper and trying to use the distil-whisper model. However, distil-whisper supports 30s of audio chunks and using it with faster whisper only outputs the first 30 seconds.

I had same issue, after the first chunk nada in output, then looked at debug - distill model just hallucinated non stop after the first chunk, solution is to disable context prompt, initial prompt has negative effect too.

How can it be used with the faster-whisper implementation?

Now it has official support -> SYSTRAN/faster-whisper@ad3c830

Or you can use the standalone executable -> https://github.com/Purfview/whisper-standalone-win

from distil-whisper.

Long-Form transcription with Faster Whisper about distil-whisper HOT 3 OPEN

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent