Comments (7)
Is there a way to figure out why it quits?
Use it directly in a terminal/console.
Is it that it encounters a gap in the audio or a different language and stops?
It's because you have not enough VRAM and it crash with the out of memory error.
from whisper-standalone-win.
from whisper-standalone-win.
8GB should be enough for the large models.
4GB is barely enough for large models maybe your internet browser or other soft is using VRAM too. Try Faster-Whisper-XXL.
Check with --verbose true
what compute type is in use.
from whisper-standalone-win.
I just downloaded and tested XXL (on my laptop, 4GB) with large v2. With just this browser (Firefox) open, testing it on the same audio file it says it is using my GTX1050 with int8_float32. I see nearly 100% CUDA activity in task manager.
With large v2... it made it to 10 minutes and then gave up when it ran out of memory. Oh well!
from whisper-standalone-win.
Try to reduce --best_of
till it takes less than 4GB memory.
from whisper-standalone-win.
5 is the default, right? 4 went a few more minutes before failing. 3 was able to complete the 1 hour 30 min interview!
Is there a quality downside to this setting? In general I use the large model as it just a better job with proper nouns. Is best of 3 and large v2 likely better for specific names than medium at 5? A quick review seems acceptable.
from whisper-standalone-win.
Is there a quality downside to this setting?
Yes, but going from 5 to 3 has very small impact to quality, much less than going from large to medium model.
If it's audio from a movie then you can use --ff_mdx_kim2 --vad_alt_method pyannote_v3
to increase quality [that's with Faster-Whisper-XXL].
EDIT:
Oh, you wrote that it's interview, then don't bother with those settings unless there is background music.
from whisper-standalone-win.
Related Issues (20)
- Error when running faster whisper r192.3
- a request: Purfview Whisper Live ? HOT 1
- Named Pipes are not recognized HOT 1
- Americans with Disabilities Act (ADA) guidelines, for subtitles HOT 1
- Missing transcript between segments. HOT 24
- Repeated output issue HOT 1
- Is wisper-standalone-win is closed source? HOT 2
- transcription as best as possible HOT 5
- My computer freezes when transcripts process starts HOT 3
- new Whisper old problems HOT 8
- --highlight_words true --max_line_width 43 --max_line_count 2 HOT 17
- How to make the sentence segmentation more precise HOT 1
- cuBLAS dll file takes too much space HOT 1
- Server/online mode to quickly process files on demand while keeping things in memory HOT 2
- When the --sentence option is enabled, some names in the transcription will be broken. HOT 1
- Not working with a non-English language HOT 2
- unable to remove timestamps HOT 1
- Zluda HOT 1
- Some peculiar effects of --initial_prompt HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from whisper-standalone-win.