Hi, I'm enjoying working with this fascinating repo. Looking at Stag

Added Tiny model and result here: <a href="https://github.com/guynich/distil-wh

I'm closing this issue: the small and tiny model results for <code class="notranslate"

Short form evaluation WER % for Librispeech clean test about distil-whisper HOT 3 CLOSED

guynich commented on May 23, 2024

Short form evaluation WER % for Librispeech clean test

from distil-whisper.

Comments (3)

guynich commented on May 23, 2024

The above table is with --language "en" in the short form bash scripts. By removing this flag and rerunning the evaluation the eval/wer values are lower.

E.g.:

model	eval/wer with `--language "en"`	eval/wer without option `--language`	HF model card WER
OpenAI Large-v2	3.1683	2.5685	3.0004
OpenAI Small	4.0682	3.44541	3.4322

Without the --language flag:

Large-v2 model eval/wer is lower than the HuggingFace model card WER value, and lower than the original OpenAI paper result of 2.7% in Table 2.
Small model eval/wer is similar to the HuggingFace model card WER value.

from distil-whisper.

guynich commented on May 23, 2024

Added Tiny model script and result here: https://github.com/guynich/distil-whisper/tree/main/training/scripts#summary.

from distil-whisper.

guynich commented on May 23, 2024

I'm closing this issue: the small and tiny model results for HF model card and eval/wer without option --language are aligned sufficiently for me.

(I don't understand the discrepancy in values for Large-V2 but can leave that issue)

from distil-whisper.

Short form evaluation WER % for Librispeech clean test about distil-whisper HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent