Comments (6)
I can't redo this:
- "seven hundred and twenty two" becomes "722" here.
Other combinations noted are more ambiguous, what is your --numbers-min-value
set to?
from nerd-dictation.
Oops, funny, I wrote the min value stuff and forgot it was there. I have --numbers-min-value=3
which explains the issue.
If you see an easy way to make --numbers-min-value
realize that 722 is much bigger than 3, then go for it because I'm not quite sure how to hook that in properly. Otherwise, you may close the issue as not a bug.
from nerd-dictation.
Could you show the command used to activate the nerd-dictation
? I can't redo the issue even with --numbers-min-value=3
set.
from nerd-dictation.
./nerd-dictation begin --numbers-as-digits --numbers-no-suffix --numbers-min-value=3 --suspend-on-start --verbose=1 --simulate-input-tool=DOTOOLC
from nerd-dictation.
I still can't redo this even with the exact command. It might be the language model outputs "twenty-two" instead of "twenty two" which confuses the parsing - which assumes spaces.
The readme in the model directory reads:
Accurate universal English model (both for callcenter and wideband)
Based on Appen Kaldi model https://github.com/Appen/UHV-OTS-Speech
Librispeech test-clean WER: 5.69%
Tedlium WER: 6.05%
Callcenter WER: 29.78%
from nerd-dictation.
That could be. I'm using an unmodified vosk-model-en-us-0.42-gigaspeech
model and it does say twenty-two
.
from nerd-dictation.
Related Issues (20)
- --output STDOUT only sends output after dictation ends HOT 3
- Issue with alias function name "ndbegin".
- Crash when ending the dictation with DOTOOL as simulating input tool
- on wayland this tool wont work as xdotool is for x11 which this program requires please implement different automation implementation for wayland HOT 3
- permission denied
- Feature: executable bundled with xdotool and pulseaudio (for parec)
- Processing time with larger models
- Bad xdotool performance
- Current status (activated, suspended, etc.) HOT 3
- Push to talk: How to Configure?
- can't find '__main__' module HOT 3
- pipx installed vosk not recognized by nerd-dictation, nor installable using pipx!? HOT 4
- /examples/vosk_grammar/ - browser prints "[B" for "down" instead of simulating arrow-down
- Can't get startstop AND punctuation example working at the same time
- Undo last sentence (when dication goes wrong) HOT 1
- Japanese models always produces spaces in the output
- Pip instructions are incorrect HOT 6
- ydotoold could not be started after the installation on Ubuntu 22.04: Here's the fix:
- Install failure on Linux Mint 22
- Capitalisation
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nerd-dictation.