Comments (3)
@ideasman42 , does this make sense to you? Would that be difficult to add?
(...and thanks for having brought this awesome project to the world!)
from nerd-dictation.
@ideasman42 , does this make sense to you? Would that be difficult to add?
I'm not keen on adding too much specific functionality,
there is value in keeping this small and simple allowing users to hack their own configurations to add extended functionality.
This means if users aren't happy with how a feature works (in the sample files for e.g.) they can always implemented differently in their own configuration.
As far as I can see this should not be so complicated, but it may not be a trivial task either.
This would be great to support, although I think this could be handled in the user configuration - with some minor extension. The examples file could include a sample which would be sufficient for most most users (so everyone doesn't have to write their own).
The way VOSK works, words may be modified as you talk (based on statistics of likely word choice).
For typing that's OK since words can be back spaced and typed in again, however it doesn't work for commands.
There would need to be a way for:
-
Commands to be executed immediately, ignoring any updates from VOSK's statistical model.
In this case the "Reset" method looks like what we want, so the statistical model is reset and previous words wont be reinterpreted.
-
The user configuration to return user input such as mouse buttons (not so difficult).
from nerd-dictation.
I'm not keen on adding too much specific functionality, there is value in keeping this small and simple allowing users to hack their own configurations to add extended functionality.
Thanks for your reply! I totally share this philosophy of keeping things simple and having few, but powerful features.
I will soon come back with a solution draft in form of an update in the PR #17...
from nerd-dictation.
Related Issues (20)
- Delay exit doesn't work HOT 1
- Connection refused HOT 1
- Windows support HOT 1
- --output STDOUT only sends output after dictation ends HOT 3
- Issue with alias function name "ndbegin".
- Crash when ending the dictation with DOTOOL as simulating input tool
- on wayland this tool wont work as xdotool is for x11 which this program requires please implement different automation implementation for wayland HOT 3
- permission denied
- Feature: executable bundled with xdotool and pulseaudio (for parec)
- Processing time with larger models
- Bad xdotool performance
- Current status (activated, suspended, etc.) HOT 3
- Push to talk: How to Configure?
- can't find '__main__' module HOT 3
- pipx installed vosk not recognized by nerd-dictation, nor installable using pipx!? HOT 4
- /examples/vosk_grammar/ - browser prints "[B" for "down" instead of simulating arrow-down
- Can't get startstop AND punctuation example working at the same time
- Undo last sentence (when dication goes wrong) HOT 1
- Japanese models always produces spaces in the output
- Pip instructions are incorrect HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nerd-dictation.