Comments (8)
No budget at all, I do development just for fun, will publish open source here when it done.
P.S. If you want, I'll add you to this private repo (but you need a Windows PC to test at least).
from whispertimesync.
I will have a look at it as soon as possible.
To be sure of what you expect, can you provide me with an example (JSON+TXT)?
WhisperTimeSync is not written in Python but in Java.
;-)
from whispertimesync.
Here we go: samples.zip
There are two directories: English (Smokie, "Living Next Door to Alice" and Russian (Bit-quartet Secret, "Alice") with JSONs and original lyrics (my daughter's name is Alice 😉 ). Whisper's English output is kinda affordable but Russian is a complete mess.
from whispertimesync.
@sensboston
Hmmm... The problem I see with this JSON format is that each word has a mandatory description including its timestamp. It will be very hard to decide what to do with not-matching words.
🤔
from whispertimesync.
Yeah, it's an issue, agree. But I haven't looked to your (or Java code you've ported) implementation (yet). Theoretically it's possible, even without involving AI - for example by using "soundex" algorithm. I thought about this but initially wanna check if someone already done this.
from whispertimesync.
@sensboston
The problem is not to match word by word, this is what WhisperTimeSync is doing.
The problem is to know what to do with unmatching words in this specific JSON description.
I may adapt an algo I already have for similar cases, but this is quite a work.
Do you have a budget for this?
from whispertimesync.
Any progress? Or you have no idea how to implement this? Please le me know - I don't wanna to waste a time.
from whispertimesync.
@sensboston
I understood you were working on the subject on your side.
On my side, without a budget, I have to find/allocate on my free time.
from whispertimesync.
Related Issues (14)
- Can you make a Google colab version please? HOT 2
- An error occurred while loading the archive HOT 4
- Crash with long SRT File HOT 2
- word_timestamps parameter HOT 4
- Highlight and Max line width HOT 1
- Synchronization offset HOT 4
- Memory error when synchronising longer texts HOT 2
- Incorrect sync HOT 5
- how can I change language option to transcribe audio HOT 2
- java.lang.ArrayIndexOutOfBoundsException when running WhisperTimeSync HOT 1
- ComPair Freeware: installation expired HOT 10
- Hallucinations HOT 5
- CUDA usage HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from whispertimesync.