Comments (14)
I managed to get the system up and running, whatever I speak the analog signal changes but I do not received any transcription on the UI/web page. I cna see the transcription being generated in the terminal logs.
Is someone else facing the same issue, any suggestions will be helpful.
from whisper-playground.
Can you attach the console log (a screenshot would do) + a screen recording?
from whisper-playground.
Following Sahar's request, please make sure that the screen recording is at least 15 seconds if you're using the real-time mode with a transcription timeout of 5 seconds :)
from whisper-playground.
@saharmor Thank you for the reply, I am attaching a screen recording which you can check, although in terminal I see the transcriptions after some time but I do not get nay print on the web page UI.
issue.mp4
I hope this helps to get a better understanding on the challenge I am facing.
from whisper-playground.
That's odd. I don't know what could be causing this.
I have an odd request, let me know if you want me to clarify.
Once you see "Transcription generated", forcefully terminate the program (Ctrl +C), go to the project files, open the backend folder, and there you should see a folder called temp
. Within that folder, there's a file called batch.wav
, could you please upload it to Drive and share the link? Or compress it and drag-and-drop it?
from whisper-playground.
https://github.com/saharmor/whisper-playground/assets/132049696/d4355b4c-8e27-422f-b345-b76e69bf1f87
@ethanzrd I just checked what you suggested but I don't see any temp folder getting created in the backed folder.
I by mistake said backed folder in the video but I mean temp folder only. YOu can check the recording
Also, I forcefully terminated the program after a transcript got generated for a segment, but still I cannot locate any temp folder being created in the backed folder or any batch.wav file.
from whisper-playground.
You're right, I was mistaken. This only applies to the sequential mode. Speaking of which, could you try it and let me know if you're having any problems there? If you are, then the batch.wav file will be present for you to include :)
from whisper-playground.
@ethanzrd I checked with sequential mode. There it generated the wav file and it is having the voice recording. But I neither see any transcription in my terminal logs nor on the whisper playground WEB PAGE.
I am sharing the wav file.
temp.zip
from whisper-playground.
Huh, that is odd. I'll test a bit with the recording tomorrow :)
from whisper-playground.
One more thing I checked , I am getting the transcript in my
tag but the issue is that it is not getting displayed on the web page. Here is a screenshot for the same.
Any guess/suggestion for this behaviour?
from whisper-playground.
@saharmor Thank you for the reply, I am attaching a screen recording which you can check, although in terminal I see the transcriptions after some time but I do not get nay print on the web page UI.
I hope this helps to get a better understanding on the challenge I am facing.
is there any update/solution which can be recommended?
from whisper-playground.
As for the HTML problem, have you tried expanding the window?
I don't know what could be causing your transcription issue as your audio is valid, I'll try to run it on a clean Windows installation and get back to you.
from whisper-playground.
I made a fresh installation and had to make change in package.json
"start": "react-scripts --openssl-legacy-provider start" to "start": "react-scripts start --openssl-legacy-provider"
Also I updated my chrome to latest version.
I can see the transcription now, but in starting for most of the segments the transcription being generated is empy, after 30 secs it starts to transcribe something but before that I get no transcript.
Is there any suggestion/solution which can be recommended.
from whisper-playground.
That's what I couldn't understand. I'm currently a bit busy with other things, but once some time frees up, I'll run a fresh Windows installation and run the project to see if there's any dependencies we might have missed that could be causing this strange behavior.
from whisper-playground.
Related Issues (20)
- How to investigate the problem of "No module named 'flask_cors'"? HOT 1
- `yarn start` error HOT 6
- sh: 1: react-scripts: not found HOT 2
- Possible to package this in a dockerfile? HOT 4
- Code switching inputs HOT 1
- .... this is for future issues
- Dockerized the fullstack app
- Add VAD to avoid hallucinations with speechless batches
- Manage speakers manually for real-time refinement & avoiding speaker swapping
- Transcribe the remaining audio data when using the real-time mode
- The backend is not available on Windows. HOT 2
- Add auto-scroll for transcribed data in client
- Error occurred during backend connection. HOT 5
- Prevent errors with speechless batches
- Implement multi-client support
- Moving the backend's IP into config.js
- Implement status bar
- Implement parameter validation in the client
- BEST DIART PARAMETERS IN GENERAL TO USE FOR BETTER DIARIZATION RESULTS
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from whisper-playground.