Comments (8)
Can you post the full dump? This will support 0.4.
from ctcdecode.
There's no stack trace
This is all I got:
Loading the LM will be faster if you build a binary file.
Reading models/lm_csr_64k_vp_3gram.arpa
----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100
****************************************************************************************************
Segmentation fault (core dumped)
This is how I got the error btw (see issue):
SeanNaren/deepspeech.pytorch#294 (comment)
from ctcdecode.
Could you try compressing the arpa into a trie file? Something like this:
kenlm/bin build_binary -q 8 -b 8 -a 22 trie models/lm_csr_64k_vp_3gram.arpa models/lm_csr_64k_vp_3gram.trie
and then using the trie file? The error you're getting is from kenlm, not the binding itself.
from ctcdecode.
Will try that out (I originally just downloaded the .arpa
file)
from ctcdecode.
I compiled KenLM but can't get lmutils to work... Am I missing something?
from ctcdecode.
FInally managed to do it with:
kenlm/build/bin/build_binary -q 8 -b 8 -a 22 trie models/lm_csr_64k_vp_3gram.arpa models/lm_csr_64k_vp_3gram.trie
Anyway, same segmentation fault when transcribing
from ctcdecode.
Turns out that because I was running the code in a docker container I wasn't getting the full stack trace.
Now that I set the right flags for the container, I'm getting this:
Thread 60 "python" received signal SIGSEGV, Segmentation fault.
[Switching to Thread 0x7fff655f5700 (LWP 265)]
PathTrie::get_path_trie (this=this@entry=0x7fff655f4be0, new_char=new_char@entry=1, new_timestep=new_timestep@entry=0, reset=reset@entry=true)
at /tmp/pip-req-build-0lq7p010/ctcdecode/src/path_trie.cpp:56
56 /tmp/pip-req-build-0lq7p010/ctcdecode/src/path_trie.cpp: No such file or directory.
from ctcdecode.
@ryanleary the problem is solved. Turns out the binary was faulty. Feel free to close the issue
from ctcdecode.
Related Issues (20)
- problem with CTCBeamDecoder.decode() when using a big (.arpa / .binary) file HOT 1
- I can't dockerize my project with ctcdecode library HOT 1
- "Symbol not found in flat name space" in MacOS (macbook pro M1) when importing ctcdecode HOT 1
- pip install error on Linux HOT 2
- out_len and out_lens
- Calculation of "beam_scores" and gradient HOT 2
- probelm on installing ctcdecode in window
- probelm when installing ctcdecode in window HOT 1
- You can migrate your code to kaggle, install the package through kaggle. See the website below for installation.
- RuntimeError: Resource temporarily unavailable
- could the ctcdecode be serialized and executed in non-Python environments, such as torchscript?
- #error C++17 or later compatible compiler is required to use PyTorch
- pip install . errro on colab HOT 3
- Unable to run the pip install . command HOT 1
- How to use timesteps information to calculate word-level alignments? HOT 2
- Can you list the dependencies like torch version, c++ and others HOT 1
- PIP install issue Windows HOT 1
- Can we just update this repo HOT 1
- Share how to install successful "ctcdecode" HOT 5
- beam scores of shape (batch, num_beams, time_steps)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ctcdecode.