by2101 / openasr Goto Github PK

A pytorch based end2end speech recognition system.

License: Apache License 2.0

Shell 10.57% Python 77.88% Perl 11.55%

speech speech-recognition speech-to-text speechrecognition speech-recognizer transformer las end2end asr

openasr's Issues

the score seems incorrect

i suspect that the aishell.txt in exp is the problem.when i use decode_test.sh,the argument:exp/aishell.txt is the text i donnt have.so i argue it seems the vocab_file argument and then i use the aishel1_train_chars.txt instead and then it happens.
How can i do to solve it?
我发现在decode_test.sh的参数中我没有aishell.sh，于是我用了同在一个目录下的aishel1_train_chars.txt代替，因为我觉得这个参数对应这vocab_file这个参数，这样就造成了上述的情况，请问我错在什么地方？应该怎么修改？
（Ps: Thanks for giving the avg_last_ckpts.py!）

请问下您的复现成功了么？

我这边又跑了一边，还是decode的结果不对……无论是测试哪一段音频，decode结果都是同一句话，请问我应该如何修改呢？

训练遇到cufft错误

您好，我训练的时候遇到一个cufft的问题。具体的错误提示如下

Epoch 10 | Step 40686 | Iter 30400:
per_token_loss: 1.3417494 | avg_token_loss: 1.3317900 | learning_rate: 0.0002191
sequence_per_sec: 26.4520501
terminate called after throwing an instance of 'c10::Error'
  what():  cuFFT error: CUFFT_INVALID_PLAN (CUFFT_CHECK at /pytorch/aten/src/ATen/native/cuda/CuFFTUtils.h:70)
frame #0: c10::Error::Error(c10::SourceLocation, std::string const&) + 0x46 (0x7f585fed4536 in /root/.local/lib/python3.7/site-packages/torch/lib/libc10.so)
frame #1: <unknown function> + 0x299e1e0 (0x7f5862cce1e0 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cuda.so)
frame #2: <unknown function> + 0x29a3c0d (0x7f5862cd3c0d in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cuda.so)
frame #3: at::native::_fft_cufft(at::Tensor const&, long, bool, bool, bool, c10::ArrayRef<long>, bool, bool, c10::ArrayRef<long>) + 0x752 (0x7f5862cd1af2 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cuda.so)
frame #4: <unknown function> + 0xfa2408 (0x7f58612d2408 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cuda.so)
frame #5: <unknown function> + 0xfe27a4 (0x7f58613127a4 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cuda.so)
frame #6: <unknown function> + 0x2c2221c (0x7f589227a21c in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cpu.so)
frame #7: <unknown function> + 0x10c3f44 (0x7f589071bf44 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cpu.so)
frame #8: <unknown function> + 0xde1d5f (0x7f5890439d5f in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cpu.so)
frame #9: at::native::rfft(at::Tensor const&, long, bool, bool) + 0x22 (0x7f589043b0f2 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cpu.so)
frame #10: <unknown function> + 0x1152ef4 (0x7f58907aaef4 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cpu.so)
frame #11: <unknown function> + 0x2cf1434 (0x7f5892349434 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cpu.so)
frame #12: <unknown function> + 0x1189308 (0x7f58907e1308 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_cpu.so)
frame #13: <unknown function> + 0x298603 (0x7f589e8a0603 in /root/.local/lib/python3.7/site-packages/torch/lib/libtorch_python.so)
<omitting python frames>
frame #59: <unknown function> + 0x76ba (0x7f58a1dcb6ba in /lib/x86_64-linux-gnu/libpthread.so.0)
frame #60: clone + 0x6d (0x7f58a1b0141d in /lib/x86_64-linux-gnu/libc.so.6)

我现在用的是python==3.7 torch==1.5+cu9.2是版本问题吗？
另外我把lm decode改好了，最近测了下发现性能有下降。感觉不太正常。

训练的时候存在内存泄露

随着训练，内存一直升高，直到程序崩溃。大概2小时左右崩溃

How to use self training "lm" ?

Hi, I want to ask how to use my new self training lm with config_base.yaml trained acoustic model ?

解码时如何使用LM

您好，我已经训练好了一个LM 和一个AM。请问如何在解码的时候使用这个LM我没在示例中发现。或者您的思路是什么能分享一下吗？

how to train lst acoustic model

Hi, can you figure me out how to use config_lst.yaml for training acoustic model ?

显存溢出

RuntimeError: The size of tensor a (512) must match the size of tensor b (2) at non-singleton dimension 1

Traceback (most recent call last):
File "/home/lenovo/E2E-model/OpenASR/OpenASR-master/egs/aishell2/s5/../../../src/train.py", line 134, in
trainer.train()
File "/home/lenovo/E2E-model/OpenASR/OpenASR-master/src/trainer.py", line 152, in train
tr_loss = self.iter_one_epoch()
File "/home/lenovo/E2E-model/OpenASR/OpenASR-master/src/trainer.py", line 208, in iter_one_epoch
data = next(loader_iter)
File "/home/lenovo/anaconda3/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 345, in next
data = self._next_data()
File "/home/lenovo/anaconda3/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 856, in _next_data
return self._process_data(data)
File "/home/lenovo/anaconda3/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 881, in _process_data
data.reraise()
File "/home/lenovo/anaconda3/lib/python3.7/site-packages/torch/_utils.py", line 394, in reraise
raise self.exc_type(msg)
RuntimeError: Caught RuntimeError in DataLoader worker process 8.
Original Traceback (most recent call last):
File "/home/lenovo/anaconda3/lib/python3.7/site-packages/torch/utils/data/_utils/worker.py", line 178, in _worker_loop
data = fetcher.fetch(index)
File "/home/lenovo/anaconda3/lib/python3.7/site-packages/torch/utils/data/_utils/fetch.py", line 47, in fetch
return self.collate_fn(data)
File "/home/lenovo/E2E-model/OpenASR/OpenASR-master/src/data.py", line 242, in call
padded_waveforms, wave_lengths = load_wave_batch(paths)
File "/home/lenovo/E2E-model/OpenASR/OpenASR-master/src/data.py", line 189, in load_wave_batch
padded_waveforms[i, :lengths[i]] += waveforms[i]
RuntimeError: The size of tensor a (512) must match the size of tensor b (2) at non-singleton dimension 1
您好，我在使用AISHELL-1数据未出先问题，使用AISHELL-2出现以上问题，搜索很多资料，问题没有得到解决，请问该怎样修改？谢谢！

Traceback (most recent call last):
35910   File "/root/OpenASR/egs/aishell1/s5/../../../src/train.py", line 131, in <module>
35911     trainer.train()
35912   File "/root/OpenASR/src/trainer.py", line 152, in train
35913     tr_loss = self.iter_one_epoch()
35914   File "/root/OpenASR/src/trainer.py", line 229, in iter_one_epoch
35915     lst_t=self.lst_t)
35916   File "/root/.local/lib/python3.7/site-packages/torch/nn/modules/module.py", line 532, in __call__
35917     result = self.forward(*input, **kwargs)
35918   File "/root/.local/lib/python3.7/site-packages/torch/nn/parallel/data_parallel.py", line 152, in forward
35919     outputs = self.parallel_apply(replicas, inputs, kwargs)
35920   File "/root/.local/lib/python3.7/site-packages/torch/nn/parallel/data_parallel.py", line 162, in parallel_apply
35921     return parallel_apply(replicas, inputs, kwargs, self.device_ids[:len(replicas)])
35922   File "/root/.local/lib/python3.7/site-packages/torch/nn/parallel/parallel_apply.py", line 85, in parallel_apply
35923     output.reraise()
35924   File "/root/.local/lib/python3.7/site-packages/torch/_utils.py", line 394, in reraise
35925     raise self.exc_type(msg)
35926 TypeError: Caught TypeError in replica 2 on device 2.
35927 Original Traceback (most recent call last):
35928   File "/root/.local/lib/python3.7/site-packages/torch/nn/parallel/parallel_apply.py", line 60, in _worker
35929     output = module(*input, **kwargs)
35930   File "/root/.local/lib/python3.7/site-packages/torch/nn/modules/module.py", line 532, in __call__
35931     result = self.forward(*input, **kwargs)
35932 TypeError: forward() missing 3 required positional arguments: 'batch_wave', 'lengths', and 'target_ids'

Do you know why ? If this is the mismatch between torch version ?

by2101 / openasr Goto Github PK

openasr's Issues

the score seems incorrect

请问下您的复现成功了么？

训练遇到cufft错误

训练的时候存在内存泄露

How to use self training "lm" ?

解码时如何使用LM

how to train lst acoustic model

显存溢出

RuntimeError: The size of tensor a (512) must match the size of tensor b (2) at non-singleton dimension 1

avg_last_ckpts.py [Errno 2] No such file or directory

请问，文本长度超过了最大长度之后，cut是只cut了文本么，还是音频也一起cut了，还是把这条训练数据去掉了？

Error during train.sh

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent