fbk-fairseq's Introduction

FBK-fairseq

This repository contains the open source code released by the MT unit of FBK. A dedicated README for each work can be found in the fbk_works directory.

Works are listed by year (2021-2024); see the fbk_works directory for the complete list.

If you use this repository, please acknowledge the related paper(s) by citing them. BibTeX citations are available for each work in the dedicated README file.

To install the repository, do:

pip install -e .
pip install -r speech_requirements.txt  # required for speech translation
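
As a quick sanity check (a minimal sketch, assuming the commands above completed without errors), the package should then be importable from Python:

import fairseq
print(fairseq.__version__)  # version string of the installed fairseq package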

Below is the original fairseq README.






Fairseq(-py) is a sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling and other text generation tasks.

We provide reference implementations of various sequence modeling papers; the full list of implemented papers, recent updates, and an overview of features can be found in the upstream fairseq repository.

We also provide pre-trained models for translation and language modeling with a convenient torch.hub interface:

en2de = torch.hub.load('pytorch/fairseq', 'transformer.wmt19.en-de.single_model')
en2de.translate('Hello world', beam=5)
# 'Hallo Welt'

See the PyTorch Hub tutorials for translation and RoBERTa for more examples.
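
For example, a RoBERTa model can be loaded through the same torch.hub interface (a minimal sketch based on the upstream fairseq hub entry points; the pre-trained weights are downloaded on first use):

import torch

# load a pre-trained RoBERTa model from the fairseq torch.hub entry points
roberta = torch.hub.load('pytorch/fairseq', 'roberta.large')
roberta.eval()  # disable dropout for evaluation

tokens = roberta.encode('Hello world!')      # apply BPE and binarize the input
features = roberta.extract_features(tokens)  # last-layer features, shape (1, seq_len, 1024)
print(features.shape)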

Requirements and Installation

  • PyTorch version >= 1.5.0
  • Python version >= 3.6
  • For training new models, you'll also need an NVIDIA GPU and NCCL (a quick check is sketched after this list)
  • To install fairseq and develop locally:
git clone https://github.com/pytorch/fairseq
cd fairseq
pip install --editable ./

# on MacOS:
# CFLAGS="-stdlib=libc++" pip install --editable ./

# to install the latest stable release (0.10.0)
# pip install fairseq==0.10.0
  • For faster training install NVIDIA's apex library:
git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" \
  --global-option="--deprecated_fused_adam" --global-option="--xentropy" \
  --global-option="--fast_multihead_attn" ./
  • For large datasets install PyArrow: pip install pyarrow
  • If you use Docker, make sure to increase the shared memory size either with --ipc=host or --shm-size as command line options to nvidia-docker run.
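
As a quick check of the GPU and NCCL requirement mentioned above (a minimal sketch, assuming PyTorch is already installed), you can ask PyTorch directly:

import torch

print(torch.__version__)                      # should be >= 1.5.0
print(torch.cuda.is_available())              # True if an NVIDIA GPU is visible to PyTorch
print(torch.distributed.is_nccl_available())  # True if the NCCL backend was built into PyTorch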

Getting Started

The full documentation contains instructions for getting started, training new models and extending fairseq with new model types and tasks.

Pre-trained models and examples

We provide pre-trained models and pre-processed, binarized test sets for several tasks listed below, as well as example training and evaluation commands.

We also have more detailed READMEs to reproduce results from specific papers.


License

fairseq(-py) is MIT-licensed. The license applies to the pre-trained models as well.

Citation

Please cite as:

@inproceedings{ott2019fairseq,
  title = {fairseq: A Fast, Extensible Toolkit for Sequence Modeling},
  author = {Myle Ott and Sergey Edunov and Alexei Baevski and Angela Fan and Sam Gross and Nathan Ng and David Grangier and Michael Auli},
  booktitle = {Proceedings of NAACL-HLT 2019: Demonstrations},
  year = {2019},
}

fbk-fairseq's People

Contributors

alexeib, cndn, davidecaroselli, deanfcc, edunov, erip, freewym, halilakin, huihuifan, jhcross, jma127, joshim5, kahne, kartikayk, lematt1991, liezl200, liuchen9494, louismartin, maigoakisame, mgaido91, multipath, myleott, nng555, pipibjc, sarapapi, shruti-bh, skritika, tangyuq, theweiho, xu-song


fbk-fairseq's Issues

Can't get EDATT to work

I cloned the FBK-fairseq repo (https://github.com/hlt-mt/FBK-fairseq.git), installed it following the instructions here, and tried to run the EDATT model on the SiMT task as described here, but it gives me some weird error:

Traceback (most recent call last):
File "/work/miniconda/envs/fbk/bin/simuleval", line 8, in
sys.exit(main())
File "/work/miniconda/envs/fbk/lib/python3.10/site-packages/simuleval/cli.py", line 165, in main
_main(args.client_only)
File "/work/miniconda/envs/fbk/lib/python3.10/site-packages/simuleval/cli.py", line 180, in _main
_, agent_cls = find_agent_cls(args)
File "/work/miniconda/envs/fbk/lib/python3.10/site-packages/simuleval/utils/agent_finder.py", line 64, in find_agent_cls
spec.loader.exec_module(agent_modules)
File "", line 883, in exec_module
File "", line 241, in _call_with_frames_removed
File "/work/FBK-fairseq/examples/speech_to_text/simultaneous_translation/agents/simul_offline_edatt.py", line 18, in
from examples.speech_to_text.simultaneous_translation.agents.base_simulst_agent import FairseqSimulSTAgent
ModuleNotFoundError: No module named 'examples'

What am I doing wrong?

[Question for AlignAtt]

Hi, thank you so much for your great work and open access.

I'd like to reproduce AlignAtt, especially to replicate the data preparation and training.

Can you provide a guide containing a script or command for data preparation and training?

Thanks,

TypeError: cannot unpack non-iterable NoneType object

🐛 Bug

Hey! I tried following the instructions here to run the AlignAtt agent on the en->es direction model. I git-cloned and installed (in editable mode) this repo and SimulEval, downloaded the checkpoint and all the associated metadata files to /workspace/FBK-fairseq/checkpoint/, and ran the following command:

!simuleval \
    --agent examples/speech_to_text/simultaneous_translation/agents/v1_0/simul_offline_alignatt.py \
    --source /workspace/source.txt \
    --target /workspace/target.txt \
    --config config_simul.yaml \
    --model-path /workspace/FBK-fairseq/checkpoint/checkpoint_avg7.pt \
    --extract-attn-from-layer 3 \
    --frame-num 4 \
    --speech-segment-factor 10 \
    --output /content/ \
    --port 8000 \
    --gpu \
    --scores

and got the following error:

Traceback (most recent call last):
  File "/usr/local/bin/simuleval", line 33, in <module>
    sys.exit(load_entry_point('simuleval', 'console_scripts', 'simuleval')())
  File "/workspace/SimulEval/simuleval/cli.py", line 165, in main
    _main(args.client_only)
  File "/workspace/SimulEval/simuleval/cli.py", line 180, in _main
    _, agent_cls = find_agent_cls(args)
  File "/workspace/SimulEval/simuleval/utils/agent_finder.py", line 64, in find_agent_cls
    spec.loader.exec_module(agent_modules)
  File "<frozen importlib._bootstrap_external>", line 883, in exec_module
  File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed
  File "/workspace/FBK-fairseq/examples/speech_to_text/simultaneous_translation/agents/v1_0/simul_offline_alignatt.py", line 17, in <module>
    from examples.speech_to_text.simultaneous_translation.agents.v1_0.simul_offline_edatt import EDAttSTAgent
  File "/workspace/FBK-fairseq/examples/speech_to_text/__init__.py", line 6, in <module>
    from . import tasks, criterions, models, modules  # noqa
  File "/workspace/FBK-fairseq/examples/speech_to_text/tasks/__init__.py", line 7, in <module>
    importlib.import_module('examples.speech_to_text.tasks.' + task_name)
  File "/usr/lib/python3.10/importlib/__init__.py", line 126, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "/workspace/FBK-fairseq/examples/speech_to_text/tasks/speech_translation_dualdecoding.py", line 17, in <module>
    from examples.speech_to_text.inference.twophase_sequence_generator import TwoPhaseSequenceGenerator
  File "/workspace/FBK-fairseq/examples/speech_to_text/inference/twophase_sequence_generator.py", line 21, in <module>
    from examples.speech_to_text.models.base_triangle_with_prev_tags import BaseTrianglePreviousTags
  File "/workspace/FBK-fairseq/examples/speech_to_text/models/__init__.py", line 7, in <module>
    importlib.import_module('examples.speech_to_text.models.' + model_name)
  File "/usr/lib/python3.10/importlib/__init__.py", line 126, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "/workspace/FBK-fairseq/examples/speech_to_text/models/speechformer_triangle.py", line 14, in <module>
    from examples.speech_to_text.models.base_triangle import BaseTriangle
  File "/workspace/FBK-fairseq/examples/speech_to_text/models/base_triangle.py", line 20, in <module>
    from examples.speech_to_text.modules.triangle_transformer_layer import TriangleTransformerDecoderLayer
  File "/workspace/FBK-fairseq/examples/speech_to_text/modules/__init__.py", line 7, in <module>
    importlib.import_module('examples.speech_to_text.modules.' + module_name)
  File "/usr/lib/python3.10/importlib/__init__.py", line 126, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "/workspace/FBK-fairseq/examples/speech_to_text/modules/transformer_layer_penalty.py", line 10, in <module>
    from examples.speech_to_text.modules.local_attention import LocalAttention
  File "/workspace/FBK-fairseq/examples/speech_to_text/modules/local_attention.py", line 11, in <module>
    from fairseq import utils
  File "/workspace/FBK-fairseq/fairseq/__init__.py", line 33, in <module>
    import fairseq.optim  # noqa
  File "/workspace/FBK-fairseq/fairseq/optim/__init__.py", line 27, in <module>
    (
TypeError: cannot unpack non-iterable NoneType object

To Reproduce

Steps to reproduce the behavior (always include the command you ran):

  1. Run cmd '....'
  2. See error

Code sample

Expected behavior

Environment

  • fairseq Version (e.g., 1.0 or master): master
  • PyTorch Version (e.g., 1.0)
  • OS (e.g., Linux): Linux
  • How you installed fairseq (pip, source):
  • Build command you used (if compiling from source):
  • Python version:
  • CUDA/cuDNN version:
  • GPU models and configuration: 4090
  • Any other relevant information:

Additional context

About NE emb

Would you mind explaining the 'NE emb' in detail? How can its vector be obtained? Thanks!

Training on the speech_to_text_tagged task

Using the command in the README, I can't get the same BLEU on the speech_to_text_tagged task, even though I added the NER tags to the dictionary. Is my dictionary or data preprocessing wrong? Could you share the dictionary for the MuST-C dataset? Thanks a lot.
