Giter Site home page Giter Site logo

openvidial's People

Contributors

keeeeiz avatar littlesulley avatar shuhewang1998 avatar wjczf123 avatar yuxianmeng avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

openvidial's Issues

The model and config in the feature extract scripts are mismatched

Thank you for the excellent works! I am trying to extract features from my own dataset, but it seems that the model and config in the feature_extract readme are mismatched. I can't load them in the run_rcnn.py.

cd data
wget https://dl.fbaipublicfiles.com/vilbert-multi-task/detectron_model.pth
wget https://dl.fbaipublicfiles.com/vilbert-multi-task/detectron_config.yaml

The 'rcnn' module required in run_rcnn.py is missing

Iโ€˜m trying to extract rcnn features by myself using provided 'run_rcnn.py' script, however the missing module 'rcnn' is required in line 38.

from rcnn.dataset import get_dataloader

So where can I find the get_dataloader function ?
THX

Cannot find train/valid/test.src.jsonl

Hi. When I try to reproduce the part of mmi. I could not find train/valid/test.src.jsonl, so i could not reproduce it. Could you please tell me where it is ? Thanks a lot

About the episode

Hi,
What is the episode stand for in your dataset paper? I can't find any instruction about this.
Thank you a lot.

the dataset

hi, how did you separate these subs? by time ? turn? or random? thx

Source of data

Could you provide the names of the TV series or/and the movies being used to construct the dataset?
thx.

About the training set 10.zip

We note that the provided training set 10.zip contains only 4 images, is this correct? Because we found that the total training set is less than 170G.

object label

Hi, I failed to find the label of each object in FV, I want to know if they have been saved or need to be extracted. Thanks a lot.

OpenViDial 2.0 download unstable

As most of the compressed files of OpenViDial 2.0 are more than 100GB, could you please split them into smaller ones for a better downloading stability?

result

I run your code of NV and CV model. The BLEU-4 is 1.21 and 1.22 respectively.
Then I use grep ^D gen.out | cut -f3- > sys.txt to get the sys.txt.
But the performance is poor.

=====Stats of /deepo_data/sys_NV.txt=====
Diversity-1: 0.0028171826554375134
Diversity-2: 0.012234149152032867
Diversity-3: 0.02608896729461698
Diversity-4: 0.04205556064912441
StopWords%: 0.5369782208034367; StopWords/Sent: 3.8842692900782727
AvgLength: 7.233569518455623
=====Stats of /deepo_data/sys_CV.txt=====
Diversity-1: 0.0029348115275008298
Diversity-2: 0.012985147977712393
Diversity-3: 0.027433267080460996
Diversity-4: 0.04410504609184471
StopWords%: 0.5485848448526971; StopWords/Sent: 3.9040424742831488
AvgLength: 7.116570045480276

The stopwords seem normal. But the diversity performances pool. The line of sys_NV and sys_CV are both 51231.
sys_CV.txt

problem when run FV model

2021-03-20 12:27:30 | INFO | fairseq.utils | CUDA enviroments for all 4 workers
2021-03-20 12:27:30 | INFO | fairseq_cli.train | training on 4 devices (GPUs/TPUs)
2021-03-20 12:27:30 | INFO | fairseq_cli.train | max tokens per GPU = 8000 and max sentences per GPU = 32
2021-03-20 12:27:30 | INFO | fairseq.trainer | no existing checkpoint found train_logs/reproduce_img_object/layer3_lr2e-4_bsz128_drop0.3_warmup6000/checkpoint_last.pt
2021-03-20 12:27:30 | INFO | fairseq.trainer | loading train data for epoch 1
2021-03-20 12:27:30 | INFO | video_dialogue_model.data.object_dataset | find minimum truncate of preprocessed_data_dir-train: 0
2021-03-20 12:38:39 | INFO | fairseq.data.data_utils | loaded 974803 examples from: preprocessed_data_dir/train
2021-03-20 12:39:10 | INFO | fairseq.trainer | NOTE: your device may support faster training with --fp16
epoch 001: 0%| | 0/7045 [00:00<?, ?it/s]2021-03-20 12:39:10 | INFO | fairseq.trainer | begin training epoch 1
Traceback (most recent call last):
File "/usr/local/bin/fairseq-train", line 8, in
sys.exit(cli_main())
File "/usr/local/lib/python3.6/dist-packages/fairseq_cli/train.py", line 352, in cli_main
distributed_utils.call_main(args, main)
File "/usr/local/lib/python3.6/dist-packages/fairseq/distributed_utils.py", line 286, in call_main
nprocs=args.distributed_num_procs,
File "/usr/local/lib/python3.6/dist-packages/torch/multiprocessing/spawn.py", line 200, in spawn
return start_processes(fn, args, nprocs, join, daemon, start_method='spawn')
File "/usr/local/lib/python3.6/dist-packages/torch/multiprocessing/spawn.py", line 158, in start_processes
while not context.join():
File "/usr/local/lib/python3.6/dist-packages/torch/multiprocessing/spawn.py", line 108, in join
(error_index, name)
Exception: process 1 terminated with signal SIGKILL

This seems to be a problem caused by insufficient memory. The memory of my computer is 200G.
How much memory does the FV model need? Or is it caused by other reasons?

result

Hi,
I run your code (text_only). I get the gen.out file. The result seems that (Last line in gen.out):
Generate test with beam=5: BLEU4 = 1.21, 15.6/1.4/0.5/0.2 (BP=0.953, ratio=0.954, syslen=370561, reflen=388568)

The results are correct? What means that results? Do the results include BLEU-1, BLEU-2, BLEU-4, Dis-1, Dis-2, Dis-3, and Dis-4?

problem when run FineVisual model

There is a bug as follows:
mm = mmap.mmap(fid.fileno(), bytes, access=acc, offset=start)
ValueError: mmap length is greater than file size

The size of object features files:
149G train.objects.mmap
11G train.objects.mmap.splitaa
11G train.objects.mmap.splitab
11G train.objects.mmap.splitac
11G train.objects.mmap.splitad
11G train.objects.mmap.splitae
11G train.objects.mmap.splitaf
11G train.objects.mmap.splitag
11G train.objects.mmap.splitah
11G train.objects.mmap.splitai
11G train.objects.mmap.splitaj
11G train.objects.mmap.splitak
11G train.objects.mmap.splital
11G train.objects.mmap.splitam
2.1G train.objects.mmap.splitan
17G train.objects.mmap.splitao

How to preprocess data on my own?

The script provided to preprocess text data only can binarize the .txt files into .bin files.
How to get files like train.sent_num.npy exactly?
THX

Results

Hi,
I run your code (FV). I get the gen.out file. The result seems that (Last line in gen.out):
Generate test with beam=5: BLEU4 = 0.40, 6.2/0.5/0.1/0.1 (BP=1.000, ratio=2.087, syslen=810940, reflen=388568)
Diversity-1: 0.0006535270732838133
Diversity-2: 0.0031576051690483616
Diversity-3: 0.006970750791777929
Diversity-4: 0.010780433006226303
StopWords%: 0.3479316484665542; StopWords/Sent: 5.507739454627082
AvgLength: 15.829946711951749
Are the results correct?

ValueError: cannot mmap an empty file

When I want to view the shape of train.features.mmap, numpy reports an error. How can I solve this problem

By the way, can I directly use the mmap file (such as train/valid/test. features.mmap) as the video feature, for example, save it as an .npy file for multimodal training

thank you

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.