Giter Site home page Giter Site logo

Comments (12)

JunweiLiang avatar JunweiLiang commented on May 26, 2024 1

No, the idea of SimAug is to train with multi-view samples.

from multiverse.

JunweiLiang avatar JunweiLiang commented on May 26, 2024

removing self.train_op would result in not training anything (that is the optimization step).
Base on some Google searches (like this), it may be because there some other packages in the environment that causes this. Could you try again with a clean environment with just python3.6 (I believe at the time this is my python version) + tf 1.15.4?

from multiverse.

108618026 avatar 108618026 commented on May 26, 2024

Hello! @JunweiLiang Thank you for your great work.

I tried to train the SimAug model with argoverse validation data(follow the preprocess rules of Multiverse/SimAug/PREPRO.md) while I face the following error!

Traceback (most recent call last):
File "code/train.py", line 323, in
main(arguments)
File "code/train.py", line 296, in main
trainer.step(sess, batch)
File "C:\Users\asd1565\Desktop\tempt\Multiverse\SimAug\code\pred_models.py", line 2056, in step
outputs = sess.run(inputs, feed_dict=feed_dict)
File "C:\Users\asd1565\miniconda3\envs\SimAug\lib\site-packages\tensorflow_core\python\client\session.py", line 956, in run
run_metadata_ptr)
File "C:\Users\asd1565\miniconda3\envs\SimAug\lib\site-packages\tensorflow_core\python\client\session.py", line 1156, in _run
(np_val.shape, subfeed_t.name, str(subfeed_t.get_shape())))
ValueError: Cannot feed value of shape (12, 8) for Tensor 'obs_scene_extra:0', which has shape '(12, 1, ?)'

The miniconda environment I used to execute your training code include:
python=3.6
tensorflow-gpu=1.15.4
cuda-toolkit =10.1
cudnn = 7.6.5
and requirements that your mention in readme!

Could you give me a hint to handle this error?
Thanks in advanced!

from multiverse.

JunweiLiang avatar JunweiLiang commented on May 26, 2024

What is the command that you run? This means that you are feeding the wrong inputs to a variable.

from multiverse.

108618026 avatar 108618026 commented on May 26, 2024

The command I ran is the following.

python code/train.py argoverse_prepro packed_models/ jason_simaug_model --wd 0.001 --runId 0 --obs_len 8 --pred_len 12 --emb_size 32 --enc_hidden_size 256 --dec_hidden_size 256 --activation_func tanh --keep_prob 1.0 --num_epochs 30 --batch_size 12 --init_lr 0.3 --use_gnn --learning_rate_decay 0.95 --num_epoch_per_decay 8.0 --grid_loss_weight 1.0 --grid_reg_loss_weight 0.5 --save_period 3000 --scene_h 36 --scene_w 64 --scene_conv_kernel 3 --scene_conv_dim 64 --scene_grid_strides 2,4 --use_grids 1,0 --val_grid_num 0 --train_w_onehot --adv_epsilon 0.1 --mixup_alpha 0.2 --multiview_train --multiview_exp 3 --gpuid 0

Thanks for your reply!

from multiverse.

JunweiLiang avatar JunweiLiang commented on May 26, 2024

What about the preprocessing logs? Did you see any errors during that process? Especially when getting scene features.

from multiverse.

108618026 avatar 108618026 commented on May 26, 2024

I use the argoverse validation data to train the model. The data follow the command you provide in SimAug/TESTING.md.
$ wget https://next.cs.cmu.edu/data/packed_prepro_eccv2020.tgz
$ tar -zxvf packed_prepro_eccv2020.tgz

Should I comment this? --multiview_train --multiview_exp 3
Because I only use one viewpoint which is ring_front_center.
Appreciate your reply.

from multiverse.

108618026 avatar 108618026 commented on May 26, 2024

Excuse me! @JunweiLiang
If I only use single-view dataset, Could I use SimAug model for training?

Thanks in advanced!

from multiverse.

HRHLALALA avatar HRHLALALA commented on May 26, 2024

Hi, I am facing the same error.
image

I am using tensorflow-gpu==1.15.4. I tried different machines (RTX3090, V100 32G) but none of them works. I can't upgrade it because it will take my version to tf2.
image

I am running this command:

python code/train.py actev_preprocess multiverse-models new_train/ --wd 0.001 --runId 0 --obs_len 8 --pred_len 12 --emb_size 32 --enc_hidden_size 256 --dec_hidden_size 256 --activation_func tanh --keep_prob 1.0 --num_epochs 80 --batch_size 20 --init_lr 0.3 --use_gnn --use_scene --learning_rate_decay 0.95 --num_epoch_per_decay 2.0 --grid_loss_weight 1.0 --grid_reg_loss_weight 0.2 --save_period 2000 --scene_h 36 --scene_w 64 --scene_conv_kernel 3 --scene_conv_dim 64 --scene_grid_strides 2,4 --use_grids 1,1 --val_grid_num 0 --train_w_onehot --gpuid 0

Can you help?

from multiverse.

JunweiLiang avatar JunweiLiang commented on May 26, 2024

@ziyan0302 @HRHLALALA I'll install an environment to debug this weekend

from multiverse.

HRHLALALA avatar HRHLALALA commented on May 26, 2024

@ziyan0302 @HRHLALALA I'll install an environment to debug this weekend

Hi, I figure out why it happened now. The code do not run on GPU if we simply install the library using ‘pip install tensorflow-gpu==1.15.4’. We still need to install cudnn and cudatoolkit using conda. For RTX30 series which use cu11, we need to install nvidia-tensorflow. Sorry that we are still not familiar with tensorflow.

from multiverse.

JunweiLiang avatar JunweiLiang commented on May 26, 2024

@ziyan0302 could you confirm that? BTW, you can run [tf.test.is_gpu_available](https://www.tensorflow.org/api_docs/python/tf/test/is_gpu_available) to check

from multiverse.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.