
cnn_depth_tensorflow's People

Contributors

bkainz, masazi


cnn_depth_tensorflow's Issues

About "logits = model.inference_refine(images, coarse, keep_conv, keep_hidden)"

if REFINE_TRAIN:
    print("refine train.")
    coarse = model.inference(images, keep_conv, trainable=False)
    logits = model.inference_refine(images, coarse, keep_conv, keep_hidden)
else:
    print("coarse train.")
    logits = model.inference(images, keep_conv, keep_hidden)

In this code, why are keep_conv and keep_hidden used in inference_refine and inference?
Why can reuse and trainable be replaced by keep_conv and keep_hidden?
And why keep_conv = 0.8 and keep_hidden = 0.5?
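For what it's worth, keep_conv and keep_hidden look like dropout keep probabilities (0.8 for the convolutional layers, 0.5 for the fully connected ones), fed in at training time and set to 1.0 at evaluation. A minimal NumPy sketch of inverted dropout, assuming that interpretation:

```python
import numpy as np

def dropout(x, keep_prob, rng):
    # Inverted dropout: zero each unit with probability (1 - keep_prob)
    # and scale survivors by 1/keep_prob so the expected activation
    # is unchanged between training and evaluation.
    mask = rng.random(x.shape) < keep_prob
    return x * mask / keep_prob

rng = np.random.default_rng(0)
x = np.ones((1000,))
y = dropout(x, 0.8, rng)  # keep_conv = 0.8: roughly 80% of units survive
```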

thanks,

Can not train the model

Hi,

I cannot train the model with this code. I checked issues #1 and #3 and fixed them in the code.
However, the model still does not train and I don't know why.

Has anyone trained the model successfully with this code?

Finally, thank you for providing readable code.

How to train

Hi,

First of all, thanks for this really clean and easy to read code. I tried to train the network using task.py but I have several questions:

  • Are the pictures output in the "predict_refine_..." folders really validation pictures? I tested my trained net on other unseen pictures and the results are not as good (actually they are pretty bad).

  • If I got it right (also checking issue #1), the script trains everything together, while in the paper the coarse network is trained first before training the fine network (with the coarse predictions frozen). Is that a big issue?

  • I haven't seen the data-augmentation part (section 3.4 in the paper) in the code; will its absence impact the results?

  • Finally, in task.py MAX_STEPS is set to 10000000, but I couldn't run that many epochs (I got to 4140). How many are needed to get good results?

Best,

Clement

About how many epochs are needed for training?

Thank you for your code.
I am testing your code, but the training loss doesn't decrease.
Perhaps my setup is wrong (I changed a few lines of code to work with TensorFlow v1.0).
Roughly how many epochs does this training need?
In my run, the training loss stays around 1600〜2500 in the first 0〜10 epochs.
Could you give me some advice?

In TensorFlow v1.0 the APIs below were renamed, so the original calls no longer work:

tf.mul() -> tf.multiply()
tf.sub() -> tf.subtract()
tf.concat(3, [fine1_dropout, coarse7_output]) -> tf.concat([fine1_dropout, coarse7_output], 3)

About the label(depth)

Hi
I'm reading your code. I have a few questions regarding the depth map.
In the mat file of the dataset, the depth is in metres and ranges from 0 to 10. When you convert the distances to pixel values in convert_mat_to_img.py, you normalize each depth image by its own maximum depth value and multiply the normalized distance by 255. You then train the model with labels being the png pixel values divided by 255, which is not the distance but the per-image normalized distance. Therefore, the model's output regresses not on the true distances but on the normalized ones. Shouldn't it regress on the true distance?
I think you should normalize the distance by 10, the maximum depth (I verified in Python that the maximum depth is 9.99547 metres in the NYU Depth v2 dataset). Then the png image can be converted back to a true depth value in metres for the labels.
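A minimal sketch of that fixed-maximum normalization, assuming a 10 m cap for NYU Depth v2 (the function names are hypothetical, not from the repo):

```python
import numpy as np

MAX_DEPTH_M = 10.0  # assumed dataset maximum depth in metres

def depth_to_png(depth_m):
    # metres -> 8-bit pixel values, normalized by a fixed maximum
    # so every image uses the same scale
    return np.clip(depth_m / MAX_DEPTH_M * 255.0, 0, 255).astype(np.uint8)

def png_to_depth(pixels):
    # 8-bit pixel values -> metres (inverse of the mapping above)
    return pixels.astype(np.float32) / 255.0 * MAX_DEPTH_M

d = np.array([0.0, 5.0, 9.99547])
round_trip = png_to_depth(depth_to_png(d))  # recovers d up to 8-bit quantization
```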

Meanwhile, is invalid_depth needed in the code? From my understanding it indicates the sign of the depth, but can depth values be negative?

By the way, for the scale-invariant loss, is the 0.5 in the following code unnecessary?
cost = tf.reduce_mean(sum_square_d / 55.0*74.0 - 0.5*sqare_sum_d / math.pow(55*74, 2))
There is no 0.5 in formula (3) of the paper.

Is my understanding right?

while training...

Where is the coarse data during training? The coarse folder is empty.

train.csv no such file exists

In prepare_data.py, line 40:

if not os.path.exists('train.csv'):
    os.remove('train.csv')

Is this correct? When I ran the program for the first time, it told me no such file 'train.csv' exists.
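For reference, a sketch of the likely intended logic: the condition appears inverted, since the original removes the file only when it does *not* exist (which raises FileNotFoundError). The helper name here is hypothetical:

```python
import os

def reset_csv(path):
    # Remove a stale CSV only if it is actually present; with the
    # original inverted check, os.remove() is called on a missing
    # file and raises FileNotFoundError on a fresh run.
    if os.path.exists(path):
        os.remove(path)

reset_csv('train.csv')  # safe on the first run, when no file exists yet
```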

Missing parenthesis in loss

Hi again,

In the file model.py at line 46, shouldn't:
cost = tf.reduce_mean(sum_square_d / 55.0*74.0 - 0.5*sqare_sum_d / math.pow(55*74, 2))

be replaced by
cost = tf.reduce_mean(sum_square_d / (55.0*74.0) - 0.5*sqare_sum_d / math.pow(55*74, 2))

to match equation (4) from the paper? (I think you just forgot the parentheses.)
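A quick arithmetic check of the precedence issue, with a stand-in value for sum_square_d:

```python
# Operator precedence: `a / b*c` parses as (a / b) * c, so the original
# line divides by 55.0 and then MULTIPLIES by 74.0 instead of dividing
# by the pixel count 55*74.
sum_square_d = 8.0  # arbitrary illustrative value

wrong = sum_square_d / 55.0 * 74.0    # (8 / 55) * 74
right = sum_square_d / (55.0 * 74.0)  # 8 / 4070
ratio = wrong / right                 # first term is off by a factor of 74**2
```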

Best,

Clement

The output images are all black

My output images are all black. I have fixed the problem in issue #3, but I don't know how to deal with issue #1; will it influence the result?
Any suggestions would be appreciated.

Trainable variables and learning rate for training

According to the paper, we need to train the coarse layers first, then fix them and train the refine part. In this code, I think we only need to set the flag REFINE_TRAIN to False for the first step and then REFINE_TRAIN=True for the second step, right?

However, after I set REFINE_TRAIN=True, I found all variables from the coarse network were still trainable. I think it is because the trainable flag in the function '_variable_on_gpu' in the model_part.py file is ignored.

Another problem is the learning rate. According to the paper, the learning rates are 0.001 for coarse convolutional layers 1-5, 0.1 for coarse fully connected layers 6 and 7, 0.001 for fine layers 1 and 3, and 0.01 for fine layer 2. But the initial learning rate is set to 0.0001 for all layers in the code. With this learning rate, I cannot get a result comparable to the performance reported in the paper, even after training for more than two days. So I'm wondering: has anyone gotten a good result with this code, and how should the network be trained to obtain it?
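The trainable-flag symptom described above is consistent with a keyword argument that is accepted but never forwarded to the variable constructor. A minimal, non-TensorFlow sketch of that bug pattern (both function names are hypothetical):

```python
def variable_buggy(name, trainable=True):
    # Bug pattern: `trainable` is accepted in the signature but never
    # used, so every variable ends up registered as trainable.
    return {"name": name, "trainable": True}

def variable_fixed(name, trainable=True):
    # Fix: forward the flag to wherever the variable is actually created
    # (in TensorFlow, the `trainable=` argument of the variable constructor).
    return {"name": name, "trainable": trainable}
```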

At last, thanks for providing such a clean and readable implementation :)

Error in running task.py

I get an error while running the task.py file:

variables_averages_op = variable_averages.apply(tf.trainable_variables())
  File "/anaconda3/lib/python3.6/site-packages/tensorflow/python/training/moving_averages.py", line 393, in apply
    var.name)
TypeError: The variables must be half, float, or double: Variable:0
