yu-li / tcmonodepth

Enforcing Temporal Consistency in Video Depth Estimation, ICCV-W 2021.

License: MIT License

tcmonodepth's Introduction

TCMonoDepth: Enforcing Temporal Consistency in Video Depth Estimation

TCMonoDepth is a method for stable depth estimation for any video.

Paper

Usage

Requirements

Testing

You can download our pretrained checkpoint from link (Google Drive) or link (Baidu Cloud, extraction code: w2kr) and save it in the ./weights folder. Put your video into the videos folder and run

cd TCMonoDepth
python demo.py --model large --resume ./weights/_ckpt.pt.tar --input ./videos --output ./output --resize_size 384

A small MonoDepth model for mobile devices

A lightweight and very fast monodepth model

cd TCMonoDepth
python demo.py --model small --resume ./weights/_ckpt_small.pt.tar --input ./videos --output ./output --resize_size 256
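
The two commands above differ only in the model name, the checkpoint path, and the resize size. A minimal sketch of a wrapper that assembles either command, using only the flags shown above; the helper name build_demo_command and the PRESETS table are hypothetical and not part of the repository:

```python
import sys

# Flag values copied from the two commands above; nothing else is assumed
# about demo.py's interface.
PRESETS = {
    "large": {"resume": "./weights/_ckpt.pt.tar", "resize_size": 384},
    "small": {"resume": "./weights/_ckpt_small.pt.tar", "resize_size": 256},
}


def build_demo_command(model, input_dir="./videos", output_dir="./output"):
    """Return the demo.py argument list for the given model preset."""
    if model not in PRESETS:
        raise ValueError(f"unknown model {model!r}; expected one of {sorted(PRESETS)}")
    preset = PRESETS[model]
    return [
        sys.executable, "demo.py",
        "--model", model,
        "--resume", preset["resume"],
        "--input", input_dir,
        "--output", output_dir,
        "--resize_size", str(preset["resize_size"]),
    ]
```

The returned list can be passed to subprocess.run(cmd, cwd="TCMonoDepth", check=True), assuming the repository was cloned into a folder of that name.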

Bibtex

If you use this code for your research, please consider starring this repo and citing our paper.

@inproceedings{li2021enforcing,
 title={Enforcing Temporal Consistency in Video Depth Estimation},
 author={Li, Siyuan and Luo, Yue and Zhu, Ye and Zhao, Xun and Li, Yu and Shan, Ying},
 booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops},
 year={2021}
}

Acknowledgement

Parts of the code in this project are adapted from MiDaS. We thank the authors for sharing the code for their great work.

tcmonodepth's Issues

Request for Implementation details

Could you please elaborate on what exactly your SILoss is? Your reference, MiDaS, has several SI losses. Which one exactly do you use?

Could you please also give other implementation details, including hyperparameters such as σ (used in your soft occlusion mask), λ (the weight of the temporal consistency term), and δ (the threshold for color differences)?

I would also be interested to know whether you normalize or scale your depth predictions by any scaling factor at training or evaluation.

Thank you

Temporal filtering

Hi, is the temporal filtering done on the pretrained model itself or in demo.py? I thought Midasnet is not temporally consistent. Would you mind pointing out how you did the temporal filtering?

Created a Google Colab notebook

Hi Yu-li,

In case you're interested, I forked the repo and created a Google Colab notebook that processes a single .mp4 input file:

https://github.com/dpredie/TCMonoDepth/blob/main/TCMonoDepth.ipynb

Share paper

Hi, thank you for sharing. Great work. I haven't been able to find the paper. Could you share it with me? [email protected] Thanks.

Metric of Temporal Consistency

Hi, thank you for sharing. Great work. I have read your paper 'Enforcing Temporal Consistency in Video Depth Estimation' and have a question: what is an appropriate value for the threshold 'thr' defined in formula (2) of the paper? Could you give me some hints? Thanks!

Assertion Error in Colab

I am not sure whether this is currently a torch issue, but would anybody happen to know how to fix this?

/content/TCMonoDepth
Run Video Depth Sample
Initialize
Device: cuda
Creating model...
model size is 0.5x
Loading model from /content/TCMonoDepth/weights/_ckpt_small.pt.tar
Loading model done...
<VideoCapture 0x7f581ca43fd0>
Error opening video stream or file
Traceback (most recent call last):
File "processVideoFile.py", line 187, in <module>
run(args)
File "processVideoFile.py", line 155, in run
write_video(outputfile, color_list, fps)
File "processVideoFile.py", line 22, in write_video
assert (len(output_list) > 0)
AssertionError
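
The log prints "Error opening video stream or file" before the assertion fails, which suggests that no frames were read and output_list was empty. One way to surface that earlier is to check the input path before processing. A minimal sketch using only the standard library; require_video_file is a hypothetical helper, not a function in processVideoFile.py, and it does not verify that the file is a decodable video:

```python
from pathlib import Path


def require_video_file(path):
    """Return the resolved path, or raise if the file is missing or empty."""
    video = Path(path).expanduser().resolve()
    if not video.is_file():
        raise FileNotFoundError(f"input video not found: {video}")
    if video.stat().st_size == 0:
        raise ValueError(f"input video is empty: {video}")
    return video
```

Calling this on the input path before the video is opened turns a late AssertionError into an error message that names the offending path.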

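Can this be upgraded to v3_1?

This project is great. I'm looking forward to an update that supports v3_1, which gives more accurate results.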
https://github.com/isl-org/MiDaS/releases/tag/v3_1

Onnx model

Are there any plans to release an ONNX model?

TypeError: 'int' object is not subscriptable

Running
python demo.py --model large --resume ./weights/_ckpt.pt.tar --input ./videos --output ./output --resize_size 384
produces an error.
Output and stack trace:

F:\Depth_estimation\TCMonoDepth-main>python demo.py --model large --resume ./weights/_ckpt.pt.tar --input ./videos --output ./output --resize_size 384
Run Video Depth Sample
Initialize
Device: cuda
Creating model...
Loading model from ./weights/_ckpt.pt.tar
Loading model done...
Traceback (most recent call last):
  File "demo.py", line 148, in <module>
    run(args)
  File "demo.py", line 76, in run
    args.resize_size[0],  #width
TypeError: 'int' object is not subscriptable
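
The traceback indicates that demo.py indexes args.resize_size (args.resize_size[0] is commented as the width), while the command passes a single integer. One possible workaround, sketched here under the assumption that a single value should mean a square size, is to accept one or two integers and normalize them to a pair. The parser below is illustrative and is not the repository's actual argument definition; the (width, height) order is inferred from the #width comment in the traceback, and the default of 384 is taken from the command above:

```python
import argparse


def parse_resize_size(argv):
    """Parse --resize_size as one or two ints and return (width, height)."""
    parser = argparse.ArgumentParser()
    # nargs="+" always yields a list, so indexing works for one or two values.
    parser.add_argument("--resize_size", type=int, nargs="+", default=[384])
    args = parser.parse_args(argv)
    size = args.resize_size
    if len(size) == 1:
        return (size[0], size[0])  # assumption: one value means a square size
    if len(size) == 2:
        return (size[0], size[1])
    parser.error("--resize_size takes one or two integers")
```

With this normalization, both --resize_size 384 and --resize_size 384 256 give a pair that can be indexed with [0] and [1].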

paper link

Hi, good job! Please share the paper's download link, because I can't find the paper.

valid mask

Hi, thank you for sharing. Great work. After reading your paper, I have some questions.
(a) What is an appropriate value for 'σ' in M_i = exp(−σ · ||X_i − X_{i+1}||_2) (Eqn. 5)?
(b) Are the values of D_i and D̂_{i+1} in Eqn. 2 the direct output of the network, or the network output after normalization?
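
For reference, the mask in question (a) can be computed per pixel as below. This is a minimal NumPy sketch of the formula as quoted in this issue, not the authors' implementation: the value of sigma, the scaling of the frames, and whether the second frame should first be warped to the first are all assumptions that the issue leaves open.

```python
import numpy as np


def soft_mask(frame_i, frame_next, sigma=1.0):
    """M_i = exp(-sigma * ||X_i - X_{i+1}||_2), with the norm over the channel axis."""
    diff = frame_i.astype(np.float64) - frame_next.astype(np.float64)
    dist = np.linalg.norm(diff, axis=-1)  # per-pixel L2 distance, shape (H, W)
    return np.exp(-sigma * dist)
```

Pixels with identical colors in both frames get a mask value of 1, and the value decays toward 0 as the color difference grows; a larger sigma makes the decay steeper.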